Sayari Logo

Sayari

Senior Data Engineer

Posted 9 Days Ago
Remote
Hiring Remotely in United States
140K-160K Annually
Senior level
Remote
Hiring Remotely in United States
140K-160K Annually
Senior level
Build and maintain scalable ETL pipelines using Python, Spark, and Airflow; collaborate with AI/ML and Product teams to deliver AI-native data products; identify and resolve ETL bottlenecks; ensure code quality through reviews and tests; own sprint deliverables and contribute to roadmap planning and major epics.
The summary above was generated by AI
About Sayari: 

Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. The Sayari Commercial World Model resolves 11.7B+ primary-source records from 250+ jurisdictions forming the ground truth of global commerce. A Judgment Ontology, encoding over a decade of investigative tradecraft, and Superconductor, an agentic orchestration platform, deliver AI that reasons like an expert analyst, shows its work, and traces every finding to its source. Trusted by U.S. Customs and Border Protection, HM Revenue & Customs, and Fortune 500 enterprises, Sayari is used by thousands of professionals across 35+ countries to secure supply chains and dismantle illicit networks. Headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.

POSITION DESCRIPTION

As a Data Engineer at Sayari, you will be the engine behind the world’s most comprehensive commercial world model. You will join a high-autonomy team responsible for building and scaling the complex orchestration systems that transform billions of primary-source records into actionable intelligence. This is a role for a "builder" who respects the complexity of large-scale ETL and graph databases and is "PhD-curious" about the future of AI-native data products and modern orchestration.

JOB RESPONSIBILITIES
  • Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow to support our core data acquisition and entity resolution engines.
  • Collaborate cross-functionally with AI/ML and Product teams to implement new features and AI-native products.
  • Proactively identify and resolve bottlenecks in our complex ETL processes, bringing a fresh perspective to refine and optimize our existing codebase.
  • Contribute to a robust engineering culture through rigorous code reviews, unit testing, and clear communication of design decisions.
  • Own the end-to-end delivery of roadmap tasks within two-week sprints, ensuring work meets high standards for quality, documentation, and performance.
  • Participate in roadmap planning and story refinement, eventually taking ownership of major epics that drive our long-term product defensibility.
SKILLS & EXPERIENCE

Required

  • 5 or more years of production data engineering experience, with clear ownership of systems you built and operated end to end
  • Strong Python, with meaningful experience in a JVM language (Scala preferred) or willingness to ramp quickly
  • Hands-on Snowflake experience, or equivalent depth in BigQuery or Redshift with demonstrated ability to transfer
  • Experience deploying and operating AI or ML applications in production, including output validation, monitoring, and cost management at scale
  • Orchestration experience with Apache Airflow or a comparable workflow tool
  • Track record of operating production systems reliably, with comfort navigating failure, monitoring, and recovery

Preferred

  • Experience with Spark on Dataproc Serverless or other serverless Spark environments
  • Familiarity with Kubernetes for deployment
  • Experience with data quality tooling such as deequ, Great Expectations, or equivalent
  • GCP experience (BigQuery, Dataproc, Cloud Storage)
  • Experience leading or contributing to a data warehouse migration
  • Background in team mergers or migrating a team onto a new operating process

The target base salary for this position is $140,000-$160,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.


Benefits: 
  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities
 
Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.
Pay Range
$140,000$160,000 USD

Similar Jobs

Yesterday
In-Office or Remote
92K-164K Annually
Senior level
92K-164K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and operate scalable, secure cloud-based data platforms and pipelines across the full data engineering lifecycle. Instrument and monitor pipelines, optimize performance, troubleshoot production issues, reduce technical debt, drive cloud and open-source adoption, and maintain documentation and governance for federal and military healthcare data solutions.
Top Skills: AzureCi/CdDevOpsGoogle Cloud Platform (Gcp)OraclePostgresSQLSQL Server
3 Days Ago
In-Office or Remote
92K-164K Annually
Senior level
92K-164K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and optimize scalable ETL/data pipelines (SQL Server, Snowflake, Databricks) for large healthcare datasets. Support production cycles, monitor and resolve issues, perform root cause analysis, ensure data quality, conduct code reviews, estimate work, and partner with stakeholders to deliver reliable data solutions.
Top Skills: .NetAzureDatabricksOraclePythonSnowflakeSQLSQL ServerSsisTeradata
7 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
120K-201K Annually
Senior level
120K-201K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Design, build, and maintain scalable Spark-based ETL pipelines and computed tables in a central data lake. Integrate structured and unstructured IoT, sensor, and external data for analytics, model training, and dashboards. Collaborate with Data Science, Analytics, and ML teams to ensure reliable, high-quality customer-facing datasets.
Top Skills: AirflowAWSAzureDagsterData LakeDatabricksDelta LakeETLGCPGitGitPrefectPysparkPythonRest ApisSparksqlSQL

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account