Sayari Logo

Sayari

Principal Data Engineer

Posted 10 Days Ago
Easy Apply
Remote
Hiring Remotely in United States
200K-220K Annually
Senior level
Easy Apply
Remote
Hiring Remotely in United States
200K-220K Annually
Senior level
The Principal Data Engineer will lead the Data Resolution team, focusing on complex data challenges using Spark, system architecture, and mentorship, to optimize graph data pipelines.
The summary above was generated by AI
About Sayari: 

Sayari is a risk intelligence provider that equips the public and private sectors with immediate visibility into complex commercial relationships by delivering the largest commercially available collection of corporate and trade data from over 250 jurisdictions worldwide. Sayari's solutions enable risk resilience, mission-critical investigations, and better economic decisions. 

Headquartered in Washington, D.C., its solutions are trusted by Fortune 500 companies, financial institutions, and government agencies, and are used globally by thousands of users in over 35 countries. Funded by world-class investors, with a strategic $228 million investment by TPG Inc. (NASDAQ: TPG) in 2024, Sayari has been recognized by the Inc. 5000 and the Deloitte Technology Fast 500 as one of the fastest growing private companies in the United States and was featured as one of Inc.’s “Best Workplaces” for 2025.

POSITION DESCRIPTION

We are looking for a Principal Data Engineer to join our Data Resolution team and serve as a technical anchor for our most complex data challenges. In this role, you will be a "player-coach," spending the majority of your time (70%) hands-on with Spark and graph data logic while dedicating the remainder of your time to system architecture, design planning, and technical mentorship. You will be instrumental in evolving our graph build pipelines, optimizing our cloud footprint, and overseeing the long-term planning and execution of major data pipeline re-architectures. This is a high-impact role where your work directly powers the data products used by global systems defenders.


JOB RESPONSIBILITIES
  • Design and implement complex Spark data logic, focusing on performance optimization, data volume tuning, and robust execution.
  • Own the architectural design of graph build pipelines, ensuring they are scalable, automated, and highly resilient.
  • Plan and oversee the strategic re-architecture of data pipelines to meet evolving business needs and scale.
  • Optimize infrastructure-as-code and schema designs to reduce cloud costs and improve pipeline latency.
  • Act as a technical consultant for the team, fostering a collaborative and engineer-led approach to design decisions.
  • Support the development of the engineering team through code reviews, design docs, and architectural best practices.
  • Ensure the accuracy of mission-critical data outputs.
SKILLS & EXPERIENCE

Required Skills & Experience

  • 8+ years of experience in the big data space, with a proven track record of implementing large-scale features and leading process redesigns.
  • Expert-level mastery of Apache Spark for large-scale data processing.
  • Strong experience with orchestration tools (Airflow) and cloud computing environments.
  • Hands-on experience architecting and managing data flows into databases such as Elasticsearch, Memgraph, and Cassandra.
  • Demonstrated ability in system architecture, including Infrastructure as Code (IaC) and schema design.
  • A "builder" mindset with experience evolving and improving existing architectures to meet new scale requirements.

Preferred Skills & Experience

  • Experience working specifically with graph data or graph databases.
  • Prior experience with entity resolution or identity resolution systems.
  • Experience evaluating and selecting modern analytical database architectures.

The target base salary for this position is $200,000-$220,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.


Benefits: 
  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities
 
Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.
Pay Range
$200,000$220,000 USD

Top Skills

Airflow
Spark
Cassandra
Elasticsearch
Infrastructure As Code (Iac)
Memgraph

Similar Jobs

Yesterday
Remote or Hybrid
USA
170K-260K Annually
Senior level
170K-260K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Principal AI Data Engineer is responsible for designing and operating AI-driven data platforms, focusing on Snowflake and LLM integrations, while leading technical initiatives and mentoring teams.
Top Skills: AWSDbtGeminiGitlabGleanSalesforce AiSnowflakeTerraform
2 Days Ago
Remote
California, USA
148K-239K Annually
Senior level
148K-239K Annually
Senior level
Big Data • Cloud • Digital Media • Machine Learning • Mobile • Software • Industrial
Lead delivery of advanced data platform services, data architecture, and data products while managing a globally distributed engineering team, driving data strategy in GTM domains.
Top Skills: AWSDbtGoPythonSnowflakeSQL
3 Days Ago
Remote
United States
155K-184K Annually
Senior level
155K-184K Annually
Senior level
Healthtech • Software
The Principal Data Engineer will lead the design of data architectures and analytics applications, collaborating with teams to build scalable platforms and improve data quality, while mentoring engineers and driving engineering best practices.
Top Skills: AirflowAWSAws GlueAzureDbtDmsDockerGCPKubernetesLookerPower BIPythonSnowflakeSQLTableau

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account