Thoughtful AI

Staff Data Engineer

Reposted 11 Days Ago
In-Office or Remote
2 Locations
190K-250K Annually
Senior level
Design, build, and maintain data pipelines, optimize performance and cost-efficiency, extend capabilities, and collaborate across teams to improve data governance and quality.
The summary above was generated by AI

Join Our Mission to Revolutionize Healthcare

Thoughtful is pioneering a new approach to automation for all healthcare providers! Our AI-powered Revenue Cycle Automation platform enables the healthcare industry to automate and improve its core business operations.

We're looking for Staff Data Engineers to help scale and strengthen our data platform.

Our data stack today consists of Aurora RDS, AWS Glue, Apache Iceberg, S3 (Parquet), Spark and Athena - supporting a range of use cases from operational reporting to downstream services. We’re looking to grow the team with engineers who can help improve performance, increase reliability, and expand the platform's capabilities as our data volume and complexity continue to grow.

You’ll work closely with other engineers to evolve our existing pipelines, improve observability and data quality, and enable faster, more flexible access to data across the company. The platform is deployed on AWS using OpenTofu, and we’re looking for engineers who bring strong cloud infrastructure fundamentals alongside deep experience in data engineering.

Your Role:

  • Build: Develop and maintain data pipelines and transformations across the stack, from ingesting transactional data into the data lakehouse to refining it through the layers of the medallion architecture.
  • Optimize: Tune performance, storage layout, and cost-efficiency across our data storage and query engines.
  • Extend: Help design and implement new data ingestion patterns and improve platform observability and reliability.
  • Collaborate: Partner with engineering, product, and operations teams to deliver well-structured, trustworthy data for diverse use cases.
  • Contribute: Help establish and evolve best practices for our data infrastructure, from pipeline design to OpenTofu-managed resource provisioning.
  • Secure: Help design and implement a data governance strategy to secure our data lakehouse.

Your Qualifications:

  • 8-10+ years of experience building and maintaining data pipelines in production environments
  • Strong knowledge of the data lakehouse ecosystem, with an emphasis on AWS data services - particularly Glue, S3, Athena/Trino/PrestoDB, and Aurora
  • Proficiency in Python, Spark and Athena/Trino/PrestoDB for data transformation and orchestration
  • Experience managing infrastructure with OpenTofu/Terraform or other Infrastructure-as-Code tools
  • Solid understanding of data modeling, partitioning strategies, schema evolution, and performance tuning
  • Comfortable working with cloud-native data pipelines and batch processing (streaming experience is a plus but not required)

What Sets You Apart:

  • Systems thinker - you understand the tradeoffs in data architecture and design for long-term stability and clarity
  • Outcome-driven - you focus on building useful, maintainable systems that serve real business needs
  • Strong collaborator - you're comfortable working across teams and surfacing data requirements early
  • Practical and hands-on - able to dive into logs, schemas, and IAM policies when needed
  • Thoughtful contributor - committed to improving code quality, developer experience, and documentation across the board

Why Thoughtful?

  • Competitive compensation
  • Health benefits: Comprehensive medical, dental, and vision insurance.
  • Time off: Generous leave policies and paid company holidays.
California Salary Range
$190,000-$250,000 USD

Top Skills

Apache Iceberg
Athena
Aurora RDS
AWS Glue
OpenTofu
Parquet
S3
Spark

Thoughtful AI Austin, Texas, USA Office

823 Congress Ave, Suite 300, Austin, Texas, United States, 78701

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. Its food and music scene, low taxes and favorable climate have made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center
