MDCalc Logo

MDCalc

Senior Data Engineer

Posted Yesterday
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
Design, build, and maintain scalable ETL/ELT data pipelines and data platform architecture. Build programmatic pipelines (primarily Python), optimize analytical data models (SQL), integrate sources into Snowflake, use orchestration/transformation tools (dbt, Airflow, Dagster), improve data quality/observability, and partner with product, engineering, and analytics to deliver reliable data for decision-making.
The summary above was generated by AI
The Opportunity

Since 2005, MDCalc has been an essential part of the clinician’s workflow to help achieve better patient outcomes. Actively used by more than 65% of physicians worldwide, MDCalc is the most broadly used medical reference – at the point-of-care – for clinical decision tools and content, and one of only four references used by >50% of US HCPs. These evidence-based tools and content are used by millions of medical professionals globally and support 50+ specialties and cover 200+ patient conditions.

To continue accelerating this growth, we are expanding the Engineering team with a Senior Data Engineer who will help build and scale the data infrastructure that powers decision-making across the company. This is an opportunity for an experienced data engineer who enjoys working close to product and business teams, building reliable data systems, and transforming complex data into actionable insights.

This role will help define how data moves through MDCalc’s platform, designing the pipelines and architecture that enable reliable analytics, product insights, and data-driven decision making across the organization.

The Role

As a Senior Data Engineer at MDCalc, you will design, build, and maintain the data pipelines and infrastructure that support analytics, product insights, and operational decision-making across the company. A key part of this role is managing how data moves across systems, shaping and transforming it through robust ETL/ELT pipelines so it can be reliably used by downstream analytics, product, and business applications.

You will work closely with product, engineering, and business stakeholders to ensure data is reliable, accessible, and structured for effective use. This includes building programmatic data pipelines, primarily in Python, to extract, transform, and deliver data across MDCalc’s systems and data platform.

You will also contribute to the architecture of MDCalc’s data platform, helping define how data is structured and delivered across the organization. As a senior individual contributor, you will help establish best practices for data modeling, pipeline development, and data governance.

The responsibilities of this individual include the following, but are not limited to:

  • Design, build, and maintain scalable data pipelines and ELT/ETL workflows that support analytics, operational reporting, and business intelligence use cases

  • Build programmatic data pipelines (primarily in Python) that extract data from application and third-party systems, transform it into usable formats, and deliver it to downstream data platforms and consumers

  • Own and improve core data models and transformations to ensure data is accurate, well-structured, and easy for stakeholders to use

  • Partner with Product, Engineering, and Analytics teams to understand data needs and translate them into reliable data solutions

  • Develop and maintain systems that move data across the platform, ensuring it is properly shaped, structured, and available for downstream analysis and product use cases

  • Help shape and maintain the architecture of MDCalc’s modern data stack, including warehousing, orchestration, transformation, and monitoring

  • Improve data quality, observability, and reliability through testing, validation, and proactive monitoring practices

  • Support the ingestion and integration of data from a variety of application, product, and third-party sources

  • Establish and reinforce best practices around data governance, documentation, naming conventions, and maintainability

  • Identify and drive opportunities to improve performance, scalability, and efficiency across our data systems

  • Design efficient data workflows that query, transform, and deliver datasets to downstream systems and stakeholders

  • Contribute to technical direction and architectural decisions as a senior member of the team

  • Serve as a thought partner to teammates and cross-functional stakeholders on how to best leverage data across the business

Your Background
  • 5+ years experience in data engineering

  • Strong SQL skills and experience building and optimizing data models for analytical use cases

  • Experience building and maintaining reliable data pipelines in a modern cloud data environment

  • Strong proficiency in Python or a comparable programming language commonly used in data engineering

  • Experience building programmatic ETL/ELT pipelines using Python or similar tools to move and transform data across systems

  • Experience working with data warehouses such as Snowflake

  • Experience with transformation and orchestration tools such as dbt, Airflow, Dagster, or similar tools

  • Strong understanding of data architecture, data modeling, and pipeline design best practices

  • Ability to operate independently, prioritize effectively, and drive work forward in a fast-moving environment

What MDCalc offers:
  • Ability to make a true difference in medicine: MDCalc is the most broadly used medical reference used by 65% of physicians worldwide.

  • Medical, Dental, & Vision coverage, with option to extend to your dependents

  • Company-sponsored short-term insurance

  • Fully-paid 8 week parental leave, after 6 months of employment

  • Company-sponsored 401k, after 3 months of employment

  • Unlimited vacation for salaried roles - we trust you to take the time you need

  • Tri-annual company offsites to connect, reflect, and plan together

  • Work from home monthly stipend

  • Hybrid work environment with a great team office in Greenwich Village, NYC

  • A culture of fun and motivated team members who believe in a greater mission here at

Similar Jobs

Yesterday
Remote or Hybrid
US
135K-155K Annually
Senior level
135K-155K Annually
Senior level
Professional Services • Software
Lead architecture and buildout of a new graph-backed enterprise data platform: design ingestion, graph and relational storage, entity resolution pipelines, temporal models, ETL/ELT pipelines, governance, APIs, and production connectors. Ship scalable graph data models, traversal queries, and platform roadmap while enabling observability, security, and containerized deployments.
Top Skills: AirflowAzureCypherDagsterDbtDockerGremlinHelmJavaKubernetesPythonSalesforceServicenowSparqlSQL
2 Days Ago
In-Office or Remote
92K-164K Annually
Senior level
92K-164K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and maintain enterprise ETL and data transformation pipelines to support Medicaid analytics and federal reporting. Optimize data processing with Python, Spark/Databricks, and relational platforms; ensure data validation, reconciliation, auditability, and production support. Collaborate across architects, analysts, QA, and BI teams during cloud migration and modernization efforts.
Top Skills: Azure Data FactoryAzure DevopsBashCi/CdDatabricksGitInformatica PowercenterOraclePowershellPythonRest ApiSnowflakeSparkSQLSQL ServerTeradata
4 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
186K-222K Annually
Senior level
186K-222K Annually
Senior level
eCommerce • Healthtech • Kids + Family • Retail • Social Media
Design and scale data pipelines and ML/LLM systems, build agentic automation for pipeline generation and maintenance, improve data monitoring, and collaborate with analysts, product, and ML teams to deliver reliable end-to-end data and AI infrastructure for a high-growth e-commerce platform.
Top Skills: AirflowAws Ec2Aws EksAws LambdaAws S3DbtLlmsMcp ServersMl PipelinesPythonRagSnowflake

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account