RefinedScience Logo

RefinedScience

Data Engineering Intern

Posted 25 Days Ago
Easy Apply
Remote
Hiring Remotely in USA
Internship
Easy Apply
Remote
Hiring Remotely in USA
Internship
Seeking a Data Engineering Intern to assist in building data infrastructure and pipelines, supporting analytics and machine learning in healthcare and life sciences.
The summary above was generated by AI

Data Engineering Intern

At RefinedScience, our mission is to advance care by bringing together the best science, data and minds – disease by disease, patient by patient, cell by cell to discover pathways to life beyond disease.   

WHAT WE ARE LOOKING FOR

We are seeking a motivated Data Engineering Intern to join our team. This internship is open to undergraduate and graduate students who are interested in building data infrastructure that supports advanced analytics, data science, and AI-driven insights in healthcare and life sciences.

You will work closely with data scientists, bioinformaticians, and engineers to help design, build, and improve data pipelines and platforms that power RefinedScience’s research and analytics initiatives.

KEY ACTIVITIES

  • Assist in building and maintaining data pipelines for ingesting, transforming, and validating clinical, biological, and real-world data
  • Support integration of data from multiple sources (e.g., clinical data, analytics outputs, external datasets)
  • Help develop and optimize ETL/ELT workflows to ensure data quality and reliability
  • Collaborate with data science and bioinformatics teams to support analytics and machine learning workflows
  • Contribute to data modeling, documentation, and best practices for data infrastructure
  • Participate in code reviews, testing, and performance improvements
  • Participate in Quality Reviews and Troubleshooting
  • Communicate progress and findings to cross-functional teams

MUST HAVES

  • Currently enrolled in a Bachelor’s, Master’s, or Ph.D. program in Data Engineering, Computer Science, Data Science, Software Engineering, or a related field
  • Experience with Python and/or SQL through coursework, projects, or internships
  • Basic understanding of data pipelines, databases, and data transformation concepts
  • Familiarity with version control (e.g., Git)
  • Strong analytical thinking and problem-solving skills
  • Ability to learn quickly and work collaboratively in a team environment

NICE TO HAVE

  • Exposure to cloud platforms (AWS, GCP, or Azure)
  • Familiarity with data tools such as Airflow, dbt, Spark, or similar frameworks
  • Experience working with large or complex datasets
  • Interest in healthcare, life sciences, or applied AI

Duration:  8 – 10 Weeks

WHY YOU’LL LOVE REFINED SCIENCE 

Team + Values

At RefinedScience, we seamlessly integrate top-tier clinical and biological data with expert knowledge to provide unparalleled insights.  We maximize patient impact with these unique insights by optimizing clinical trial probability of success and time to actionable results. We work across biopharma and we are a trusted partner in achieving better results, faster – working together to unlock strategic advantage.

Our Values

  • Act with Purpose – We believe in rigor through deliberate and thoughtful actions
  • Be Curious – Curiosity is the spark that ignites innovation and growth
  • Take Ownership – True ownership leads to pride and commitment in the work we do
  • Invest in Relationships – Building strong connections is the foundation for effective collaboration and trust for long term success
  • Embrace Agility – We celebrate agile thinking, resilience, and adaptability


Top Skills

Airflow
AWS
Azure
Dbt
GCP
Python
Spark
SQL

Similar Jobs

Yesterday
Remote
USA
23-23 Hourly
Internship
23-23 Hourly
Internship
Financial Services
Collaborate with cross-functional teams to design, develop, test, and maintain data pipelines and data marts. Write unit tests, participate in code reviews, document code and designs, debug production issues, and support Agile sprint commitments while learning business domain and best practices.
Top Skills: Data WarehouseDatabase DesignLakehousePythonSQL
Yesterday
In-Office or Remote
Santa Ana, CA, USA
40-45 Hourly
Internship
40-45 Hourly
Internship
Insurance • Real Estate
Work on cloud modernization and ETL for production data systems—migrate pipelines from Azure to GCP, build and optimize ETL workflows, validate data quality, contribute to documentation, and collaborate on integration and code reviews.
Top Skills: Sql,Python,Pandas,Numpy,Scikit-Learn,Matplotlib,Seaborn,Informatica,Bigquery,Dataflow,Cloud Composer,Cloud Storage,Azure,Gcp,Github,Etl
4 Days Ago
Remote
US
27-27 Hourly
Internship
27-27 Hourly
Internship
News + Entertainment • Software
Software engineering intern role building data extraction, export, and delivery pipelines. Gather pipeline requirements, design and implement ETL/ELT processes, document pipeline architecture, and learn cloud, container, and data warehousing technologies under mentorship.
Top Skills: AWSC#Containerized ApplicationsData VisualizationData WarehousingDatabasesEtl/EltJavaPythonReporting

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account