Reddit

Staff Machine Learning Engineer, ML Platform

Reposted 14 Days Ago
Remote or Hybrid
Hiring Remotely in United States
230K-322K Annually
Senior level
Lead development of a large-scale machine learning platform, optimizing model training, data processing, and collaborating on MLOps, while ensuring scalability and performance.
The summary above was generated by AI
Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 116 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com.

Who We Are:
The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure powering recommendations, content discovery, and user and content quantification, while directly impacting teams such as Growth, Ads, Feeds, and Core Machine Learning.

What You’ll Do:
As a Staff ML Infrastructure Engineer, you will lead development of a platform for large-scale ML models at Reddit.

  • Design end-to-end model lifecycle patterns (MLOps) to boost velocity of development for ML engineers, including data preparation, model management, experiment tracking, and more
  • Zero-to-one development and support of a graph ML codebase and platform that abstracts away common patterns and enables greater model scalability and iteration
  • Collaborate with ML engineers on performance tuning, including improving model training time, efficiency, and GPU training costs in a large, distributed ML training environment
  • Optimize batch data processing within a data warehouse and with tools such as Apache Beam, Apache Spark, Ray Data, and more
  • Architect pipelines to build and maintain massive graph data structures on the order of billions of nodes and tens of billions of edges

Who You Might Be:

  • 7+ years of experience in ML infrastructure, including model training and model deployments
  • Hands-on experience with ML optimization, including memory and GPU profiling
  • Deep experience with cloud-based technologies for supporting an ML platform, including tools like GCP BigQuery, Google Cloud Storage, infrastructure-as-code (Terraform), and more
  • Hands-on experience administering and integrating MLOps tools for experiment tracking, model serving, and model registries (e.g. MLflow or Wandb)
  • Proficiency with the common programming languages and frameworks of ML, such as Python, PyTorch, and TensorFlow
  • Deep experience working with distributed training frameworks, including Ray and Kubernetes
  • Strong focus on scalability, reliability, performance, and ease of use. You are an undying advocate for platform users and have a deep intuition for the machine learning development lifecycle.
  • Strong organizational & communication skills
  • Experience working with graph databases (Neo4j, JanusGraph, TigerGraph) is a big plus
  • Experience working with graph neural networks (GNNs) and associated graph ML frameworks (PyTorch Geometric, Deep Graph Library) is a big plus

Benefits:

  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k with Employer Match
  • Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Paid Volunteer Time Off
  • Generous Paid Parental Leave

Pay Transparency:

This job posting may span more than one career level.

In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.

To provide greater transparency to candidates, we share base pay ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar-stage growth companies. Final offer amounts are determined by multiple factors, including skills, depth of work experience, and relevant licenses/credentials, and may vary from the amounts listed below.

The base pay range for this position is:
$230,000 - $322,000 USD

In select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.

During the interview, we will collect the following categories of personal information: Identifiers, Professional and Employment-Related Information, Sensory Information (audio/video recording), and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role, as applicable.  We will not sell your personal information or disclose it to any third party for their marketing purposes.  We will delete any recording of your interview promptly after making a hiring decision.  For more information about how we will handle your personal information, including our retention of it, please refer to our Candidate Privacy Policy for Potential Employees and Contractors.

Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve.  Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If, due to a disability, you need an accommodation during the interview process, please let your recruiter know.

Top Skills

Apache Beam
Spark
Deep Graph Library
GCP BigQuery
Google Cloud Storage
JanusGraph
MLflow
Neo4j
Python
PyTorch
PyTorch Geometric
Ray
TensorFlow
Terraform
TigerGraph
Wandb
