Parallel Systems

Senior ML Ops Engineer (Machine Learning Infrastructure)

Posted 17 Days Ago
In-Office or Remote
Hiring Remotely in Los Angeles, CA
150K-240K Annually
Senior level
Lead the design and development of scalable ML infrastructure for autonomous vehicles, enabling efficient model training and deployment. Collaborate across teams to develop and manage automated pipelines and cloud-based systems while focusing on continuous integration and deployment.
The summary above was generated by AI

Parallel Systems is pioneering autonomous battery-electric rail vehicles designed to transform freight transportation by shifting portions of the $900 billion U.S. trucking industry onto rail. Our innovative technology offers cleaner, safer, and more efficient logistics solutions. Join our dynamic team and help shape a smarter, greener future for global freight.

Senior ML Ops Engineer (Machine Learning Infrastructure)

Parallel Systems is seeking an experienced MLOps/ML Infrastructure Engineer to lead the design and development of the scalable systems that power our autonomy and perception pipelines. As we build the first fully autonomous, battery-electric rail vehicles, you will play a critical role in enabling the ML teams to develop, train, and deploy models efficiently and reliably in both R&D and real-world environments.

This is an opportunity to take full ownership of the ML infrastructure stack, from distributed training environments and experiment tracking to deployment and monitoring at scale. You’ll collaborate closely with world-class engineers in autonomy, robotics, and software, helping shape the core systems that make real-time, safety-critical ML possible. If you're driven by building robust platforms that unlock innovation in AI and robotics, we’d love to work with you. 

This can be a remote role for a senior engineer with experience in 0 to 1 builds of perception systems. 

Responsibilities:

  • Design and implement robust MLOps solutions, including automated pipelines for data management, model training, deployment and monitoring. 
  • Architect, deploy, and manage scalable ML infrastructure for distributed training and inference. 
  • Collaborate with ML engineers to gather requirements and develop strategies for data management, model development and deployment. 
  • Build and operate cloud-based systems (e.g., AWS, GCP) optimized for ML workloads in R&D and production environments.
  • Build scalable ML infrastructure to support continuous integration/deployment, experiment management, and governance of models and datasets. 
  • Support the automation of model evaluation, selection, and deployment workflows. 

What Success Looks Like: 

  • After 30 Days: You have developed a deep understanding of the product goals, existing infrastructure, and stakeholder requirements. You've conducted technical discovery and proposed a preliminary MLOps architecture—evaluating various ML tools, cloud services, and workflow strategies—clearly outlining pros and cons for each option. 
  • After 60 Days: You’ve delivered a detailed design document that outlines the end-to-end ML pipeline, including data ingestion, model training, deployment, and monitoring. Based on feedback from ML engineers and stakeholders, you’ve iterated on the design and built a proof of concept (PoC) for the core ML workflow aligned with the approved architecture.
  • After 90 Days: You have delivered the core features of the MLOps pipeline and successfully integrated key tools (e.g., MLflow, SageMaker, or Kubeflow). You’ve also initiated the implementation of the remaining features, ensuring the infrastructure supports scalable, repeatable workflows for model experimentation and deployment in both R&D and production environments. 

Basic Requirements: 

  • Bachelor’s or higher degree in Computer Science, Machine Learning, or a relevant engineering discipline. 
  • 5+ years of experience building large-scale, reliable systems; 2+ years focused on ML infrastructure or MLOps. 
  • Proven experience architecting and deploying production-grade ML pipelines and platforms. 
  • Strong knowledge of ML lifecycle: data ingestion, model training, evaluation, packaging, and deployment. 
  • Hands-on experience with MLOps tools (e.g., MLflow, Kubeflow, SageMaker, Airflow, Metaflow, or similar). 
  • Deep understanding of CI/CD practices applied to ML workflows. 
  • Proficiency in Python, Git, and system design with solid software engineering fundamentals. 
  • Experience with cloud platforms (AWS, GCP, or Azure) and designing ML architectures in those environments. 

Preferred Qualifications: 

  • Experience with deep learning architectures (CNNs, RNNs, Transformers) or computer vision. 
  • Hands-on experience with distributed training tools (e.g., PyTorch DDP, Horovod, Ray). 
  • Background in real-time ML systems and batch inference, including CPU/GPU-aware orchestration. 
  • Previous work in autonomous vehicles, robotics, or other real-time ML-driven systems. 

We are committed to providing fair and transparent compensation in accordance with applicable laws. The target hiring range listed below reflects the expected range for new hires in this role, based on factors such as skills, experience, qualifications, and location. Final compensation may vary and will be determined during the interview process.

Target Salary Range:
$150,000-$240,000 USD

Parallel Systems is an equal opportunity employer committed to diversity in the workplace. All qualified applicants will receive consideration for employment without regard to any discriminatory factor protected by applicable federal, state or local laws. We work to build an inclusive environment in which all people can come to do their best work.

Parallel Systems is committed to the full inclusion of all qualified individuals. As part of this commitment, Parallel Systems will ensure that persons with disabilities are provided reasonable accommodations. If reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and/or to receive other benefits and privileges of employment, please contact your recruiter.

Top Skills

Airflow
AWS
Azure
GCP
Git
Kubeflow
Metaflow
MLflow
Python
SageMaker
