Calix Logo

Calix

Staff ML Ops Engineer

Reposted 9 Days Ago
Remote
2 Locations
Senior level
Remote
2 Locations
Senior level
The Staff ML Ops Engineer builds and maintains infrastructure for ML applications, ensuring they are robust and production-ready, while collaborating with data scientists and ML engineers.
The summary above was generated by AI
Calix provides the cloud, software platforms, systems and services required for communications service providers to simplify their businesses, excite their subscribers and grow their value.

Calix is where passionate innovators come together with a shared mission: to reimagine broadband experiences and empower communities like never before. As a true pioneer in broadband technology, we ignite transformation by equipping service providers of all sizes with an unrivaled platform, state-of-the-art cloud technologies, and AI-driven solutions that redefine what’s possible. Every tool and breakthrough we offer is designed to simplify operations and unlock extraordinary subscriber experiences through innovation.

Calix is seeking a highly skilled ML Ops Engineer with hands-on experience with GCP to join our cutting-edge AI/ML team. In this role, you will be responsible for building, scaling, and maintaining the infrastructure that powers our machine learning and generative AI applications. You will work closely with data scientists, ML engineers, and software developers to ensure our ML/AI systems are robust, efficient, and production ready.

This is a remote-based position that can be located anywhere in the United States or Canada.

Key Responsibilities:

  • Design, implement, and maintain scalable infrastructure for ML and GenAI applications.

  • Deploy, operate, and troubleshoot production ML pipelines and generative AI services.

  • Build and optimize CI/CD pipelines for ML model deployment and serving.

  • Scale compute resources across CPU/GPU/TPU/NPU architectures to meet performance requirements.

  • Implement container orchestration with Kubernetes for ML workloads.

  • Architect and optimize cloud resources on GCP for ML training and inference.

  • Set up and maintain runtime frameworks and job management systems (Airflow, KubeFlow, MLflow).

  • Establish monitoring, logging, and alerting for ML system observability.

  • Collaborate with data scientists and ML engineers to translate models into production systems.

  • Optimize system performance and resource utilization for cost efficiency.

  • Develop and enforce MLOps best practices across the organization.

Qualifications:

  • Bachelor's degree in computer science, Information Technology, or a related field (or equivalent experience).

  • 8+ years of overall software engineering experience.

  • 3+ years of focused experience in MLOps or similar ML infrastructure roles.

  • Strong experience with Docker container services and Kubernetes orchestration.

  • Demonstrated expertise in cloud infrastructure management, preferably on GCP (AWS or Azure experience also valued).

  • Proficiency with workflow management and ML runtime frameworks such as Airflow, Kubeflow, and MLflow.

  • Strong CI/CD expertise with experience implementing automated testing and deployment pipelines.

  • Experience with scaling distributed compute architectures utilizing various accelerators (CPU/GPU/TPU/NPU).

  • Solid understanding of system performance optimization techniques.

  • Experience implementing comprehensive observability solutions for complex systems.

  • Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack).

  • Proficient in at least two of the following: Shell Scripting, Python, Go, C/C++

  • Familiarity with ML frameworks such as PyTorch and ML platforms like SageMaker or Vertex AI.

  • Excellent problem-solving skills and ability to work independently

  • Strong communication skills and ability to work effectively in cross-functional teams.

The base pay range for this position varies based on the geographic location. More information about the pay range specific to candidate location and other factors will be shared during the recruitment process. Individual pay is determined based on location of residence and multiple factors, including job-related knowledge, skills and experience.

San Francisco Bay Area:

156,400 - 265,700 USD Annual

All Other US Locations:

136,000 - 231,000 USD Annual

As a part of the total compensation package, this role may be eligible for a bonus. For information on our benefits click here.

Top Skills

Airflow
C/C++
Docker
Elk Stack
GCP
Go
Grafana
Kubeflow
Kubernetes
Mlflow
Prometheus
Python

Similar Jobs

17 Days Ago
Easy Apply
Remote
Canada
Easy Apply
186K-224K Annually
Senior level
186K-224K Annually
Senior level
Software
Develop AI solutions and enhance observability data using AI-powered features. Collaborate cross-functionally, iterate rapidly, and take ownership of AI projects while ensuring scalability and impact.
Top Skills: AIAWSAzureDockerGCPGenaiKubernetesLlmsTerraform
22 Days Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Software • Energy • Utilities
The Senior Machine Learning Ops Engineer will design and build machine learning operations infrastructure, manage ML pipelines, experiment tracking, model deployment, and collaborate with engineering teams to enhance data and ML processes.
Top Skills: AirflowGCPKubeflowMlflowVertexai
An Hour Ago
Remote
Canada
168K-228K Annually
Junior
168K-228K Annually
Junior
Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
Design and develop scalable software solutions, collaborate with cross-functional teams, contribute to team culture, and support operational excellence.
Top Skills: AngularCSSHTMLJavaScriptMongoDBMySQLNode.jsPostgresPythonReact

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account