NVIDIA Logo

NVIDIA

Senior Machine Learning Engineer

Reposted Yesterday
In-Office or Remote
5 Locations
184K-288K Annually
Senior level
In-Office or Remote
5 Locations
184K-288K Annually
Senior level
The Senior Machine Learning Engineer will develop models and algorithms for NVIDIA's DGX Cloud, focusing on anomaly detection, optimization, and forecasting. Responsibilities include analyzing data, collaborating with teams, and delivering impactful machine learning solutions.
The summary above was generated by AI

As a Senior Machine Learning Engineer at NVIDIA, you will build the machine learning brain that keeps NVIDIA’s global DGX Cloud healthy, efficient and ready for the next waves of AI breakthroughs. DGX Cloud fuses NVIDIA GPUs, NVLink networking and the full AI software stack into elastic infrastructure powering large language models, drug discovery, autonomous driving and climate science. Your models will turn billions of telemetry signals into predictive insight. This frees customers to innovate while our platform runs smarter.

What you'll be doing:

  • Ground breaking and developing innovative machine learning algorithms and models that propel our AI products.

  • Build production models for anomaly detection, predictive maintenance and usage optimization.

  • Develop tools surfacing real time telemetry, efficiency metrics and long term trends.

  • Develop forecasting and simulation models for global scale planning.

  • Analyzing complex datasets to determine the best approach for model training and optimization.

  • Translate findings into clear engineering actions with infrastructure, operations and product teams.

  • Participating in cross-functional projects to integrate machine learning capabilities into various NVIDIA products.

What we need to see:

  • Master's degree or PhD in Mathematics, Statistics, Machine Learning or related quantitative field (or equivalent experience).

  • 8+ years experience applying Machine Learning to operational systems.

  • Proven track record of building and deploying Machine Learning models in production environments.

  • Experience with time series analysis and optimization algorithms.

  • Familiarity with distributed systems and cloud platforms such as AWS and Kubernetes.

  • Strong software engineering skills and proficiency in Python.

  • Effective verbal/written communication, and technical presentation skills.

  • Experience with machine learning frameworks such as TensorFlow, PyTorch, or similar.

  • A track record of delivering high-impact projects to compete in a fast-paced environment.

Ways to stand out from the crowd:

  • Experience solving capacity planning problems.

  • Deep understanding of GPU performance metrics.

  • Familiarity with prometheus and PromQL.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until December 14, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

AWS
Kubernetes
Machine Learning
Python
PyTorch
TensorFlow

Similar Jobs

Yesterday
In-Office or Remote
San Francisco, CA, USA
168K-264K Annually
Senior level
168K-264K Annually
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
As a Senior Machine Learning Engineer, you will develop and scale machine learning models for Rovo Chat, ensuring reliability and utility, while communicating solutions across teams.
Top Skills: AWSDatabricksJavaPythonSparkSQL
5 Days Ago
Remote or Hybrid
United States
119K-222K Annually
Senior level
119K-222K Annually
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The Senior Machine Learning Engineer will design and optimize ML models, manage end-to-end ML projects, and enhance SailPoint's AI capabilities while collaborating with various teams to integrate AI features into products.
Top Skills: AirflowAws SagemakerCloudbeesDbtGoJenkinsKafkaPythonPyTorchQlikScikit-LearnShell/BashSnowflakeSparkSQLTableauTensorFlow
20 Days Ago
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Machine Learning Engineer will design, build, and lead strategies for risk detection models, mentor team members, and apply advanced AI/ML methodologies to enhance user security and experience.
Top Skills: Ai/Ml FrameworksApache AirflowFeature StoresGnnsKafkaLlmsLstmsPythonPyTorchRayserveSparkTensorFlow

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account