NVIDIA Logo

NVIDIA

Senior Generative AI Software Engineer

Reposted 8 Days Ago
In-Office or Remote
Hiring Remotely in Santa Clara, CA
224K-426K Annually
Expert/Leader
In-Office or Remote
Hiring Remotely in Santa Clara, CA
224K-426K Annually
Expert/Leader
The Senior Generative AI Software Engineer will develop core infrastructure for generative AI model research, refactor codebases, and implement evaluation pipelines, ensuring maintainability and quality in an innovative AI environment.
The summary above was generated by AI

At NVIDIA, we're not just building the future, we're generating it! Our Cosmos generative AI engineering team is pushing the boundaries of what’s possible across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems. We are looking for exceptionally driven engineers and applied scientists with deep experience in generative modeling to help define the next era of AI computing.

What you'll be doing:

  • You will own and evolve the Cosmos open-source and internal research codebases, crafting core infrastructure that supports our foundation model research and deployment.

  • Refactor and modularize large research-driven code into clean, testable, maintainable libraries for use across teams.

  • Integrate and adapt off-the-shelf models into our pipelines as preprocessors, postprocessors, or evaluation components.

  • Build model-serving endpoints (e.g., with Gradio or FastAPI) to enable researchers and internal users to experiment with models interactively.

  • Design, implement, and maintain evaluation pipelines, providing high-quality tooling to the broader team to measure model quality and track improvements.

  • Improve configuration hygiene and reproducibility using systems like Hydra, and ensure smooth overrides, templates, and environment switching.

  • Lead efforts in packaging and release of Python modules using modern tools (uv, just, pydantic) for both OSS and internal consumption.

  • Set the standard for code health, test coverage, and release readiness across the team. Write documentation and automation to scale good practices.

What we need to see:

  • Expert-level proficiency in Python, with a strong foundation in modular design, abstraction boundaries, and collaborative codebase evolution.

  • Fluency with PyTorch, including the ability to run, debug, and patch inference-time model behavior in research-level codebases. Comfort modifying pre/post-processors, model wrappers, and checkpoint logic.

  • Proven experience in refactoring large codebases—cleaning up legacy implementations, eliminating anti-patterns, and paying down tech debt to improve long-term maintainability.

  • Strong grasp of configuration systems, especially Hydra, with an emphasis on reproducibility, override logic, and environment scoping.

  • Familiarity with Python packaging tools like uv, just, and pydantic, including experience managing environment consistency and shipping libraries as artifacts.

  • Strong instincts around code health: API design, directory structure, writing unit and integration tests, exception hygiene, docstrings, and dependency isolation.

  • Comfortable deploying models internally via Gradio or similar frameworks to enable interactive evaluation and feedback from researchers or downstream users.

  • BS or MS (or equivalent experience) in Computer Science, Software Engineering, or a related technical field and 10+ years of industry experience.

Ways to stand out from the crowd:

  • Proficiency in model configs, especially Hydra! Comfortable crafting hierarchical config systems with reusable templates, environment scoping, and overrides for evaluation, inference, or release.

  • Prior work cleaning up sophisticated generative model codebases—adding tests, improving wrappers, and instrumenting code for observability and debugging.

  • Demonstrated success raising engineering quality in a research setting: taking exploratory code and evolving it into a robust, production-friendly module.

  • Track record of mentoring teammates on software engineering best practices and proactively identifying long-term structural risks in fast-moving teams.

  • Passion for building ML tooling that is not only functional, but also elegant, intuitive, and maintainable by others.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 431,250 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until February 24, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Fastapi
Gradio
Hydra
Just
Pydantic
Python
PyTorch
Uv

Similar Jobs

9 Days Ago
In-Office or Remote
2 Locations
80K-130K Annually
Senior level
80K-130K Annually
Senior level
Insurance • Software
Design, build, and maintain production Generative AI capabilities (LLMs, RAG, agents). Lead scoping, model selection, deployment, monitoring, and troubleshooting. Mentor engineers, produce documentation, ensure AI safety, and integrate AI into enterprise applications.
Top Skills: Java,Python,Javascript,Db2,Postgresql,Sql Server,Vector Databases,Vue,React,Angular,Ci/Cd Pipelines,Version Control Tools,Llms,Retrieval-Augmented Generation (Rag),Agent Frameworks,Prompt Engineering,Langchain,Ai Observability Tools,Api Management,Azure,Ibm,Gcp,Aws
14 Minutes Ago
Remote or Hybrid
United States
156K-210K Annually
Senior level
156K-210K Annually
Senior level
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
The Director of Quality Assurance leads a QA organization to ensure technology systems deliver flawless, secure experiences, establishing standards and strategies for testing and quality culture across the firm.
Top Skills: AppiumAWSAzureCircleCICypressGCPGitlab CiJenkinsSelenium
14 Minutes Ago
Remote or Hybrid
United States
107K-160K Annually
Mid level
107K-160K Annually
Mid level
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
The Manager will lead advisory services for skilled nursing clients, focusing on financial reporting, budgeting, and operational improvements while mentoring staff.
Top Skills: Bill.ComIntaactMicrosoft Office SuiteNetSuiteQuickbooks Online

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account