NVIDIA Logo

NVIDIA

Senior Systems Software Engineer - GPU Performance at Scale

Reposted 6 Days Ago
Be an Early Applicant
In-Office or Remote
3 Locations
184K-357K Annually
Senior level
In-Office or Remote
3 Locations
184K-357K Annually
Senior level
Lead performance practices for large-scale AI systems, enhance datacenter products, and resolve complex performance issues through collaboration and technical expertise.
The summary above was generated by AI

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are looking for a dedicated engineer for the Senior Systems Software Engineer role, focusing on GPU Performance at Scale. At NVIDIA, this role is uniquely positioned to drive innovation in AI and GPU computing. You will contribute to world-class computing hardware and software, fueling groundbreaking advancements in artificial intelligence. You will provide insights on large-scale system composition and tuning mechanisms for high-performance compute runs. Collaborate with researchers, developers, and customers to craft improved workflows and develop new, leading solutions. Engage with HPC, OS, CPU, GPU compute, and systems specialists to architect, build, and optimize large-scale performance platforms.

What you'll be doing:

  • Lead the implementation of performance practices in large-scale GPU infrastructure, delivering powerful tools, methodologies, and flows to validate and improve multiple datacenter products concurrently.

  • Align next-generation AI workloads with next-generation datacenter builds for NVIDIA GPUs, CPUs, and networking hardware. Engage early with HW/FW/SW/platform internal and customer teams.

  • Develop engineering solutions that provide continuous insights into the performance of AI workloads in evolving environments, generating swift insights into improvements and regressions.

  • Decompose high-complexity performance or stability issues into minimal reproduction cases, working towards identifying the root cause.

  • Participate in collaborations with various SW and FW teams (BMC/SBIOS/OS/drivers, etc.) to develop outstanding methods and tools. Analyze, debug, and resolve critical firmware and software issues to achieve the highest AI workload performance at scale.

What we need to see:

  • Proven understanding of accelerated computing software stacks (CUDA).

  • Experience with modern cloud and container-based enterprise computing architectures, with Slurm preferred.

  • Strong programming and scripting experience in C/C++/Python/Bash.

  • Deep expertise in systems architecture and the impact of various components on performance.

  • Experience with container technology and Linux-based OSes, with Docker preferred.

  • Experience supporting high-performance computing or deep learning in engineering or academic research communities.

  • Strong teamwork and communication skills, coupled with results-focused analytical abilities.

  • BS in Engineering, Mathematics, Physics, or Computer Science (or equivalent experience); MS or PhD desirable with 8+ years of applicable experience.

Ways to Stand Out From the Crowd

  • End-to-end GPU performance engineering from the profiler to systems analysis.

  • Linux systems programming and optimization experience.

  • Exposure to virtualization techniques and cloud platform solutions.

  • Experience with scheduling and resource management systems.

  • Experience with large-scale HPC environments.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 22, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

10 Minutes Ago
Remote
US
110K-130K Annually
Mid level
110K-130K Annually
Mid level
Big Data • Cloud • Fintech • Professional Services • Software
Serve as primary contact for strategic mortgage customers—onboard, drive adoption, manage renewals, and support upsells. Monitor KPIs and customer health, troubleshoot API issues, escalate complex support cases, and build repeatable success processes to reduce churn and scale retention.
Top Skills: Ai ToolsAPIsLoan Origination SystemsPoint-Of-Sale
24 Minutes Ago
Remote
USA
Entry level
Entry level
Insurance • Financial Services
Remote role guiding aspiring insurance agents through state licensing: manage a 150-200 person pipeline, maintain notes in MS Access, communicate licensing requirements, make outbound/inbound calls, provide progress updates to agency partners, and coordinate changes with supervisors and management.
Top Skills: ExcelMicrosoft AccessMS Office
2 Hours Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
119K-160K Annually
Mid level
119K-160K Annually
Mid level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Provide end-to-end commercial litigation support, advise on subpoenas and customer data privacy, manage eDiscovery lifecycle with automation/AI, mitigate and resolve disputes, drive process and technology-enabled innovation, and deliver actionable legal insights to cross-functional stakeholders.
Top Skills: AIEdiscoveryInternet Of Things (Iot)Tofu

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account