NVIDIA

Senior Software Engineer - HPC

Reposted 4 Days Ago
In-Office
Austin, TX, USA
152K-242K Annually
Senior level
As a Senior Software Engineer at NVIDIA, you will enhance HPC infrastructure, improve systems reliability, and optimize cloud operations, focusing on distributed systems and automation.
The summary above was generated by AI

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are looking for a Senior Software Engineer to join our mission to continue improving our HPC infrastructure. Our team builds and operates sophisticated infrastructure to enable business-critical services and AI applications. You will be working with a team of passionate and skilled engineers who are continuously working to provide better tools to build and manage this infrastructure. The ideal candidate is strong in software development, skilled at crafting and building reliable distributed systems, and able to implement a well-thought-out long-term maintenance strategy.

What you’ll be doing:

  • Apply modern distributed systems patterns to push the limits of scale, latency, and reliability.

  • Continuously improve infrastructure provisioning and operations with automation, APIs, and self‑service platforms.

  • Operate in a globally distributed, hybrid multi‑cloud environment (AWS, GCP, on‑prem), building systems that are cloud‑native and location‑agnostic.

  • Build strong cross-functional relationships and align with collaborators across various business units.

  • Improve uptime and Quality of Service (QoS) through data-driven operations, strong SLOs, and robust incident practices.

  • Participate in the team’s on‑call rotation and lead high‑impact incident response when needed.

What we need to see:

  • Strong coding skills in at least two of: Go, Java, C/C++, Scala, Python, Elixir, with a focus on backend, systems, or infrastructure engineering.

  • Deep understanding of scalability, consistency, and performance trade‑offs in server‑side systems; ability to build horizontally scalable, resilient, and low‑latency services.

  • Experience owning services end‑to‑end: architecture, build reviews, implementation, testing, rollout, observability, and iterative improvement.

  • Hands‑on experience with at least one major cloud provider (GCP, AWS, or Azure) and cloud‑native primitives (managed storage, messaging, compute).

  • Proficiency with modern CI/CD, GitOps workflows, and Infrastructure as Code practices for safe, repeatable changes.

  • Bias for action, strong problem‑solving skills, and a track record of simplifying complex systems.

  • B.S. in Computer Science or related field (or equivalent experience), with 5+ years of relevant experience.

  • Careful communication and collaboration skills; comfortable guiding technical decisions across teams.

Ways to stand out from the crowd:

  • Prior experience building core infrastructure or control planes for HPC clusters, large-scale AI/ML platforms, or systems managed by job schedulers (e.g., Slurm or Kubernetes).

  • Maintainer or co‑maintainer responsibilities for an open source component used in production (plugins, operators, exporters, controllers, or SDKs) at large scale.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 19, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate have made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center