NVIDIA Logo

NVIDIA

Senior HPC Storage Engineer

Reposted 3 Days Ago
Be an Early Applicant
In-Office
Austin, TX, USA
184K-357K Annually
Senior level
In-Office
Austin, TX, USA
184K-357K Annually
Senior level
The role involves researching, designing, and implementing advanced storage solutions for HPC workloads, optimizing for performance, cost, and scalability across cloud infrastructures, while automating management processes and providing support for deep learning workflows.
The summary above was generated by AI

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

As a member of the HW Infrastructure Storage Strategy team, you will provide leadership in the research, design and implementation of ground breaking fast storage solutions to enable runs of demanding high performance computing, and computationally intensive workloads. We seek an expert to identify architectural changes encompassing file, block, and object storage, to cater to the scaling and performance requirements of an expanding cloud infrastructure. As an expert, you will help us with the next-gen storage solutions strategic challenges we encounter with storage design for large scale, high performance workloads, evolving our private/public cloud strategy, capacity modelling, and growth planning across our global computing environment.

What you'll be doing:

  • Research and analyze existing internal distributed storage services.

  • Research, design, and implement scalable, next-gen distributed storage services for HPC workloads, optimizing both performance and cost-effectiveness to meet NVIDIA’s growing infrastructure needs

  • Develop tooling to automate management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources.

  • Detail the general procedures and practices, perform technology evaluations, related to distributed file systems.

  • Collaborate across teams to better understand developers' workflows and capture their infrastructure requirements.

  • Influence and guide methodologies for building, testing, and deploying applications to ensure efficient performance and resource utilization.

  • Supporting our researchers to run their flows on our clusters including performance analysis and optimizations of deep learning workflows

  • Root cause analysis and suggest corrective action for problems large and small scales

What we need to see:

  • Bachelor’s degree in Computer Science, Electrical Engineering or related field or equivalent experience.

  • 8+ years of experience designing and/or operating large scale storage infrastructure.

  • Experience analyzing and tuning storage performance for a variety of workloads.

  • Proficient in Centos/RHEL and/or Ubuntu Linux distros including Python programming and bash scripting

  • In depth understanding of container technologies like Docker, Enroot

Ways to stand out from the crowd:

  • Distributed Storage Expertise: Extensive experience with parallel and distributed filesystems (Ceph, Weka.io, Vast, Lustre, GPFS) and Linux storage kernel development.

  • GPU & AI Infrastructure: Proficient with NVIDIA GPUs, CUDA programming, and NCCL, including performance benchmarking via MLPerf.

  • Hardware & Storage Engineering: Deep familiarity with storage hardware (HDDs, SSDs, NVMe), enclosures, and specialized appliances like Network Appliance.

  • Advanced Networking: Strong background in Software Defined Networking (SDN) and high-performance networking for AI/HPC clusters.

  • Deep Learning Frameworks: Practical experience applying industry-standard frameworks, specifically PyTorch and TensorFlow.

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most resourceful and talented people in the world working for us and, due to unprecedented growth, our extraordinary engineering teams are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to hear from you. Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 13, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

42 Minutes Ago
Hybrid
Mid level
Mid level
eCommerce • Healthtech • Pet • Retail • Pharmaceutical
Manage end-to-end non-inventory procurement for fulfillment centers including purchasing corrugate, shipping materials, and consumables. Maintain stocking strategy and DOH targets, perform counts, manage purchase requests, monitor vendor performance, ensure budget and policy compliance, and support site audits, 6S, and cross-functional coordination.
Top Skills: Erp PlatformsExcelMS OfficeProcurement Systems
44 Minutes Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
119K-160K Annually
Mid level
119K-160K Annually
Mid level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Provide end-to-end commercial litigation support, advise on subpoenas and customer data privacy, manage eDiscovery lifecycle with automation/AI, mitigate and resolve disputes, drive process and technology-enabled innovation, and deliver actionable legal insights to cross-functional stakeholders.
Top Skills: AIEdiscoveryInternet Of Things (Iot)Tofu
46 Minutes Ago
Hybrid
Austin, TX, USA
130K-176K Annually
Senior level
130K-176K Annually
Senior level
Artificial Intelligence • Internet of Things • Semiconductor
Define and evaluate next-generation Arm CPU micro-architectures by developing and using C++ performance models, collaborating with RTL designers, pruning design trade-offs, and mentoring junior engineers to improve modelling methodologies.
Top Skills: Arm Architecture And Instruction SetsC++Power ModelsRtl SimulatorsSystemcSystemc SimulatorsUnix

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account