NVIDIA

Senior Solutions Architect - AI Infrastructure

Posted 2 Days Ago
In-Office or Remote
Hiring Remotely in CA, USA
184K-357K Annually
Senior level
Lead GPU and NVLink-based cluster design and validation for large-scale AI and HPC deployments. Advise cloud partners on architectures, perform performance modeling, debug deployment issues, support NPI rollouts, and relay field feedback to engineering.
The summary above was generated by AI

NVIDIA is building the world’s most groundbreaking and innovative accelerated computing platforms for AI and HPC.  Because of our work, scientists, researchers, and engineers can push the boundaries of what’s possible.  We pioneered a supercharged form of computing that powers everything from breakthrough AI research to the world’s fastest supercomputers.

We are seeking a highly motivated Senior Solutions Architect to join the NVIDIA Cloud Partners team with a focus on GPU, NVLink, and infrastructure design. In this role, you will be at the forefront of assisting with designs and architectures for some of the largest next-generation GPU-based clusters, enabling the world’s most advanced AI supercomputers and enterprise AI infrastructure in the field. As a Solutions Architect, you will serve as a key technical expert on NVIDIA’s groundbreaking GPU and NVLink technology designs and on all of our software solutions, acting as a direct bridge between engineering and the field teams that support customers with the most demanding requirements. You will work on end-to-end cluster design and architecture, performance modeling, validation, and NPI cluster deployments. Your expertise will directly influence how the world’s leading AI companies, cloud providers, hyperscalers, research institutions, and enterprises build their infrastructure.

What you’ll be doing:

  • Partner with NVIDIA Cloud Partners on GPU cluster design and networking, conveying architecture and process guidance for building next-generation architectures.

  • Guide NVIDIA Cloud Partners in cluster design, weighing design principles against complex, situational limitations to make the most performant and supportable GPU clusters possible.

  • Work closely with NVIDIA Cloud Partners to ensure successful first deployments with new products, including new network architectures and topologies.

  • Relay customer and field perspectives on cluster design and workflows back to the engineering teams designing internal clusters.

  • Perform hands-on work to help NVIDIA Cloud Partners debug issues related to cluster design, configuration, and performance, drawing on internal engineering expertise and known bugs.

  • Support NPI customer deployments with new GPU/Networking architectures.

What we need to see:

  • BS, MS, or PhD in Computer Science, Electrical Engineering, Computer Engineering, Physics, or related field (or equivalent experience).

  • 8+ years of experience in cluster design, validation, and issue resolution, specifically on GPU and HPC clusters.

  • Proven expertise in designing large-scale distributed systems, AI clusters, or HPC infrastructure.

  • Ability to translate sophisticated engineering concepts into customer-ready documentation, diagrams, and reference material.

  • Expertise in driving customer/partner issues to a close with product and engineering teams.

  • Ability to handle multi-functional communications across customer, product team, support team, engineering team, etc.

Ways to stand out from the crowd:

  • Experience leading large-scale AI Factory or HPC cluster bring-ups or builds.

  • Hands-on experience with NVIDIA products, including but not limited to GPUs, NVLink, and NVIDIA Networking, and specifically with debugging issues that occur during deployment on NVLink and related technologies.

  • Knowledge of NCCL, MPI, IMEX, NMX, and collectives in distributed training as they pertain to cluster designs.

  • External customer-facing skill set and background.

  • Effective time management and capability to balance multiple tasks and customers while thinking creatively to debug and solve problems.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 6, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA Austin, Texas, USA Office

Austin, United States

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate have made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center
