Tenstorrent Inc. Logo

Tenstorrent Inc.

DevOps Architect

Posted 22 Days Ago
Be an Early Applicant
Easy Apply
In-Office
Austin, TX, USA
100K-500K Annually
Expert/Leader
Easy Apply
In-Office
Austin, TX, USA
100K-500K Annually
Expert/Leader
Design and define the end-to-end architecture for a next-generation AI cluster control plane. Responsible for provisioning, orchestration, monitoring, security, workload placement, resource allocation, and multi-tenant integration across multi-thousand-accelerator, multi-megawatt data centers.
The summary above was generated by AI

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

We are building multi-megawatt AI data centers with thousands of accelerators and seeking a DevOps Architect to define the next generation cluster control plane that provisions, operates, and secures large-scale AI training and inference environments.

This is a foundational architecture role. You will define how clusters are configured, orchestrated, monitored, and secured at scale.

This role is hybrid, based out of Austin, TX; Santa Clara, CA; or Toronto, ON.

We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.


Who You Are

  • 10+ years designing and operating enterprise, HPC, or large-scale data center infrastructure
  • Deep expertise in cloud-native and bare-metal infrastructure management
  • Strong hands-on experience with Infrastructure-as-Code tools such as Terraform, Ansible, and Helm
  • Experienced building and operating observability stacks including Prometheus, Grafana, ELK or EFK, and OpenTelemetry
  • Strong understanding of networking, storage systems, accelerator resource management, and security models including RBAC, IAM, TLS, and secrets management

What We Need

  • Define the end-to-end architecture for the AI cluster control plane, covering provisioning, configuration, lifecycle management, and monitoring
  • Architect scalable systems for system, network, and storage provisioning across multi-thousand accelerator environments
  • Establish telemetry, logging, metrics, tracing, and alerting frameworks with operational guardrails
  • Define workload placement, resource allocation, scheduling, and preemption policies to maximize accelerator utilization
  • Integrate authentication, authorization, account management, key management, backup, checkpointing, and DCIM infrastructure into a secure multi-tenant environment

What You Will Learn

  • How multi-megawatt AI data centers are architected and operated at scale
  • The operational challenges of orchestrating thousands of AI accelerators during training and inference
  • How distributed infrastructure decisions directly impact AI model throughput, reliability, and cost efficiency
  • Advanced strategies for multi-tenant isolation, security hardening, and workload optimization in AI clusters
  • How hardware, runtime systems, and control plane architecture co-evolve in next-generation AI infrastructure

Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology.  Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2).   These requirements apply to persons located in the U.S. and all countries outside the U.S.  As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency.  If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.

Top Skills

Ansible
Bare-Metal
Cloud-Native
Dcim
Efk
Elk
Grafana
Helm
Iam
Opentelemetry
Prometheus
Rbac
Risc-V
Secrets Management
Terraform
Tls

Similar Jobs

4 Days Ago
In-Office or Remote
Texas, USA
195K-300K Annually
Expert/Leader
195K-300K Annually
Expert/Leader
Cloud • Information Technology • Security • Software
As a DevOps Architect, you'll lead automation efforts for SaaS services, mentor junior team members, and set strategic directions for CI/CD and monitoring solutions.
Top Skills: AIAWSAzureFluxGCPGoGrafanaJavaJenkinsKubernetesOciProgramming Languages: C/C++PrometheusPython
14 Days Ago
In-Office
Senior level
Senior level
Insurance • Financial Services
The DevOps Architect will design and implement DevOps practices, automate deployment processes, manage Azure environments, and ensure timely software releases while improving processes.
Top Skills: AnsibleAzureAzure DevopsBashC#.Net CoreCi/CdDockerInfrastructure As CodeJenkinsKubernetesPowershellPythonSQLTerraform
23 Days Ago
In-Office
Mid level
Mid level
Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Analytics • Generative AI
This role involves guiding and implementing DevOps practices, coaching teams, managing automation environments, and delivering cloud infrastructure solutions.
Top Skills: .NetAnsibleArm TemplatesAWSAzureBambooChefCloudFormationData DogElkGCPPowershellPuppetPythonSplunkTeamcityTerraformVso/Vsts

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account