Bespoke Labs

DevOps / Site Reliability Engineer

Sorry, this job was removed at 10:12 p.m. (CST) on Friday, May 15, 2026
Remote
Hiring Remotely in USA

About Bespoke Labs

Bespoke Labs is an AI research and data company building the datasets, benchmarks, and evaluation infrastructure that power frontier AI models. We're backed by leading investors, trusted by top AI labs, and have research accepted at venues like ICLR 2026. Our team is small, moves fast, and has an outsized impact on how the next generation of AI is built.

The Role

We're looking for a mid-level DevOps / Site Reliability Engineer to own and scale our cloud infrastructure. You'll work closely with engineering and ML teams to keep our systems reliable, observable, and fast — directly supporting the infrastructure that powers AI data pipelines at scale.

What You'll Do

  • Own cloud infrastructure on AWS — EC2, EKS, RDS, S3, IAM, VPC

  • Manage Kubernetes clusters and container orchestration end-to-end

  • Build and maintain CI/CD pipelines using GitHub Actions or similar

  • Implement monitoring, alerting, and observability stacks (Prometheus, Grafana, or DataDog)

  • Improve reliability, performance, and security of production systems

  • Automate infrastructure with Terraform or similar IaC tools

  • Debug and resolve issues across complex, distributed systems

  • Participate in design reviews and help raise the infrastructure bar

What We're Looking For

  • 3–5 years in DevOps, SRE, or infrastructure engineering

  • Strong AWS experience — EKS, EC2, RDS, S3, IAM

  • Kubernetes — deployment, scaling, troubleshooting in production

  • CI/CD pipelines — GitHub Actions, ArgoCD, or similar

  • Infrastructure as Code — Terraform, Pulumi, or CDK

  • Python or Go scripting

  • Experience working in production environments with real users

  • Comfort with ambiguity and ability to operate autonomously

Nice to Have

  • Experience supporting ML training workloads or GPU clusters

  • Familiarity with distributed computing or large-scale data pipelines

  • Prior work at an AI, ML, or data company

  • Open-source contributions or published technical writing

What We Offer

  • Competitive compensation and meaningful equity

  • Direct impact on frontier AI model training and evaluation infrastructure

  • Flexible, remote-friendly environment with low bureaucracy

  • A small, high-caliber team with deep AI research expertise

  • Health, wellness, and learning & development benefits

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. Its food and music scene, low taxes and favorable climate have made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center
