Voxel51

Senior Infrastructure Engineer

Posted Yesterday
Remote
Hiring Remotely in USA
200K-240K Annually
Senior level
As a Senior Infrastructure Engineer, you'll shape architecture and strategy for managing unstructured data, design cloud-based deployment systems, and mentor teams while enhancing infrastructure best practices.
The summary above was generated by AI

First and most importantly: our mission is to bring transparency and clarity to the world's data.

Our platform, FiftyOne, is where AI work happens. Our enterprise platform is the mission critical linchpin for managing unstructured data, model development, and AI systems at the world's largest companies.

We believe that open source is the way to lead the data-centric AI revolution. Our open source version has 4 million downloads to date.

Our software massively impacts AI work across almost every vertical: from self-driving cars to medical imaging to agriculture, we are at the thrilling center of the next wave of real-world AI advancement.

And we’re built on three key tenets:

  • We are all human beings: we strive to be a “human-first” organization and treat everyone with the respect, care, and flexibility that all people deserve. 
  • We are distributed: we believe in getting autonomy and power into the hands of people actually doing the work.
  • We believe in the power of community.

We are fully remote and hiring people based in the United States who are prepared to travel to at least two in-person retreats per year.

About your role

As a Senior Infrastructure Engineer at Voxel51, you will shape the architecture and strategy of the systems that power our platform — from individual researchers to enterprise-scale deployments. You’ll lead the design of containerized systems, CI/CD pipelines, and deployment solutions across cloud and on-premises environments, while solving the unique challenges of serving unstructured data (images and video) at scale.

You’ll partner with enterprise customers, guiding and troubleshooting their production deployments. You’ll collaborate across engineering teams to improve developer productivity, and mentor peers while setting infrastructure best practices. Your work will directly shape the reliability, security, and scalability of Voxel51’s platform — and accelerate our mission to democratize data-centric ML.

What you will do

  • Shape the architecture and evolution of Voxel51’s infrastructure to support deployments ranging from individual researchers to Fortune 500 enterprises
  • Design, build, and scale deployment systems across cloud (GCP, AWS, Azure) and on-premises environments, ensuring reliability, security, and repeatability
  • Partner with enterprise customers (and our Customer Success Machine Learning Engineers) to deliver and support production-grade deployments in their environments, guiding them through installation, troubleshooting, and scaling
  • Lead infrastructure initiatives across engineering teams, enabling peers to develop, test, and ship features faster with robust internal tooling and automation
  • Drive best practices in CI/CD, evolving our pipelines (currently GitHub Actions + Google Cloud Build) and introducing new approaches where they add value
  • Develop and maintain deployment solutions for Voxel51-hosted environments (GKE) as well as customer on-prem installations (K8s or Docker Compose)
  • Champion developer productivity, improving workflows for development and automated cloud deployments
  • Troubleshoot and resolve complex infrastructure issues, spanning build failures, runtime failures, and customer deployment challenges
  • Anticipate and prevent failures by designing monitoring, alerting, and predictive solutions for both internal and customer environments
  • Mentor engineers and set technical direction, ensuring Voxel51’s infrastructure remains ahead of customer needs and industry trends

What you should bring

  • Deep experience with containerized environments
    • Building, packaging, and debugging container images
    • Kubernetes (and Docker Compose) for orchestration
    • Building, maintaining, and deploying Helm charts
  • Infrastructure as Code expertise (Terraform, Ansible, or equivalent)
  • Scripting and automation skills (Bash or similar)
  • Python expertise, including build and environment management, packaging/distribution, release management, and dependency debugging
  • CI/CD systems experience, ideally GitHub Actions (we use this today)
  • Cloud infrastructure knowledge, especially GCP (IAM, VPC, load balancing, ingress/egress routing, proxies, firewall rules)
  • Database fundamentals, ideally MongoDB or similar NoSQL systems
  • Observability skills, including designing meaningful monitors, logging, tracing, and alerting
  • Security best practices, including certificates, service accounts, least privilege, and role assumptions
  • Troubleshooting ability across complex, distributed systems (including with customers in the loop)
  • Testing mindset: comfortable with designing and applying different types of tests to validate functionality
  • Strong communication skills, with the ability to work directly with enterprise customers as well as collaborate across teams in a remote-first environment
  • Adaptability and curiosity, with the ability to ramp quickly on unfamiliar concepts and technologies

The cash compensation for this person is in the $200K - $240K range. In addition to base comp for this role, we offer equity in the form of options, a variety of benefits, and the opportunity to grow in an exciting and collaborative environment.
