Axiom (axiom.co)

Site Reliability Engineer

Reposted 8 Days Ago
Remote
Hiring Remotely in United States
Mid level
Design, operate, and automate scalable, secure infrastructure for Axiom Cloud. Define SLOs, plan disaster recovery and capacity, tune performance, improve deployment practices, build reliability tooling, respond to incidents, and promote monitoring and observability across teams.
The summary above was generated by AI
Site Reliability Engineer (SRE)

Global (UTC-3 preferred)

Axiom’s mission is to empower developers to get the best insights into their data, as fast as possible. We are a remote-first, globally distributed team building a cloud-native, serverless data analytics platform. Axiom changes how developers and organizations think about their data: they can now send unlimited data with cost-effective storage and lightning-fast querying.

As a Site Reliability Engineer at Axiom, you will be pivotal in upholding our promise of superior reliability and performance to our customers. Collaborating with backend engineers and product teams, you will focus on building and operating scalable, reliable systems. At Axiom, SRE work centers on automating, measuring, and continuously improving the reliability and efficiency of our systems.

Your primary responsibilities:

  • Engineer and maintain a robust, secure, and scalable infrastructure for Axiom Cloud.

  • Collaborate with engineering teams to define and refine service level objectives.

  • Contribute to disaster recovery planning, capacity engineering, performance analysis, and system tuning.

  • Promote best practices for code deployments and help educate the broader development team.

  • Roll out tooling and solutions that improve system reliability and reduce manual toil.

  • Address and remediate service incidents and contribute to postmortems and root cause analyses.

  • Foster a culture of monitoring, alerting, and observability across the organization.

You are an ideal candidate if:

  • You have over two years of experience in a reliability-focused engineering environment.

  • You are passionate about system reliability, latency, performance, and efficiency.

  • You're familiar with AWS tools and technologies.

  • You have hands-on experience with Docker, Kubernetes, and Amazon EKS.

  • You understand infrastructure-as-code tools such as Terraform or Pulumi.

  • You possess strong networking knowledge and are adept with Linux systems.

  • You are familiar with CI platforms such as GitHub Actions, GitLab, CircleCI, or others.

  • You can use LLMs efficiently.

  • You have experience with monitoring, alerting, and observability tools.

Bonus skills and experiences:

  • Proven track record of maintaining production systems at scale.

  • A software engineering background with expertise in Golang.

We provide:
  • Flexibility to work from wherever suits you best. For this role, we are considering individuals based in the timezone range UTC-5 (EST) to UTC+2.

  • Budget to build your home office set-up.

  • Monthly budget to support mental and physical wellness.

  • A focus day each week with no meetings, Slack, or Zoom, giving you uninterrupted time to focus on work.

  • Uncapped vacation to unplug and rejuvenate.

  • Generous and flexible family leave for everyone.

Top Skills

Amazon EKS
AWS
CircleCI
Docker
GitHub Actions
GitLab
Go
Kubernetes
Linux
LLMs
Monitoring And Observability Tools
Pulumi
Terraform
