Top Senior Site Reliability Engineer Jobs in Austin, TX
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills:
AWS, Docker, GCP, Kubernetes
HR Tech • Software
The Site Reliability Engineer will architect and manage AWS infrastructure, implement CI/CD pipelines, lead incident responses, and mentor junior engineers to maintain reliability and security for a B2B SaaS platform.
Top Skills:
ALB, AWS, Bash, CloudFormation, Datadog, ECS, Git, Go, GuardDuty, Jenkins, Lambda, Python, S3, Terraform
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills:
Argo CD, Bash, ELK Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Artificial Intelligence • Information Technology • Consulting
The Linux Systems Administrator will maintain and troubleshoot Linux systems, support network services, and work on systems integration while collaborating with infrastructure teams.
Top Skills:
DHCP, DNS, Linux, NTP, Python
Information Technology • Internet of Things • Software • Virtual Reality
Lead reliability, availability, and resiliency strategies for large-scale systems, drive operational excellence, and provide technical mentorship across engineering teams.
Top Skills:
AWS, CI/CD, Java, MongoDB, RabbitMQ, ZooKeeper
AdTech • Digital Media • Information Technology • Other
As a Software Engineer in the Tooling and Reliability Platforms team, you'll develop AI services, manage incident tools, and utilize Infrastructure as Code for high-availability systems. You'll focus on integrating AI workflows and improving operational resilience for Yahoo's brands.
Top Skills:
AWS, CloudFormation, Docker, GCP, Go, Java, Kubernetes, Python, Terraform
AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
Lead the development of Launch Potato's cloud infrastructure, establishing SRE practices including on-call rotations and monitoring systems, while ensuring cost efficiency and reliability.
Top Skills:
AWS, CI/CD, ECS, Grafana, Lambda, OpenTelemetry, PagerDuty, Terraform
AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
The Lead Engineer, DevOps & SRE will oversee the cloud infrastructure, build the SRE function, and manage CI/CD processes to ensure reliable operations and compliance.
Top Skills:
AWS, CI/CD, ECS, Grafana, Lambda, OpenTelemetry, PagerDuty, Terraform
AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
The Lead DevOps/SRE Engineer will own and evolve cloud infrastructure, build the SRE function, manage CI/CD platforms, and ensure compliance while enhancing infrastructure reliability and cost control.
Top Skills:
AWS, CI/CD, Grafana, OpenTelemetry, PagerDuty, Terraform
Real Estate • Travel • PropTech
The Engineering Manager for Storage SRE will lead a team to ensure reliable database operations, improve developer experience, and expand tooling and operational models, focusing on mission-critical systems.
Top Skills:
Cloud Infrastructure, Databases, Site Reliability Engineering, Storage Systems
Cloud • Software
Responsible for ensuring reliability, availability, and performance of cloud production systems, leading incident response, automating workflows, and improving system observability and scalability.
Top Skills:
AWS, Azure, Bash, Datadog, ELK, GCP, Grafana, Kubernetes, OpenTelemetry, Prometheus, Python, Terraform
Cloud • Security • Software • Generative AI
The role involves designing and developing tooling for the Elastic Stack, managing production services, and supporting internal Elastic Stack usage for development and analytics.
Top Skills:
Ansible, Chef, Clojure, Docker, Haskell, JavaScript, Kubernetes, Packer, Puppet, Python, Salt, Terraform
Blockchain • Fintech • Cryptocurrency
Responsible for the design, implementation, and reliability of systems across hybrid cloud and on-premises environments, while leading technical initiatives and mentoring engineers.
Top Skills:
Ansible, Datadog, GitHub Actions, Helm, Kubernetes, Python, Terraform
Real Estate • Financial Services • PropTech
As a Senior Associate, Site Reliability Engineer, you will support AWS Cloud products, ensuring stability, optimizing performance, and enhancing automation. Responsibilities include collaborating with teams, applying cloud best practices, and improving application observability.
Top Skills:
AWS, Azure DevOps, Bash, CI/CD, Docker, GitOps, Kubernetes, Load Balancers, PowerShell, Python, SQL, Terraform
Artificial Intelligence • Fintech • Software • Financial Services
The SRE will own reliability for a cloud-native platform, optimizing performance, availability, and observability, while mentoring engineering teams.
Top Skills:
AWS, ClickHouse, Go, Kafka, Kubernetes, Pulumi, Python, Terraform
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills:
Argo CD, AWS, ELK, GCP, Go, Grafana, Istio, Jaeger, Kubernetes, Linkerd, Loki, OpenTelemetry, Prometheus, Python, Tempo, Terraform
Fintech
The Staff Site Reliability Engineer role involves leading architecture, automating the GCP environment, defining SLIs and SLOs, mentoring teammates, and enhancing system reliability and performance.
Top Skills:
Argo CD, Datadog, GCP, Go, Helm, JavaScript, Kubernetes, Python, Terraform, TypeScript
Cloud • Information Technology • Security • Software
As a DevOps Architect, you'll lead automation efforts for SaaS services, mentor junior team members, and set strategic directions for CI/CD and monitoring solutions.
Top Skills:
AI, AWS, Azure, Flux, GCP, Go, Grafana, Java, Jenkins, Kubernetes, OCI, C/C++, Prometheus, Python
Gaming • Information Technology • Mobile • Software • Esports
Seeking a Senior Site Reliability Engineer to design and operate scalable platform solutions, enhance reliability, and improve developer experience and operational efficiency across engineering teams.
Top Skills:
AWS, GCP
Cloud • Software • Database
Lead the design, build, and operation of the YugabyteDB DBaaS infrastructure. Drive architecture, automate lifecycle and maintenance, manage incidents and on-call rotations, implement security/encryption processes, and optimize reliability using SRE principles and observability.
Top Skills:
AKS, Ansible, AWS, Azure, Bash, Docker, EKS, GCP, Git, GitHub Actions, GKE, Java, Kubernetes, Linux, Postgres, Prometheus, Python, Shell, Terraform
Cloud • Software
As a Site Reliability Engineer, you will ensure system reliability, handle technical escalations, create automation tools, and collaborate with engineering teams during incidents.
Top Skills:
Ansible, Bash, Chef, Docker, ELK, Git, GitLab, Grafana, Jenkins, Linux, Prometheus, Pulumi, Python, Splunk, SVN, TCP/IP, Terraform, Unix
Healthtech • Software
Maintain reliability, performance, and scalability of cloud-hosted services and databases. Implement SRE best practices, define SLIs/SLOs, respond to incidents, build monitoring and automation, perform DBA tasks (backups, restores, tuning), support CI/CD and DB migrations, and document runbooks and procedures.
Top Skills:
Amazon RDS, Azure SQL Database, Bash, ECS Fargate, Flyway, GitLab, Jenkins, Kubernetes, Liquibase, Octopus Deploy, Oracle, Postgres, PowerShell, Python, Redis, SolarWinds DPA, SQL Server
Logistics • Software • Transportation
Lead and mentor teams in DevOps and SRE, architect scalable Azure Cloud infrastructure, implement CI/CD and IaC, ensure database reliability, and drive cross-functional collaboration.
Top Skills:
Azure Cloud, Azure DevOps, CI/CD, Cosmos DB, Docker, ELK, Grafana, Kubernetes, MySQL, Postgres, Prometheus, Redis, SQL Server, Terraform
Software
The role involves managing compute infrastructure for decentralized applications and requires critical thinking, documentation skills, and experience with Kubernetes and blockchain management.
Top Skills:
Blockchain, GitOps, Infrastructure-as-Code, Kubernetes, Programming Languages
Artificial Intelligence • Big Data • Information Technology • Security • Software
The Site Reliability Engineer ensures operational excellence for a telecommunications solution on the public cloud, handling automation, incident management, performance planning, and security collaboration.
Top Skills:
Ansible, AWS, Datadog, Docker, GCP, GitLab, Helm, Java, Jenkins, Kubernetes, NoSQL, Terraform