Maximum of 25 job preferences reached.
Top Senior Site Reliability Engineer Jobs in Austin, TX
Software
Join the SRE team to improve monitoring, alerting, observability, and reliability of Fireblocks' production systems. Triage incidents, run RCA, create runbooks and automation (Python, Lambda, shell, Ansible, ArgoCD), collaborate with R&D/support, and participate in on-call rotation.
Top Skills:
AnsibleArgocdAWSAws LambdaAzureBashBitbucketC++ChefCoralogixDatadogDockerGerritGitGitlabGCPHelmJavaScriptKubernetesLinuxMySQLNew RelicNginxNode.jsPhabricatorPrometheusPuppetPythonShellSplunk
Big Data • Cloud • Information Technology
The Site Reliability Engineer at Iron Mountain will troubleshoot escalated tickets, manage Windows Server builds, perform security patching, and collaborate with customers and vendors to resolve issues and maintain systems.
Top Skills:
CloudComputeHyper-Converged InfrastructureLinuxMicrosoft Endpoint Configuration ManagerNetworkNutanixPowershellRubrikStorageVirtualizationWindows Server
Information Technology • Software • Cybersecurity • Automation
Design, build, and operate an agentic platform to automate vulnerability remediation and incident response while ensuring reliability in security operations.
Top Skills:
DatadogGitGrafanaLinearLlmsOpentelemetryPrometheusSlack
Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
The Senior Site Reliability Engineer will enhance reliability in production SaaS systems, implement AI agents, improve observability, and mentor junior engineers.
Top Skills:
.NetAksAWSAzureBashC#DatadogEksGCPGoGrafanaKubernetesLinuxOpentelemetryPrometheusPythonTerraform
Cannabis • Consumer Web • eCommerce • Software
As a Site Reliability Engineer, you will enhance the performance and reliability of web services, collaborate on best practices for monitoring and CI/CD, troubleshoot deployment issues, and drive DevOps culture.
Top Skills:
AWSCloudwatchDatadogDockerElixirGitGoGrafanaKubernetesNode.jsOpen CensusOpen MetricsOpen TracingPrometheusPythonRubyTerraform
Artificial Intelligence • Insurance • Software • Automation
The Staff Site Reliability Engineer will build and scale infrastructure for Assured's platform, automate delivery, enhance observability, and lead mentoring initiatives.
Top Skills:
AWSKubernetesPostgresTerraform
Software
Lead SRE to define SRE strategy, architecture, and roadmap; design and operate containerized, compliant cloud environments; build observability, incident management, automation, and developer platform capabilities; mentor SRE team and collaborate with security, compliance, and product teams to ensure reliability at scale.
Top Skills:
AWSAws MarketplaceAzureAzure MarketplaceGCPGoogle Cloud MarketplaceGrafanaKubernetesPrometheusTerraform
Healthtech • Other • Software
The role involves managing PostgreSQL services, ensuring high availability and performance, driving incident response, automating tasks, and improving observability for a 24x7 SaaS platform.
Top Skills:
AnsibleBashDatadogGrafanaHaproxyNew RelicPgbackrestPgbouncerPostgresPowershellPrometheusPythonRepmgrTerraform
Software • Analytics
The role involves automating and managing AWS infrastructure, ensuring reliability and scalability of stateful systems, and optimizing deployment processes. You'll also handle incident responses and improve operational tooling.
Top Skills:
AWSKubernetesTerraformTerragrunt
Big Data • Machine Learning • Software • Analytics
As a Lead Site Reliability Engineer, you will drive the reliability strategy, improve system health, lead incident management, and mentor engineers for a multi-region SaaS platform.
Top Skills:
ArgocdC++Ci/CdCloud PlatformsDatadogGitopsGrafanaInfrastructure As CodeJavaJavaScriptKubernetesPython
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
The SRE will maintain infrastructure for SaaS products on AWS, support developers, manage platform components, and handle IT tasks.
Top Skills:
AWSComputer VisionIacLarge Language ModelsNlpTerraform
Artificial Intelligence • Other • Sales • Software
The role involves designing and advancing infrastructure for the engineering team, ensuring the reliability of Kubernetes clusters, automating operations, and building machine learning infrastructure.
Top Skills:
ArgoAWSAzureCloudFormationFluxGithub ActionsGoGCPKubernetesPostgresPythonTerraform
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Cloud • Information Technology
The Site Reliability Engineer I is responsible for supporting Backblaze’s infrastructure stability by addressing customer issues, monitoring system health, and improving operational processes through documentation and automation.
Top Skills:
AnsibleLinuxZabbix
Artificial Intelligence • Healthtech • Information Technology • Software
As the first Site Reliability Engineer in the US, you'll ensure platform stability and oversee incident responses during PST hours, bridging infrastructure and code, while improving operability and compliance in a medical-device environment.
Top Skills:
AWSElixirKubernetesTerraform
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Lead the design and operation of large scale Kubernetes clusters, ensuring high availability and performance while supporting system lifecycle and reliability improvements.
Top Skills:
ContainersGoKubernetesLinuxNetworkingOpenstackPerlPythonRuby
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills:
AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills:
Aws EksDatadogGithub ActionsGoIstioK6KubernetesNode.jsTerraform
Software
As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.
Top Skills:
AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Edtech
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.
Top Skills:
ActivejobAnsibleAWSAws CloudwatchEc2EcsElasticsearchGitGCPGoogle Cloud StackdriverJenkinsJIRAKubernetesMemcachedMongoDBNew RelicNode.jsPostgresRedisRuby On RailsSidekiqSpinnakerTerraformTerragrunt
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, improve monitoring and logging, optimize database infrastructure, and collaborate on scaling systems efficiently.
Top Skills:
AWSClickhouseKubernetesMySQLPostgresRedis
Healthtech
Develop and implement processes to ensure high availability and reliability of services. Responsibilities include incident management, automation, capacity planning, and risk mitigation.
Top Skills:
AWSAzureDatadogDockerGrafanaJavaScriptNew RelicPrometheusPythonRubySplunkTerraform
Internet of Things • Cybersecurity
The Site Reliability Engineer will manage AWS GovCloud infrastructure, ensuring compliance and high availability while driving automation, security, and incident response best practices.
Top Skills:
AnsibleAws GovcloudBashDockerElk StackGitlab Ci/CdGrafanaJenkinsKubernetesPrometheusPythonTerraform
Financial Services
The Senior Site Reliability Engineer will design, build, and maintain scalable infrastructure, manage Kubernetes and cloud services, and ensure system reliability.
Top Skills:
ArgocdAWSGCPGoGrafanaHelmInfrastructure-As-CodeKubernetesLinuxPythonSplunkTerraform
Financial Services
Own reliability and scalability of on-prem observability platforms (ELK, Grafana); handle production escalations, capacity planning, SLOs, onboarding, automation, IaC (Terraform/Helm/Ansible), upgrades, security hardening, and platform modernization.
Top Skills:
AnsibleApm InstrumentationBashBeatsChefElasticsearchElk StackFluent BitFluentdGrafanaHelmKibanaLinuxLogstashNew RelicOpentelemetryPrometheusPuppetPythonShell Scripting/Linux ShellSolarwindsTerraform
Healthtech • Software
The SRE Technical Project Manager will lead project delivery, incident management, automation processes, and uptime communication, partnering with SRE and development teams to ensure system stability and scalability.
Top Skills:
Ai BotsDatadogJIRAJira Service ManagementMs TeamsOpsgeniePagerduty
Let Your Resume Do The Work
Upload your resume to be matched with jobs you're a great fit for.
Success! We'll use this to further personalize your experience.
Top Austin, TX Companies Hiring Senior Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs in Austin
.NET Developer Jobs in Austin
Android Developer Jobs in Austin
C# Jobs in Austin
C++ Jobs in Austin
DevOps Jobs in Austin
Engineering Manager Jobs in Austin
Front-End Developer Jobs in Austin
Golang Jobs in Austin
Hardware Engineer Jobs in Austin
iOS Developer Jobs in Austin
Java Developer Jobs in Austin
Javascript Jobs in Austin
Linux Jobs in Austin
Perl Jobs in Austin
PHP Developer Jobs in Austin
Python Jobs in Austin
QA Engineer Jobs in Austin
Ruby Jobs in Austin
Sales Engineer Jobs in Austin
Salesforce Developer Jobs in Austin
Scala Jobs in Austin
Backend Engineer Jobs in Austin
Devops Engineer Jobs in Austin
Engineering Jobs in Austin
Field Engineer Jobs in Austin
Full-Stack Engineer Jobs in Austin
Infrastructure Engineer Jobs in Austin
Principal Software Engineer Jobs in Austin
Senior Android Engineer Jobs in Austin
Senior Front-End Engineer Jobs in Austin
Senior Full-Stack Engineer Jobs in Austin
Senior Ios Engineer Jobs in Austin
Senior Site Reliability Engineer Jobs in Austin
Senior Systems Engineer Jobs in Austin
Software Engineering Manager Jobs in Austin
Software Test Engineer Jobs in Austin
Solutions Architect Jobs in Austin
Solutions Engineer Jobs in Austin
Staff Software Engineer Jobs in Austin
Systems Engineer Jobs in Austin
Web Developer Jobs in Austin
All Filters
Total selected ()
No Results
No Results


































