Top Remote Senior Site Reliability Engineer Jobs in Austin, TX
Big Data
You will manage AWS infrastructure, automate deployments, debug application issues, and improve the operational health of Metabase Cloud.
Top Skills:
AWS, Datadog, Go, Grafana, Kubernetes, Prometheus, Python, Terraform
Information Technology
The Senior Platform Product Engineer improves the infrastructure and data platform, promotes DevOps culture, and aligns technical solutions with business objectives to drive innovation.
Top Skills:
Argo Workflows, Argocd, Aws Core Services, Crossplane, Gitlab Pipelines, Go, Grafana, Java, K8S, Prometheus, Python, Terraform
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills:
AWS, Azure, CircleCI, GCP, Github Actions, Gitlab Ci, Go, Kubernetes, Terraform
Financial Services
The Senior Site Reliability Engineer will ensure infrastructure reliability, scalability, and performance, focusing on container orchestration and collaborating across teams to integrate applications. Responsibilities include system monitoring, security vulnerability management, and optimizing performance and cost.
Top Skills:
.Net, AWS, Azure, Bash, Ci/Cd, Docker, Elk Stack, GCP, Grafana, Iac, Java, Kubernetes, Powershell, Prometheus, Python
Database
The Senior Site Reliability Engineer at Niche will manage cloud infrastructure, oversee incident responses, mentor team members, and promote best practices to ensure reliability across distributed systems and applications.
Top Skills:
AWS, Bash, Docker, GCP, Git, Go, Grafana, Kafka, Kubernetes, Prometheus, Python, SQL, Sumo Logic, Terraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves improving software reliability, automating processes, collaborating with teams on system optimization, and mentoring engineers to establish reliability as a core value.
Top Skills:
AWS, Azure, Datadog, Docker, Ec2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
3D Printing • Aerospace • Hardware • Robotics • Software
Lead the reliability and scalability of BRINC's production systems, building secure cloud infrastructure and improving incident response. Collaborate with teams for optimal system performance.
Top Skills:
AWS, Infrastructure As Code, JavaScript, Node.js, Python
Cloud • Security
As a Senior Site Reliability Engineer, you'll ensure the reliability of production SaaS applications in Azure, automate operational processes, maintain compliance in a FedRAMP environment, and address incident responses.
Top Skills:
Azure, Azure Application Insights, Azure Devops, Datadog, Kubernetes, Log Analytics, Powershell, Python, Terraform
Software
As a Site Reliability Engineer at Podium, you'll ensure product stability and scalability, collaborate with engineering teams, handle on-call production issues, and mentor junior engineers.
Top Skills:
Ansible, AWS, Ci/Cd, Datadog, Docker, Git, Gitlab, Go, Helm, Honeycomb, Kubernetes, Prometheus, Python, Ruby, Strongdm, Terraform
Blockchain • Fintech • Social Media • Cryptocurrency • NFT • Web3
Design, build, and operate scalable, highly available infrastructure and platform software for Zora's blockchain services (indexer, APIs, data pipelines). Automate workflows, maintain core systems, improve developer experience, participate in on-call rotation, and contribute strategic technical direction.
Top Skills:
Asyncio, Base, Bridges, Ceph, Cloudflare Pages Functions, Datadog, Docker, Ethereum, Go, Ipfs, Kubernetes, MongoDB, Opentelemetry, Optimism, Optimistic Rollups, Plasma, Polygon, Postgres, Python, Rpc Nodes, Sidechains, Vercel, Zk-Rollups
Security • Cybersecurity
Lead the design and implementation of observability, SLO/SLA frameworks, and AI-enabled infrastructure automation. Architect scalable AWS infrastructure, improve incident management and on-call practices, and drive organization-wide adoption of telemetry and reliability standards.
Top Skills:
Ai-Assisted Tooling, AWS, Ci/Cd, Claude, Codex, Cursor, Grafana, Honeycomb, Infrastructure-As-Code, Observability, Pulumi, Supabase, Telemetry, Terraform, Vercel
Automotive
Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.
Top Skills:
AWS, Bash, Ci/Cd, Docker, Elk Stack, GCP, Grafana, Kubernetes, Prometheus, Python, Terraform
Real Estate • Software
As a Senior Site Reliability Engineer, you'll enhance system performance, reliability, and cost efficiency in a large-scale production environment, shifting manual operations to AI-assisted engineering.
Top Skills:
Ansible, Datadog, Elk, Grafana, Kubernetes, Linux, Prometheus, Python, Ruby, Terraform
Insurance
Lead reliability strategy and architecture for critical systems, drive incident management and root-cause analysis, build automation and SRE tooling, influence release/change practices and compliance, and mentor junior engineers to improve operational reliability.
Top Skills:
Angular, AWS, Ci/Cd, CloudFormation, Containerization, Java, JavaScript, Netty, Next.Js, Node.js, Non-Relational Databases, Observability (Metrics, Logs, Tracing), Orchestration, Orm, React, Relational Databases, Servicenow, Spring, Spring Boot, Tomcat
Artificial Intelligence • Information Technology • Machine Learning • Software • Cybersecurity • Generative AI • Data Privacy
Lead global SRE and infrastructure teams to ensure reliability, scalability, and cost-efficiency of production and developer platforms. Define cloud and Kubernetes architecture, IaC, CI/CD, SLOs/SLIs, incident management, and cloud cost optimization while partnering with Security, Product, Finance, and Engineering.
Top Skills:
AI, Automation, AWS, Ci/Cd, Cloud-Native Systems, GCP, Infrastructure As Code, Kubernetes, Terraform
Cloud • Security • Software • Cybersecurity
The Senior Lead Site Reliability Engineer will ensure performance and uptime of security products, develop automation pipelines, and improve monitoring systems, working closely with various teams.
Top Skills:
Azure, Databricks, Docker, Go, Jenkins, Kubernetes, Python, Terraform
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
The Senior Director of SRE leads and defines reliability and operational excellence across products, manages the SRE team, and scales reliability practices within the organization.
Top Skills:
AWS, Azure, Cloud-Native Networking, Distributed Systems, GCP, Kubernetes, Microservices, Site Reliability Engineering Principles
Big Data • Information Technology • Security • Software
The Senior Developer will drive observability roadmaps using SRE Golden Signals, establish monitoring strategies, enhance system reliability, and act as an expert in New Relic technology for performance management.
Top Skills:
Bash, Cri-O, Csh, Kubernetes, New Relic, Perl, Windows Powershell
AdTech • Marketing Tech
The Senior Software Engineer for Core Services SRE will maintain infrastructure, develop reliable systems, lead technical initiatives, and conduct security reviews.
Top Skills:
Aerospike, AWS, Boundary, Consul, Elasticsearch, Envoy, Go, Grafana, Kafka, Nginx, Nomad, Packer, Prometheus, Rds, Redis, Scylladb, Terraform, Vagrant, Vault, Waypoint
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills:
Ansible, AWS, Bash, Cloudtrail, Cloudwatch, Cosmos, Docker, Elk-Stack, Ethereum, GCP, K8S, Kubernetes, Opsgenie, Pingdom, Python, Terraform
Software
The Senior SRE Manager will establish an SRE team, implement best practices, manage incidents, and enhance system reliability, scaling operations effectively.
Top Skills:
Cloud Infrastructure, Distributed Systems, Observability
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills:
AWS, Clickhouse, Go, Java, Kafka, Kubernetes, Pulumi, Terraform
Cloud • Security • Software
As a Senior Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, ensuring software deployment through automated CI/CD pipelines, while collaborating with teams to enhance service delivery.
Top Skills:
Ci/Cd, Cloud Platforms, Docker, Go, Kubernetes
Software • Consulting
Lead production support for external web applications: manage incidents, perform root cause analysis, expand observability (Splunk/OpenTelemetry), build dashboards, collaborate with dev and platform teams, and participate in 24x7 on-call rotations to improve availability and reliability.
Top Skills:
Splunk, Opentelemetry, Appdynamics, Datadog, Aws, Kubernetes, Python, Servicenow, Mulesoft, Postman, Linux, Shell Scripting, Openshift, Azure, Gcp, Api Testing
Security • Software
Maintain, automate, and improve operational tools and customer deployment processes; monitor and ensure service SLOs, backup/restore, alerting, and incident response; drive GitOps/IaC practices, cost tracking, and automation of repetitive tasks while supporting outages and upgrades.
Top Skills:
Ansible, Terraform, Helm, Kubernetes, Aws, Gcp, Azure, Prometheus, Grafana, Bash, Python, Gitops