Orion Innovation Logo

Orion Innovation

Senior Site Reliability Engineer

Reposted 6 Days Ago
Be an Early Applicant
In-Office
2 Locations
8-8 Annually
Senior level
In-Office
2 Locations
8-8 Annually
Senior level
The Senior Site Reliability Engineer ensures systems reliability, scalability, and performance for classified government projects, focusing on DevOps methodologies and coding expertise.
The summary above was generated by AI

Orion Innovation is a premier, award-winning, global business and technology services firm.  Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity.  We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.

Role: Senior Site Reliability Engineer (SRE)

Type: Remote - working EST Hours

Clearance Requirement: Must be eligible for up to a Top-Secret Security Clearance.

Job Summary

The Senior Site Reliability Engineer (SRE) will play a critical, hands-on role in ensuring the reliability, scalability, and performance of systems supporting highly classified government projects within an air-gapped deployment. This position demands expertise in both DevOps methodologies and deep coding skills to maintain up-time, resilience, and stringent compliance in a secure, disconnected environment.

Key Responsibilities
  • Develop robust automation, configuration management, and toolsets primarily using Ruby and Shell Scripting (CLI/PowerShell) to manage infrastructure and deployment pipelines (Git/Infrastructure Automation).
  • Implement and manage advanced observability solutions with Grafana and Prometheus, along with Splunk and Elastic, to monitor system health and proactively identify issues in an air-gapped setting.
  • Collaborate closely with the Lead, Infrastructure, and Security Specialists to rapidly resolve incidents and significantly improve overall system resilience.
  • Create and maintain comprehensive documentation for system configurations, runbooks, and disaster recovery procedures tailored for a classified environment.
  • Contribute an intermediate level of proficiency in Go to team projects and codebase.
Must-Have Requirements
  • 8+ years of experience in DevOps OR SRE using Ruby for writing robust automation and tooling.
  • Observability and monitoring with Grafana and Prometheus.
  • CLI tools including Shell Scripting and/or PowerShell for operational tasks.
  • Deploying Kubernetes in production environments.
  • Git and various Infrastructure Automation tools.
  • Deep administrative experience with Linux operating systems.
Nice-to-Have Requirements
  • Experience with Go programming language.
  • Prior experience in government or defense-related SRE roles.
  • Experience with Python for scripting and data analysis.
  • Familiarity with packaging and deployment using Helm.
  • Knowledge of network security protocols, specifically IPSec

Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.

Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.


Top Skills

Elastic
Git
Go
Grafana
Helm
Kubernetes
Linux
Powershell
Prometheus
Python
Ruby
Shell Scripting
Splunk

Similar Jobs

11 Days Ago
Hybrid
Toronto, ON, CAN
90K-133K Annually
Mid level
90K-133K Annually
Mid level
Enterprise Web • Fintech • Financial Services
The Senior Site Reliability Engineer will enhance system reliability, lead automation projects, and optimize cloud solutions in a collaborative environment.
Top Skills: Ci/CdCloud-Based SolutionsCloudFormationContainersDevOpsDistributed ApplicationsDockerInfrastructure As CodeMicroservicesPlsqlServerless TechnologySQLTerraform
10 Hours Ago
Easy Apply
In-Office or Remote
7 Locations
Easy Apply
170K-230K Annually
Senior level
170K-230K Annually
Senior level
Artificial Intelligence • Cloud • Information Technology • Software
The Senior Site Reliability Engineer is responsible for managing AI infrastructure, ensuring reliability through scalability, incident response, and collaboration with suppliers, focusing on Kubernetes and advanced GPU services.
Top Skills: AnsibleBashGrafanaKubernetesPrometheusPython
25 Days Ago
In-Office or Remote
Toronto, ON, CAN
Senior level
Senior level
Insurance
The Senior Site Reliability Engineer at Zensurance will focus on enhancing production systems' reliability, scalability, and performance through automation, best practices, and incident management, while mentoring junior engineers.
Top Skills: AWSDatadogElk StackGithub ActionsGrafanaKubernetesPrometheusSplunkTerraformTypescript

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account