SMX Logo

SMX

Site Reliability Engineer (5009) (US, DC, Tampa, San Antonio) (Secret)

Reposted 11 Days Ago
In-Office or Remote
4 Locations
124K-206K Annually
Senior level
In-Office or Remote
4 Locations
124K-206K Annually
Senior level
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
The summary above was generated by AI
 
The SMX Space and Intelligence (S&I) Business Unit (BU) is on the ground floor across the future remote sensing ecosystem for all orbital regimes (LEO, MEO, HEO, and GEO). We build, integrate, and operationally support our customer's emerging space-ground systems to include real-time data processing frameworks, sensor data processing, and data visualization. We are teamed with the most passionate companies in industry, dedicated to bringing best-of-breed capabilities to address our customers most pressing needs. 
 
We have an immediate opportunity for a Site Reliability Engineer who is excited to apply their talents to our customer's challenging project and one who will thrive in a collaborative environment. The ideal candidate is a recognized professional with hands-on expertise and an excellent understanding of engineering processes supporting large, National Defense, Agile software development programs.
 
Work will be performed in Colorado. Please apply to this role if you have interest in relocating to Colorado.

Essential Duties & Responsibilities

  • Support the availability, reliability, and performance of IaaS services supporting mission systems
  • Monitor infrastructure health using metrics, logs, and alerts; respond to and resolve incidents
  • Perform root-cause analysis for infrastructure and service outages; implement corrective and preventative actions Improve system reliability through automation, standardization, and proactive engineering
  • Support capacity planning, performance analysis, and scaling of infrastructure services
  • Maintain and enhance monitoring, logging, and alerting solutions
  • Participate in incident response, on-call rotations (as required), and post-incident reviews
  • Collaborate with network, systems, platform, and application teams to resolve cross-stack issues
  • Support infrastructure lifecycle activities including upgrades, patches, and configuration changes
  • Apply security best practices and support compliance requirements in a regulated environment
  • Develop and maintain runbooks, procedures, and operational documentation
  • Contribute to CI/CD and Infrastructure-as-Code workflows supporting IaaS services
  • Participate in Agile ceremonies and operational planning activities
  • Perform other duties as assigned

Required Skills & Experience 

  • Secret clearance
  • 5+ years of professional experience in systems engineering, SRE, DevOps, or infrastructure operations
  • Strong experience administering Linux systems
  • Experience supporting on-prem, cloud, or hybrid infrastructure environments 
  • Hands-on experience with monitoring, logging, and alerting systems
  • Strong troubleshooting skills across compute, storage, networking, and OS layers
  • Experience scripting or automating tasks using Bash, Python, or similar languages
  • Familiarity with Infrastructure as Code concepts and tooling
  • Strong verbal and written communication skills
  • Detail-oriented, self-motivated, and able to own issues through resolution
  • Ability to work on-site at the customer location

Desired Skills & Experience 

  • Experience working on an IaaS or platform operations team
  • Experience with virtualization platforms (e.g., VMware vSphere)
  • Experience supporting container platforms (Kubernetes, OpenShift) 
  • Experience with cloud environments (AWS, Azure, or GovCloud)
  • Familiarity with SRE concepts such as SLIs, SLOs, error budgets, and toil reduction
  • Experience with configuration management or automation tools (Ansible, Terraform)
  • Experience with CI/CD pipelines (GitLab CI, Jenkins, or similar)
  • Experience operating systems in government or secure environments
  • Experience with incident management and operational readiness reviews

Application Deadline:  March 30, 2026

#CJPost

#LI-onsite



The SMX salary determination process takes into account a number of factors, including but not limited to, geographic location, Federal Government contract labor categories, relevant prior work experience, specific skills, education and certifications. At SMX, one of our Core Values is to Invest in Our People so we offer a competitive mix of compensation, learning & development opportunities, and benefits. Some key components of our robust benefits include health insurance, paid leave, and retirement.

The proposed salary for this position is:
$123,900$206,400 USD

At SMX®, we are a team of technical and domain experts dedicated to enabling your mission. From priority national security initiatives for the DoD to highly assured and compliant solutions for healthcare, we understand that digital transformation is key to your future success.

We share your vision for the future and strive to accelerate your impact on the world. We bring both cutting edge technology and an expansive view of what’s possible to every engagement. Our delivery model and unique approaches harness our deep technical and domain knowledge, providing forward-looking insights and practical solutions to power secure mission acceleration.

SMX is an Equal Opportunity employer including disabilities and veterans.

Selected applicant may be subject to a background investigation and/or education verification.

SMX does not sponsor a new applicant for employment authorization or immigration related support for this position (i.e. H1B, F-1 OPT, F-1 STEM OPT, F-1 CPT, J-1, TN, E-2, E-3, L-1 and O-1, or any EADs or other forms of work authorization that require immigration support from an employer).

Top Skills

Ansible
AWS
Azure
Bash
Gitlab Ci
Jenkins
Kubernetes
Linux
Openshift
Python
Terraform
Vmware Vsphere

Similar Jobs

2 Days Ago
Remote
USA
388K-558K Annually
Senior level
388K-558K Annually
Senior level
News + Entertainment
The Site Reliability Engineer will design and maintain infrastructure, improve software reliability, manage incidents, and promote engineering best practices across Netflix.
Top Skills: AWSAzureGCPGoJavaKubernetesPythonTerraform
6 Days Ago
In-Office or Remote
3 Locations
230K-330K Annually
Senior level
230K-330K Annually
Senior level
Travel
The Senior Site Reliability Engineer will automate and optimize infrastructure on Google Cloud, improve cost efficiency, and support on-call incidents, working closely with the engineering teams.
Top Skills: BashContainersDatadogGCPHelmIstioKubernetesKustomizePythonSQL
10 Days Ago
Remote
USA
Senior level
Senior level
Database
The Senior Site Reliability Engineer at Niche will manage cloud infrastructure, oversee incident responses, mentor team members, and promote best practices to ensure reliability across distributed systems and applications.
Top Skills: AWSBashDockerGCPGitGoGrafanaKafkaKubernetesPrometheusPythonSQLSumo LogicTerraform

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account