Senior Site Reliability Engineer
Senior Site Reliability Engineer
Why YOU want this position
Drillinginfo is now Enverus! Since our founding as a groundbreaking provider of oil & gas data, we have evolved our solutions to cover oil & gas analytics, trading & risk, and business automation for customers across the energy industry. Enverus represents this growth, while bringing us closer together as one team. Enverus delivers business-critical insights to the global energy industry through a state-of-the-art SaaS platform built on industry-leading data and energy analytics. Our solutions deliver value across the entire energy value chain, empowering customers to be more agile, efficient and competitive. The range of energy industry participants we serve includes exploration and production (E&P) companies and related businesses such as oilfield services, midstream, capital markets, power generators and utilities, energy traders, and downstream commercial & industrial energy consumers.
We are currently seeking a highly driven Senior Site Reliability to join our Cloud Engineering team in Austin, TX. This role offers the opportunity to join a rapidly growing company delivering industry-leading solutions to customers in the world’s most dynamic and fastest growing sector. Enverus is the right company at the right time.
Performance Objectives
- Work on a team that manages our entire global AWS presence
- Your team will be responsible for keeping our infrastructure humming as new releases and maintenance updates are rolled out
- You will help organize, secure, and automate existing infrastructure and deployments
- You will work closely with developers to provide feedback and drive operational improvements within our products and operations infrastructure
- You will be responsible for ensuring that our platform is stable and balanced
- Maintain high site up time, while embracing rapid change and growth
- Scale infrastructure to meet increasing demand and evolving technology
- Help the dev teams working on our code bases realize zero down-time deployments
- Develop and improve operational practices and procedures
- Implement, monitor, and maintain CI/CD frameworks
- You will coordinate and participate in on-call rotations
- Automate, automate, automate
Competitive Candidate Profile
- You have excellent communication and collaboration skills
- You demonstrate the ability to succeed in a high-pressure environment with rapidly changing priorities
- You are an excellent problem solver, and willing to roll up your sleeves to take on any issue thrown your way
- You have a desire not just to resolve problems, but to fully understand them and prevent them in the future
- 5+ years of professional Windows and Linux server administration
- 3+ years of Amazon Web Services (AWS) administration
- 2+ years of experience within a high-performance, 24x7, DevOps, SysOps, or Operations team
- You seek out opportunities to improve, fix bugs, and challenge assumptions
- You have experience working with global teams (North America, Europe, Asia)
- You have experience with the following technologies:
- Docker
- Container Orchestration (Nomad, Kubernetes, ECS)
- Configuration Management tools (Chef, Puppet, Ansible)
- Infrastructure as Code (Terraform, Cloudformation)
- C#, Golang, or Python programming experience is a plus
- You prefer to lead the charge, not just keep up with it