Site Reliability Engineer
Who We Are
Overhaul is a supply chain integrity solutions company that allows shippers to connect disparate sources of data into the first fully transparent situational analysis engine designed for the logistics industry. Data that is transformed into critical insights can instantly trigger corrective actions, impacting everything from temperature control to handling requirements or package-level tracking, ensuring cargo arrives at its destination safely, undamaged, and on time. We are a dynamic, innovative, and fun team who is highly committed to our customers’ experiences and our Mission and Vision.
The Role
At Overhaul, we’re building the future of supply chain monitoring technology. As a Site Reliability Engineer, you’ll be tasked with creating and supporting a stable, scalable cloud platform capable of handling the large amounts of growth we’ve experienced and continue to expect.
You’ll work with Kubernetes clusters and corresponding technologies on AWS and Azure, expand our monitoring and response capabilities with Datadog, and help continue to automate our infrastructure and deploy processes with tools like Helm, Terraform, and ArgoCD. You’ll be working with a fantastic team of developers and cloud architecture experts from all around the globe.
Responsibilities:
- Enhancing our Kubernetes-based platform to allow for growth while keeping high uptime guarantees
- Increase monitoring capabilities of our platform using Datadog and other technologies
- Work to implement full continuous integration and continuous delivery across Overhaul’s suite of services
- Support development improvements by deploying new internal software tools
- Develop software tools and scripts to help automate platform processes
- Support Overhaul Services as part of an on-call rotation during waking hours
Required Skills and Qualifications:
- Deep understanding of distributed systems, with a focus on container-based solutions such as Kubernetes or Docker Swarm
- Modern public cloud experience – we use both AWS and Azure
- Experience with developing and implementing CI/CD patterns in a software team environment
- Excellent written and oral communication skills
Preferred Qualifications:
- Software Engineering experience beyond writing simple scripts, preferably in Python
- Hands-on experience designing, implementing, and maintaining infrastructure at scale using infrastructure-as-code
Our Core Values and how they benefit you as an “Overhauler”
Authenticity, Receptivity and Trust
· Extremely competitive base salary package
· 401(k) with Overhaul match
· Flexible working schedules
· Remote, hybrid, and/or In-office*
Encouragement and Learning
· Progressive advancement opportunity & career mobility
· Paid development personal stipend
· Monthly lunch and learns
· 2 Unique learning systems w/Instructor led content
Wellness and Integrity
· Rotating Overhaul “Perks @ work” (Discounts and Freebies)
· Overhaul fully provided healthcare plan
· Employee assistance & wellbeing programs
· New Parent/Family/Caregiver leave(s)
· Daily BAMM time (body and mind movement)
· Life by design vacation policy
Diversity and Inclusivity Statement:
Overhaul has always been, and always will be, committed to diversity and inclusion. Our Overhaul Culture Code’s top listed commitment is to “Diversity and Synergy.” All aspects of employment will be based on merit, competence, performance, and business needs. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, veteran status, or any other status protected under federal, state, or local law. We strongly encourage people from underrepresented groups to apply!
#BI-Remote