Site Reliability Engineer
Who We Are
Bestow is a digital life insurance company built on full-stack technology and AI. In a world in need of greater financial resilience and protection, Bestow democratizes access to smart financial products and powers some of the world’s leading consumer platforms. We are reimagining and rebuilding a 400-year-old, $7 trillion industry to create a brighter future for millions of families. And we’re just getting started.
The Bestow team is a diverse band of first-principles thinkers on a mission to do good. We’re fortunate to be backed by leading investors and partners including Valar Ventures, NEA, 8VC and MunichRe.
Location: Open to remote
As a Site Reliability Engineer you will automate cloud resources, implement developer tooling, and make changes to our products to increase their manageability.
You should be able to define technical designs and implement them independently.
You should have prior experience with systems administration and cloud service automation.
Bestow engineers are great teammates. You will spend half of your time embedded with a product team. We do this so that you can get to know the specific needs of our product teams and ensure we build useful tooling. The remainder of your time will be spent with other Site Reliability Engineers working on cross-cutting projects. Because the role spans multiple teams, you should have exceptional written and verbal communication skills.
Do you want to build products to reinvent a centuries-old industry? If so, we'd love to hear from you.
Challenges on which you can expect to work:
Cloud Infrastructure Projects
We believe that all cloud infrastructure should be treated as code. Modifications go through code review and are executed via CI tools.
Automate cloud infrastructure via tools like Terraform and Atlantis;
Administer our software platform including Kubernetes, BigQuery and PostgreSQL;
Evaluate, design and implement solutions for continuous integration, continuous delivery and DevOps productivity.
Ensure Reliability of Our Software
Great software is more than product features. It simultaneously considers non-functional concerns like security, maintainability and extensibility. To ensure this, you will:
Collaborate with product engineers to identify observability/configurability gaps and make code-level improvements;
Create tooling to improve the developer experience, enabling engineers to deliver business value faster and more effectively;
Manage runbooks for incident response;
Measure operational performance and make gradual improvements.
A Little About You:
- 4+ years of site reliability, systems administration or DevOps experience
- Professional experience with container orchestration systems (Kubernetes, Swarm, Rancher, ECS, etc.)
- Professional experience with a cloud hosting platform (GCP preferred)
- Professional experience with Terraform; Atlantis experience a plus
- Professional experience administering a CI/CD platform; GitOps experience a plus
- Experience with Python 3 and Golang
- Proficient in a scripting language such as Bash, Python or Ruby
- Experience with distributed systems and microservices
- Clear, concise written and verbal communication
- A desire and willingness to learn
- Initiative and motivation to make things happen
What We Can Offer You:
- Competitive salary
- Generous PTO
- Flexible schedule and work/life balance
- 100% company-paid health, dental, and vision insurance
- Choose your own computer setup (Mac or PC)
- Office snacks and weekly team lunches
- Team building events and activities
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Bestow does not currently sponsor applicants for work visas.