Senior Site Reliability Engineer
Who We Are
Bestow is a digital life insurance company built on full stack technology and AI. In a world in need of greater financial resilience and protection, Bestow democratizes access to smart financial products and powers some of the world’s leading consumer platforms. We are reimagining and rebuilding a 400-year-old, $7 trillion industry to create a brighter future for millions of families. And we’re just getting started.
The Bestow team is a diverse band of first principles thinkers on a mission to do good. We’re fortunate to be backed by leading investors and partners including Valar Ventures, NEA, 8VC and MunichRe.
As a Senior Site Reliability Engineer you will automate cloud resources, implement developer tooling, and make changes to our products to increase their manageability.
You should be able to define technical design and implement work independently.
You should have prior experience with systems administration and cloud service automation.
Bestow engineers are great teammates. You will be spending half of your time embedded with a product team. We do this so that you can get to know the specific needs of our product teams and ensure we build useful tooling. The remainder of your time will be spent with other Site Reliability Engineers working on cross-cutting projects. As such, you have exceptional written and verbal communication skills.
Do you want to build products to reinvent a centuries old industry? If so, we'd love to hear from you.
Challenges on which you can expect to work:
Cloud Infrastructure Projects
We believe that all cloud infrastructure should be treated as code. Modifications go through code review and are executed via CI tools.
Automate cloud infrastructure via tools like Terraform and Atlantis;
Administer our software platform including Kubernetes, BigQuery and PostgreSQL;
Evaluate, design and implement solutions for continuous integration, continuous delivery and DevOps productivity.
Ensure Reliability of Our Software
Great software is more than product features. It simultaneously considers non-functional concerns like security, maintainability and extensibility. You are responsible for ensuring:
Collaborate with product engineers to identify observability/configurability gaps and make code-level improvements;
Creating tooling to improve the developer experience, enabling them to deliver business value faster and more effectively;
Manage runbooks for incident response;
Measure operational performance and make gradual improvements.
A Little About You
- 4+ years of site reliability, systems administration or DevOps experience.
- Professional experience with container orchestration systems (Kubernetes, Swarm, Rancher, ECS, etc.).
- Professional experience with a cloud hosting platform (GCP preferred).
- Professional experience with Terraform Atlantis experience a plus.
- Professional experience administering a CI/CD platform GitOps experience a plus.
- Experience with Python 3 and Golang.
- Proficient in a scripting language such as Bash, Python or Ruby.
- Experience with distributed systems and microservices.
- Clear, concise written and verbal communication.
- You thrive in a highly independent work culture and are capable of working autonomously.
- You have initiative and motivation to make things happen.
- You always want to sit next to the person who is smarter than you because you value a culture of mentorship and learning.
- You are egoless, hold yourself accountable and you have a thoughtful approach to adopting new technology.
- You are looking to bring your voice and talent to a mission-driven company in ways that help it to grow and expand its reach.
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Bestow does not currently sponsor applicants for work visas.
Read Full Job Description