Principal Database Reliability Engineer
2 weeks ago
Form Cognite’s DBRE team, owning the full cluster lifecycle of all of our PostgreSQL, Elasticsearch or Kafka clusters. (We plan one sub-team per technology).
… on both public clouds and on private Kubernetes deployments.
Establish robust reliability engineering to support these clusters, managing aspects like monitoring, chaos testing, alerting, on-call rotations, internal best-practices education, and capacity forecasting.
Enable product teams to focus on using the databases, and not on running them – but deeply engage them to make sure the products are operable at scale.