Cloudlinux Logo

Cloudlinux

Senior Database Reliability Engineer (DBRE) (worldwide remote)

Posted Yesterday
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Tbilisi
Senior level
In-Office or Remote
Hiring Remotely in Tbilisi
Senior level
As a Senior Database Reliability Engineer, you'll ensure the reliability of PostgreSQL and other databases, automate workflows, and support engineering teams in production environments.
The summary above was generated by AI

CloudLinux / TuxCare is a remote-first infrastructure and security company. More than 300 engineers build and operate products used by hosting providers, enterprises, and internal service teams worldwide. Our Infrastructure Department runs the platforms behind CloudLinux OS, Imunify, KernelCare, TuxCare ELS, and our engineering systems.

We are hiring a Senior Database Reliability Engineer to join the Infrastructure DBA cell. This is a hands-on production ownership role, not a narrow ticket-processing DBA position. You will keep critical database services reliable, automate repeated work, support engineering teams, and reduce single-person dependency in our PostgreSQL, ClickHouse, MongoDB, and Redis operations.

PostgreSQL is the main requirement. ClickHouse experience is a strong plus, but it is not a day-one blocker. We need a senior engineer with enough database, Linux, automation, and incident-response depth to learn our ClickHouse environment quickly and operate it safely.

Your Responsibilities:

  • Own production PostgreSQL reliability: HA design, Patroni, PgBouncer, replication, failover, upgrades, vacuum/bloat control, query tuning, locks, indexes, capacity, backups, PITR, and restore validation.
  • Improve disaster recovery and operational evidence: tested restores, documented recovery paths, measurable RTO/RPO targets, runbooks, and safe maintenance plans.
  • Support the wider database estate: ClickHouse, MongoDB, and Redis. You will troubleshoot incidents, review access and data-safety changes, improve monitoring, and learn the production ClickHouse patterns already in use.
  • Automate DBA workflows with Ansible, Terraform/OpenTofu, GitLab CI/CD, scripts, and reproducible runbooks for provisioning, grants, backups, restores, health checks, and ownership metadata.
  • Help build DBaaS-style self-service capabilities so engineering teams can request databases, access, credentials, and operational checks with less manual DBA intervention.
  • Improve observability and incident response through Grafana, metrics, logs, SLOs, alert rules, Opsgenie routing, and clear communication during production issues.

What Success Looks Like:

  • PostgreSQL clusters have tested backup and restore paths, useful dashboards, clear ownership, and documented failover procedures.
  • Repeated DBA tickets become automation or self-service workflows.
  • ClickHouse operational knowledge is no longer a single-person dependency.
  • Database incidents have owners, runbooks, evidence, and measurable recovery paths.
  • Product and engineering teams get database help faster without sacrificing safety, auditability, or reliability.

Why CloudLinux?

  • You will work on real production infrastructure used across CloudLinux and TuxCare products.
  • You will have a direct impact on reliability, incident response, developer experience, and operational resilience.
  • You will also work in an AI-assisted engineering culture where automation, documentation, Claude, Codex, and careful human verification are part of the daily operating model.

Requirements

What We Expect From You:

  • Deep hands-on PostgreSQL experience in business-critical production environments, typically 5+ years or equivalent depth.
  • Strong understanding of PostgreSQL internals and operations: MVCC, WAL, transactions, locks, indexes, query planning, replication, autovacuum, bloat, major upgrades, backups, PITR, and restore testing.
  • Proven experience with highly available databases and the ability to reason about quorum, split-brain risk, failover, rollback, and recovery.
  • Strong Linux and infrastructure fundamentals: systemd, networking, storage, filesystems, CPU/memory/disk bottlenecks, TLS, DNS, firewalls, and root-cause troubleshooting.
  • Automation skills with Ansible and scripting. Terraform/OpenTofu, GitLab CI/CD, and merge-request based delivery are strong advantages.
  • Ability to support more than one database engine. You do not need to be a ClickHouse expert on day one, but you must be ready to learn it quickly and take responsibility for it.
  • Practical use of AI engineering assistants such as Claude and Codex. We expect you to use them to improve speed and quality, while personally verifying generated SQL, commands, scripts, and operational conclusions.
  • Clear written English for asynchronous work in Jira, Slack, GitLab, Slite, and runbooks.

Nice to Have:

  • ClickHouse operations: replication, Keeper/ZooKeeper, MergeTree engines, distributed DDL, grants, row policies, backups, query troubleshooting, and cluster recovery.
  • MongoDB replica sets and Percona Backup for MongoDB.
  • Redis/Sentinel and broker/cache failure modes.
  • Database observability, SLOs, golden signals, alert tuning, and executable incident runbooks.
  • Building internal platforms, self-service portals, or DBaaS workflows for engineering teams.

Benefits

What's in it for you?

  • A focus on professional development.
  • Interesting and challenging projects.
  • Fully remote work with flexible working hours, which allows you to schedule your day and work from any location worldwide.
  • Paid 24 days of vacation per year, 10 days of national holidays, and unlimited sick leaves.
  • Compensation for private medical insurance.
  • Co-working and gym/sports reimbursement.
  • Budget for education.
  • The opportunity to receive a reward for the most innovative idea that the company can patent.

By applying for this position, you agree with CloudLinux Privacy Policy and give us your consent to maintain and process your personal data with this respect. Please read our Privacy Policy for more information.

Similar Jobs

24 Days Ago
In-Office or Remote
Senior level
Senior level
Software
Lead the evolution of CloudLinux's data platform, focusing on automated DBaaS architecture, data analytics support, and implementing SRE practices.
Top Skills: AnsibleApache AirflowAWSAzureClickhouseGerritGitlabGoGCPGrafanaJenkinsKafkaKubernetesLokiMongoDBOpennebulaPostgresPythonRedashRedisTerraformVictoriametrics
8 Hours Ago
In-Office or Remote
Junior
Junior
Fintech • Software • Financial Services
The Treasurer will manage company funds, oversee payments, maintain accounts, and communicate with financial institutions, while ensuring compliance and documentation accuracy.
Top Skills: Banking OperationsFinancial InstitutionsPayment Processing
8 Hours Ago
Remote or Hybrid
Mid level
Mid level
Other
The role involves developing and executing test cases for a mobile calling app, collaborating with teams, and performing post-release testing.
Top Skills: Android StudioAppiumCharles ProxyCi/CdGithub ActionsJavaJenkinsJmeterK6PostmanRest ApisRestassuredSeleniumSoap ApisSwaggerXcode

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account