CloudLinux

CloudLinux

CloudLinux is a leading provider of the CloudLinux OS, a platform for Linux web hosting that offers next-level performance and security. With a focus on optimizing web hosting environments, CloudLinux helps service providers improve density, stability,...

IT Services
51-250
Founded 2009

Description

  • Design and implement a self-service DBaaS platform using Terraform and Ansible for deploying highly available PostgreSQL, ClickHouse, MongoDB, and Redis clusters.
  • Build and operate database infrastructure across bare metal, OpenNebula, Kubernetes, and public cloud environments.
  • Manage and scale large ClickHouse analytics clusters, including sharding, replication, table engine optimization, and S3 backup pipelines.
  • Maintain and scale Apache Airflow and Redash infrastructure to support reliable ETL pipelines and analytics workflows.
  • Implement SRE practices for data management, including automated self-healing and defined SLO/SLI for databases.
  • Lead migration from legacy database solutions to modern cloud-native patterns and help evaluate Kubernetes operators for stateful workloads.
  • Serve as a technical authority for product teams on data schema design and SQL query optimization for high-load systems.
  • Collaborate with infrastructure and analytics teams to improve reliability, observability, and performance across the data platform.
  • Automate infrastructure and operational tasks with code to reduce manual intervention and repeat work.

Requirements

  • 5+ years of deep PostgreSQL experience, including MVCC internals, locking mechanics, Patroni, PgBouncer, and major version upgrades under load.
  • Proven experience operating large ClickHouse clusters, including ZooKeeper or ClickHouse Keeper, sharding, replication internals, and performance troubleshooting.
  • Strong Terraform and Ansible experience, including writing complex modules and roles.
  • Programming experience in Python or Go for infrastructure and automation is a major plus.
  • Experience working in hybrid environments across bare metal, Kubernetes, and cloud platforms.
  • Understanding of database performance tuning, including NVMe and network storage optimization.
  • Systems-level thinking across networking, infrastructure, and application logic.
  • Knowledge of security and disaster recovery practices, including FIPS and audit logs.
  • Preferred experience building an Internal Developer Platform (IDP).
  • Preferred experience operating databases in Kubernetes using CloudNativePG or Altinity Operator.
  • Preferred experience working for cloud or hosting providers in similar service environments.

Benefits

  • Fully remote work with flexible working hours and the ability to work from anywhere worldwide.
  • Paid 24 days of vacation per year.
  • 10 days of national holidays.
  • Unlimited sick leave.
  • Private medical insurance coverage.
  • Co-working and gym/sports reimbursement.
  • Budget for education, training, and conferences.
  • Opportunity to be rewarded for innovative ideas that the company can patent.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

Alpaca 51-250 Capital Markets

Alpaca is hiring a Site Reliability Engineer to keep its brokerage platform reliable and operable across cloud, Kubernetes, observability, messaging, and database systems, with a strong focus on PostgreSQL reliability on the trading-critical path.

DNS GitOps Go Kafka Kubernetes Linux Load Balancing PostgreSQL Python RabbitMQ Secrets Management TLS
50 minutes ago

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform
4 hours, 50 minutes ago

Clinical Data Associate II

Precision Medicine Group 251-1K Pharmaceuticals

The Clinical Data Associate II at Precision Medicine Group supports clinical trial data management activities from study start-up through database lock for assigned projects.

5 hours, 50 minutes ago

Database Administrator - Cloud Platform / Infrastructure

3Cloud 251-1K Internet Software & Services

3Cloud is seeking an experienced Database Administrator to support multiple customer database migration and Azure data services projects across development, test, and production environments.

Azure Oracle SQL Server Terraform
8 hours, 25 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers