Senior Cloud Performance Engineer

14 hours, 40 minutes ago
Full-time
Senior
DevOps and Infrastructure
ClickHouse

ClickHouse

ClickHouse provides a fast open source column-oriented database management system that enables users to generate real-time analytical data reports through SQL queries, catering to the needs of industries requiring efficient data processing and analysis.

IT Services
51-250
Founded 2021
$300M raised

Description

  • Benchmark system and database performance, including capacity sizing and optimization.
  • Troubleshoot and debug application and server errors, logs, and related issues.
  • Recommend configuration tuning and other optimizations for performance bottlenecks.
  • Work closely with the core development, cloud, and security teams to improve ClickHouse Cloud performance.
  • Plan, enable, and drive chaos engineering initiatives across engineering teams.
  • Develop, deploy, and manage tools to run chaos experiments and measure their impact.
  • Extend backend systems to support chaos engineering techniques.
  • Observe running systems and identify ways to disrupt them in a controlled manner.
  • Study and address problems in software resilience, operations, and delivery.
  • Partner with engineering teams to improve the performance of a high-scale distributed platform.

Requirements

  • 6+ years of relevant software development experience building and operating scalable, fault-tolerant distributed systems.
  • Software development experience in Go, C/C++, Java, or a similar language.
  • Experience with concurrency, multithreading, and distributed system architectures.
  • Experience developing cloud infrastructure services, preferably with Kubernetes.
  • Experience leading and shipping large-scope technical projects with multiple experienced engineers.
  • Expertise with a public cloud provider such as AWS, GCP, or Azure, including infrastructure services like EC2.
  • Excellent communication skills and the ability to work well within and across engineering teams.
  • Strong problem-solving and production debugging skills.
  • Passion for efficiency, availability, scalability, and data governance.
  • High responsibility, ownership, and accountability.
  • Experience understanding performance limits of distributed databases and building tools for performance and scalability measurement (preferred).
  • Background in database benchmarking, test automation, system engineering, performance analysis, and capacity management (preferred).

Benefits

  • Remote-friendly, globally distributed work environment.
  • Employer contributions toward healthcare.
  • Equity in the company through stock options for new team members.
  • Flexible time off in the US and generous time off in other countries.
  • $500 home office setup stipend for remote employees.
  • Opportunities to attend company-wide global gatherings and offsites.
  • Salary range with potential premium market adjustments in certain locations.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Database Reliability Engineer

Sporty Group 51-250 Media

Sporty is seeking a Database Reliability Engineer to own and improve its database infrastructure supporting multiple platforms and international expansion.

Ansible Argo CD Elasticsearch GitHub Actions Go Grafana Helm Jenkins Kubernetes MongoDB MySQL PostgreSQL Prometheus Python RabbitMQ Terraform
10 hours, 10 minutes ago

Senior Cloud Engineer

Moniepoint 1K-5K Diversified Financial Services

Moniepoint is seeking an experienced Cloud Engineer to design and manage its multi-cloud, Kubernetes-based infrastructure and reliability systems supporting high-scale financial services across emerging markets.

Ansible Argo CD AWS Azure Bash CloudFormation Containerd CRI-O DNS Docker EC2 GCP Git GitHub GitLab GitOps Go Grafana HAProxy HashiCorp Vault Helm Jenkins JSON Kafka Kubernetes Load Balancing Microservices MySQL Nginx Prometheus Python Reverse Proxy Secrets Management Sentinel TCP/IP Terraform YAML
10 hours, 55 minutes ago

Senior Site Reliability Engineer

Moniepoint 1K-5K Diversified Financial Services

Moniepoint is hiring an experienced Site Reliability Engineer to improve the reliability, scalability, and observability of its highly distributed financial platform serving emerging markets.

AWS Azure Datadog GCP Go Java Kafka Kubernetes Microservices MySQL New Relic OpenTelemetry PostgreSQL Prometheus Python RabbitMQ Rust
10 hours, 55 minutes ago

Senior Site Reliability Engineer, Identity Platform

Coinbase 1K-5K Capital Markets

Coinbase is hiring an experienced Site Reliability Engineer to build and scale identity and access management tooling for its IT Operations Corporate Engineering team supporting cloud-based, security-first systems.

Ansible AWS Azure C# CI/CD Docker GCP Go Java Kubernetes Python Ruby Secrets Management Terraform
11 hours, 25 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers