ClickHouse

ClickHouse

ClickHouse provides a fast open source column-oriented database management system that enables users to generate real-time analytical data reports through SQL queries, catering to the needs of industries requiring efficient data processing and analysis.

IT Services
51-250
Founded 2021
$300M raised

Description

  • Own the full lifecycle of JVM-based data framework integrations, from core database drivers to SDKs and connectors.
  • Design, implement, and maintain high-performance connectors, sinks, and sources for frameworks such as Apache Spark, Apache Flink, Apache Beam, and Kafka Connect.
  • Develop and optimize core driver code to handle massive throughput and low-latency workloads.
  • Profile and tune JVM performance, including memory management and garbage-collection optimizations, to meet throughput and latency targets.
  • Collaborate with the open-source community, internal teams, and enterprise users to gather requirements, iterate on designs, and ship production-ready integrations.
  • Ensure reliability, scalability, and excellent developer experience for ClickHouse integrations across JVM-based applications.
  • Troubleshoot and resolve production issues related to connectors, JDBC, network protocols (TCP/IP, HTTP), and data throughput.
  • Contribute to documentation, developer tooling, and best practices for data integration and connector development.

Requirements

  • 6+ years of software development experience building and delivering high-quality, data-intensive solutions.
  • Proven experience with the internals of at least one of: Apache Spark, Apache Flink, Kafka Connect, or Apache Beam.
  • Experience developing or extending connectors, sinks, or sources for big data processing frameworks (e.g., Spark, Flink, Beam, Kafka Connect).
  • Strong proficiency in Java and the JVM ecosystem, including memory management, garbage collection tuning, and performance profiling.
  • Solid experience with concurrent programming in Java, including threads, executors, and reactive or asynchronous patterns.
  • Strong understanding of database fundamentals: SQL, data modeling, query optimization, and familiarity with OLAP/analytical databases.
  • Track record of building scalable data integration systems (beyond simple ETL jobs).
  • Understanding of JDBC, network protocols (TCP/IP, HTTP), and techniques for optimizing data throughput over the wire.
  • Outstanding written and verbal communication skills and a passion for open-source development.
  • Nice to have: prior contributions to open-source projects, familiarity with ClickHouse or similar high-performance data platforms, and working knowledge of Python for data engineering (Pandas, PySpark, Airflow).

Benefits

  • Flexible, remote-friendly work environment with operations across ~20 countries.
  • Employer contributions toward healthcare coverage.
  • Equity: stock options provided to new employees.
  • Flexible time off in the US and generous time-off entitlement in other countries.
  • $500 home office setup allowance for remote employees.
  • Opportunities for in-person Global Gatherings and company-wide offsites.
  • Competitive compensation with a typical starting salary range referenced for US roles and potential regional premiums (San Francisco Bay Area and NYC); contact paytransparency@clickhouse.com for compensation questions.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Data Engineering Tech Lead

Lingaro 5K-10K IT Services

Data Engineering Tech Lead at Lingaro (Data Engineering & Management) — lead a Poland-based remote/full-time team to design, deliver, and maintain scalable, secure data engineering solutions while mentoring engineers and ensuring timely, high-quality project delivery.

Azure CI/CD Python Scala SQL
14 hours, 40 minutes ago

Junior Data Engineer (Remote Argentina) / Ingénieur données junior (à distance)

GlobalVision 51-250 Internet Software & Services

Junior Data Engineer at GlobalVision supporting and maintaining the company’s data infrastructure to ensure reliable, accessible, and actionable data that informs business decision-making across the organization.

dbt Domo Machine Learning Power BI Python Salesforce SQL Tableau
1 month ago

Data/Infrastructure Advocate Engineer - EMEA Remote

Hugging Face 51-250 IT Services

Hugging Face is hiring a Data/Infrastructure Advocate Engineer to bridge data infrastructure and the community by championing Xet storage on the Hub and enabling efficient storage, versioning, and collaboration on large-scale datasets.

AWS GitHub Pandas Python
1 month ago

Associate Software Engineer - Data Engineer

GroundTruth 251-1K Media

GroundTruth is hiring a Data Engineering Associate Software Engineer on the Attribution Team to build and maintain scalable data pipelines and infrastructure that enable accurate, real-world ad attribution and analytics.

Apache Airflow Apache Spark AWS Docker Git Hadoop Java Looker MapReduce Python REST API Shell Scripting SQL
1 month ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers