ClickHouse

ClickHouse provides a fast, open-source, column-oriented database management system that lets users generate real-time analytical reports with SQL queries.

Industry: IT Services
Company size: 51-250
Founded: 2021
Funding: $300M raised

Description

  • Own the full lifecycle of JVM-based data framework integrations, from core drivers to SDKs and connectors.
  • Design, build, and maintain high-performance connectors, sinks, and sources for big data processing frameworks (e.g., Spark, Flink, Beam, Kafka Connect).
  • Develop and optimize the core database driver that handles very high throughput and large-scale data ingestion.
  • Profile and tune JVM performance, including memory management and garbage collection, to maximize throughput and reliability.
  • Implement concurrent and asynchronous solutions using threads, executors, and reactive patterns to support data-intensive workloads.
  • Collaborate with the open-source community, internal product and engineering teams, and enterprise users to iterate on integrations and developer experience.
  • Ensure integrations meet standards for performance, reliability, and developer ergonomics across real-time analytics and observability use cases.
  • Contribute to documentation, SDKs, and developer tooling to make ClickHouse feel native in JVM-based applications.

Requirements

  • 6+ years of software development experience building and delivering data-intensive solutions.
  • Proven experience with the internals of at least one of: Apache Spark, Apache Flink, Kafka Connect, or Apache Beam.
  • Experience developing or extending connectors, sinks, or sources for big data processing frameworks (Spark, Flink, Beam, Kafka Connect, etc.).
  • Strong understanding of database fundamentals: SQL, data modeling, query optimization, and familiarity with OLAP/analytical databases.
  • Track record of building scalable data integration systems (beyond simple ETL jobs).
  • Strong proficiency in Java and the JVM ecosystem, including memory management, garbage collection tuning, and performance profiling.
  • Solid experience with concurrent programming in Java, including threads, executors, and reactive/asynchronous patterns.
  • Understanding of JDBC, network protocols (TCP/IP, HTTP), and techniques for optimizing data throughput over the wire.
  • Outstanding written and verbal communication skills and a passion for open-source development.
  • Bonus: prior open-source contributions, familiarity with ClickHouse or similar high-performance data platforms, and working knowledge of Python tooling used in data engineering (Pandas, PySpark, Airflow).

Benefits

  • Flexible, remote-friendly work environment (ClickHouse is globally distributed and operates in 20 countries).
  • Employer contributions toward healthcare.
  • Equity in the company through stock options for new hires.
  • Flexible time off in the US and generous time-off entitlement in other countries.
  • $500 home office setup allowance for remote employees.
  • Opportunities to attend company-wide, in-person Global Gatherings (offsites).
  • Salary ranges are provided for US roles, with possible market premiums for the SF Bay Area and NYC; individual placement within the range depends on experience and location.
