ClickHouse

ClickHouse

ClickHouse provides a fast open source column-oriented database management system that enables users to generate real-time analytical data reports through SQL queries, catering to the needs of industries requiring efficient data processing and analysis.

IT Services
51-250
Founded 2021
$300M raised

Description

  • Own and evolve ClickHouse's Python connector and SDK ecosystem.
  • Build and maintain enterprise-grade integrations with orchestration platforms such as Airflow, Dagster, and Prefect.
  • Develop and maintain integrations with transformation tools such as dbt.
  • Drive AI and LLM integration strategy for RAG architectures, ML feature pipelines, and LLM-powered data applications.
  • Engage with the open-source community by triaging issues, supporting contributors, and incorporating user feedback into the roadmap.
  • Collaborate with Product, Cloud, and other engineering teams to align integration work with platform priorities.
  • Bring a data practitioner’s perspective to product and roadmap decisions.
  • Set architecture, performance, reliability, and API design standards for key integrations.

Requirements

  • 7+ years of software development experience.
  • Hands-on experience as a Data Engineer, Data Scientist, or ML Engineer is preferred.
  • Proven experience designing, building, and maintaining production-grade Python connectors, SDKs, or integrations.
  • Experience with at least one major platform in orchestration, BI, MLOps, or data transformation.
  • Strong experience with the Python data ecosystem, including Pandas, NumPy, and Pydantic.
  • Prior contributions to or deep practical experience with Airflow, Dagster, or Prefect.
  • Hands-on production experience with AI/ML in data engineering contexts, including embedding generation, vector search, feature pipelines, or LLM-powered tooling.
  • Strong understanding of SQL, data modeling, query optimization, and OLAP or analytical databases.
  • Experience with concurrent Python, including threading, multiprocessing, and async patterns.
  • Excellent written and verbal communication skills.
  • Experience deploying AI/ML models in production, including inference APIs and vector databases, is a plus.
  • Prior experience as a Data Engineer or Data Scientist in a product-facing or platform role is a plus.
  • Familiarity with ClickHouse or similar high-performance OLAP platforms is a plus.
  • Familiarity with the JVM ecosystem is a plus.

Benefits

  • Flexible remote-friendly work environment with a globally distributed team.
  • Employer contributions toward healthcare.
  • Equity in the company through stock options for new team members.
  • Flexible time off in the US and generous leave entitlement in other countries.
  • $500 home office setup stipend for remote employees.
  • Opportunities to attend company-wide global gatherings and offsites.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Sr. Data Engineer I (6436)

MetroStar 251-1K IT Services

MetroStar is hiring a Sr. Data Engineer I to support an enterprise AI-enabled financial compliance initiative for the Department of War, building the data foundation for compliance modernization across 180+ systems.

PostgreSQL Python SQLAlchemy XML YAML
8 hours, 34 minutes ago

Backend Developer (Node.js)

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a Backend Developer to build and scale the core services behind its global nonprofit fundraising platform.

Bull ClickHouse Datadog Elasticsearch Grafana Kafka Koa MongoDB NestJS Node.js Prometheus RabbitMQ React Redis REST API TypeScript Vue.js
8 hours, 49 minutes ago

Staff Software Engineer - Product Analytics

Datadog 5K-10K IT Services

Datadog is hiring a Staff Engineer to lead the backend technical direction for its Product Analytics platform, building systems that help customers analyze user behavior, retention, and growth at scale.

SQL
8 hours, 49 minutes ago

Senior Staff Data Engineer

SoFi 1K-5K Capital Markets

SoFi is seeking a Senior Staff Data Engineer to lead the architecture and evolution of its AI-powered Data Platform, advancing data reliability, governance, and scalable data experiences for members.

Apache Airflow Apache Spark AWS GCP GitLab Hadoop Kafka Python Snowflake SQL
8 hours, 49 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers