Senior Data Engineer - Dbt, CI/CD

2 hours, 18 minutes ago
Full-time
Senior
DevOps and Infrastructure
ClickHouse

ClickHouse

ClickHouse provides a fast open source column-oriented database management system that enables users to generate real-time analytical data reports through SQL queries, catering to the needs of industries requiring efficient data processing and analysis.

IT Services
51-250
Founded 2021
$300M raised

Description

  • Design and develop reusable Airflow components, including operators, connectors, hooks, and custom integrations.
  • Build reusable dbt macros and abstractions for incremental strategies, ETL building blocks, and generic data quality tests.
  • Develop DataOps tooling such as CI pipelines, data migration frameworks, and security controls.
  • Conduct thorough code reviews for team members and other contributors.
  • Participate in technical and architectural design discussions.
  • Drive technical leadership in specific areas of the stack.
  • Improve the team's development environment through automation, including AI-assisted workflows.
  • Troubleshoot platform issues and resolve incidents to maintain stability.
  • Develop, refactor, and optimize business data models in line with modeling standards and ClickHouse best practices.
  • Lead the design and implementation of complex data marts involving high data velocity, large volumes, or complex business logic.

Requirements

  • Exceptional SQL skills are required.
  • Strong hands-on experience with Airflow, dbt, and Python.
  • Proven track record building and optimizing large-scale, high-throughput data pipelines.
  • Solid understanding of data warehousing fundamentals, including ETL/ELT, dimensional modeling, and data quality.
  • Bachelor's degree in Computer Science or a related field.
  • Comfortable working in a fast-paced startup environment.
  • Hands-on experience with ClickHouse is a significant advantage.
  • Familiarity with AWS or other cloud platforms is preferred.
  • Experience with CI/CD development, especially using GitHub Actions, is preferred.

Benefits

  • Typical starting salary of $84,000 to $143,000 USD in the US.
  • Typical starting salary of $115,000 to $170,000 USD in US Premium Markets.
  • Flexible, remote-friendly work environment across more than 20 countries.
  • Employer contributions toward healthcare.
  • Stock options for every new team member.
  • Flexible time off in the US, with generous entitlement in other countries.
  • $500 home office setup stipend for remote employees.
  • Opportunities to connect with colleagues at company-wide global gatherings.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

[Job - 29221] Senior Data Developer (Azure), Brazil

CI&T 5K-10K Internet Software & Services

CI&T is seeking a Senior Data Developer (Azure) to build and evolve its cloud data platform in Brazil, turning architectural standards into reliable, scalable data pipelines and analytics-ready datasets.

Apache Airflow Apache Spark Azure CI/CD Databricks dbt Feature Engineering Git Prefect Python Snowflake SQL
16 minutes ago

Software Engineer I - Data

Precision AQ 1001-5000 Business Consulting and Services

Precision Medicine Group is hiring a data-focused application support professional to configure, maintain, and deliver client-facing healthcare applications and data flows across multiple environments.

Agile Azure C# CI/CD HTML .NET Salesforce SQL SQL Server
51 minutes ago

Data Engineer, Azure - Remote, Latin America

Bluelight Consulting 11-50 Internet Software & Services

Bluelight is hiring a remote Data Engineer, Azure to develop and optimize data engineering pipelines and warehousing solutions for client projects across Latin America.

Agile Apache Spark Azure Git Machine Learning Power BI Python REST API SQL Tableau
2 hours, 3 minutes ago

Senior Software Engineer - Data Infrastructure

Marqeta 251-1K Diversified Financial Services

Marqeta is hiring a Senior Software Engineer on its Data Infrastructure team to own and deliver platform-focused work that supports analytics and AI across the company’s data lakehouse, streaming, orchestration, and catalog systems.

Apache Airflow Apache Spark AWS AWS CDK CloudFormation Docker Java Kafka Kubernetes Python SQL Terraform
2 hours, 10 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers