Zeta Global


Zeta Global provides an AI-powered marketing cloud that enables enterprises to acquire, grow, and retain customers through precision marketing, leveraging data science, advanced analytics, and machine learning to create optimized customer experiences.

Industry: Media
Company size: 1K-5K
Founded: 2007

Description

  • Design, build, and operate batch and streaming data pipelines for large-scale event data.
  • Ingest, transform, and enrich data such as impressions, clicks, conversions, costs, and identity signals.
  • Create and maintain curated aggregates, marts, and dimensional models for multiple internal consumers.
  • Define schemas, data contracts, and versioning strategies to support stable downstream systems.
  • Implement validation, anomaly detection, backfills, and reconciliation to improve data quality and reliability.
  • Optimize compute and storage for scale, balancing latency, throughput, and cost.
  • Build repeatable orchestration workflows and CI/CD processes for data pipelines.
  • Instrument data systems with metrics, logs, lineage, and alerting for observability and incident response.
  • Apply security and governance controls, including least-privilege access and PII-aware handling.
  • Partner closely with backend, ML, and product teams to deliver trusted data products.

Requirements

  • 5+ years of experience building production data pipelines and data products in a high-scale environment.
  • Strong experience with SQL and data modeling, including dimensional modeling, star/snowflake schemas, and event modeling.
  • Hands-on experience with streaming systems such as Kafka or AWS Kinesis, including event-driven designs.
  • Proficiency in one or more data engineering languages: Python, Java, Scala, or Go.
  • Experience with distributed data processing tools such as Spark or Flink and performance tuning at scale.
  • Experience with AWS data services and cloud-native patterns such as S3, Glue/EMR, Athena, and Redshift.
  • Familiarity with lakehouse/table formats and large-scale storage patterns, including Parquet; experience with Iceberg, Hudi, or Delta is a plus.
  • Experience with orchestration/workflow tools such as Airflow, Dagster, or Step Functions, plus CI/CD for data workloads.
  • Strong data quality and observability practices, including testing, monitoring, lineage, and SLA/SLO awareness.
  • Experience with SQL and NoSQL data stores such as Postgres, MySQL, DynamoDB, Cassandra, or Redis, and with choosing the right store for each use case.
  • Clear communicator and collaborator who can work with mixed audiences and translate needs into reliable data interfaces.
  • Preferred: AdTech or programmatic advertising domain knowledge, including DSP/SSP/exchange/RTB concepts and data flows.
  • Preferred: Experience building measurement pipelines for attribution, incrementality, lift, or experimentation analytics.
  • Preferred: Experience supporting ML feature stores, offline/online feature generation, or model training datasets.
  • Preferred: Experience with real-time analytics stores such as Druid, ClickHouse, or Pinot and high-cardinality aggregation strategies.
  • Preferred: Deep knowledge of data governance and privacy, including PII handling and consent-aware processing.
  • Preferred: Open-source contributions, publications, or conference speaking.
  • Preferred: BS/MS in Computer Science, Engineering, or equivalent practical experience.

Benefits

  • Unlimited PTO.
  • Excellent medical, dental, and vision coverage.
  • Employee equity.
  • Employee discounts.
  • Virtual wellness classes.
  • Pet insurance.
  • Salary range of $165,000 to $175,000 depending on location and experience.


