Staff Data Engineer

2 months ago
Full-time
Lead
Software Development
tvScientific

tvScientific

tvScientific offers a pioneering Connected TV advertising and marketing platform that integrates traditional television's impact with the effectiveness of digital advertising, specifically tailored for performance marketers.

Media
11-50
Founded 2020
$22M raised

Description

  • Design and maintain a scalable identity resolution platform.
  • Build pipelines and services to ingest, normalize, link, and version identity data from multiple sources.
  • Implement transparent, auditable, and measurable deterministic and probabilistic matching logic.
  • Partner with product and analytics teams to deliver reliable APIs and datasets for identity data.
  • Build and operate batch and streaming pipelines using modern data stack tools.
  • Create documentation, standards, and runbooks for identity and governance systems.
  • Own data governance foundations, including lineage, quality checks, schema enforcement, and access controls.
  • Implement privacy-by-design practices, including PII handling, consent enforcement, and retention policies.
  • Work with legal, privacy, and security teams to operationalize regulatory requirements such as GDPR and CCPA.
  • Establish monitoring and alerting for data quality, freshness, and integrity.

Requirements

  • Production data engineering experience.
  • Bachelor’s degree in computer science, a related field, or equivalent experience.
  • Proficiency in Spark and Scala, with experience building data infrastructure in Spark using Scala.
  • Experience delivering significant technical initiatives and building reliable, large-scale services.
  • Experience delivering APIs backed by relationship-heavy datasets.
  • Experience implementing data governance practices, including data quality, metadata management, and access controls.
  • Strong understanding of privacy-by-design principles and handling sensitive or regulated data.
  • Familiarity with data lakes, cloud warehouses, and storage formats.
  • Strong proficiency in AWS services.
  • Excellent written and verbal communication skills.
  • Demonstrated ability to design and implement scalable, efficient data infrastructure.
  • High attention to detail in implementing automated data quality checks.
  • Effective collaboration with cross-functional teams.
  • Demonstrated ability to use AI to improve speed and quality in day-to-day work for relevant outputs.
  • Strong track record of critically evaluating and verifying AI-assisted work through testing, source-checking, data validation, or peer review.
  • High integrity and ownership, with the ability to protect sensitive data and remain accountable for final decisions and deliverables.

Benefits

  • Base salary range of $177,185 to $364,795 USD for US-based applicants.
  • Eligible for equity.
  • Relocation assistance is not available for this position.
  • Work model information is available through PinFlex.
  • Benefits information for the role is available through Pinterest Life.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Middle Data Engineer for Real Time Streaming Platform

GR8 Tech 251-1K IT Services

GR8_TECH is hiring a Streaming Data Engineer to design, build, and maintain Kafka-based streaming data solutions that support reliable processing, integrations, and testing for its B2B iGaming platform.

CI/CD Grafana Java JSON Kafka Kubernetes Prometheus Scala SQL
6 hours, 4 minutes ago

Senior Data Engineer - Managed Services

3Cloud 251-1K Internet Software & Services

3Cloud is hiring a Senior Data Engineer for Managed Services to support and optimize client Microsoft Azure data and Power BI environments while delivering analytics solutions and resolving incidents across diverse customer accounts.

Azure Databricks Power BI
6 hours, 34 minutes ago

Senior Compute Platform Engineer

Zeta Global 1K-5K Media

Zeta Global is hiring a Senior Compute Platform Engineer for its Data Platform team to build and operate the infrastructure behind its lakehouse and large-scale data processing environment.

Apache Airflow Apache Spark AWS EC2 Hadoop Kubernetes Linux Scala
6 hours, 34 minutes ago

Data Engineer

Stitch Fix 10K-50K Textiles, Apparel & Luxury Goods

Stitch Fix is hiring a data engineer to help build and scale the infrastructure that centralizes ETL logic and metric definitions for personalized retail analytics and AI-driven operations.

Apache Airflow Apache Spark AWS dbt Python SQL
6 hours, 49 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers