Staff Data Engineer, tvScientific

1 hour, 54 minutes ago
Full-time
Lead
Software Development
Pinterest

Pinterest

Pinterest is the world's first visual discovery engine, offering a vast dataset of ideas with over 200 billion recipes, home hacks, and style inspiration. With a mission to inspire everyone to create a life they love, Pinterest empowers its employees t...

Internet Software & Services
5K-10K
Founded 2010

Description

  • Design and maintain a scalable identity resolution platform.
  • Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources.
  • Implement deterministic and probabilistic matching logic that is transparent, auditable, and measurable.
  • Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets.
  • Build and operate batch and streaming pipelines using modern data stack tools.
  • Create documentation, standards, and runbooks for identity and governance systems.
  • Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls.
  • Implement privacy-by-design practices, including PII handling, consent enforcement, and retention policies.
  • Collaborate with legal, privacy, and security teams to operationalize regulatory requirements such as GDPR and CCPA.
  • Establish monitoring and alerting for data quality, freshness, and integrity.

Requirements

  • Production data engineering experience.
  • Bachelor’s degree in computer science, a related field, or equivalent experience.
  • Proficiency in Spark and Scala, with proven experience building data infrastructure in Spark using Scala.
  • Experience delivering significant technical initiatives and building reliable, large-scale services.
  • Experience delivering APIs backed by relationship-heavy datasets.
  • Experience implementing data governance practices, including data quality, metadata management, and access controls.
  • Strong understanding of privacy-by-design principles and handling sensitive or regulated data.
  • Familiarity with data lakes, cloud warehouses, and storage formats.
  • Strong proficiency in AWS services.
  • Excellent written and verbal communication skills.
  • Demonstrated success designing and implementing scalable and efficient data infrastructure.
  • High attention to detail in implementing automated data quality checks.
  • Effective collaboration with cross-functional teams.
  • Demonstrated ability to use AI to improve speed and quality in day-to-day workflow for relevant outputs.
  • Strong track record of critically evaluating and verifying AI-assisted work through testing, source-checking, data validation, and peer review.
  • High integrity and ownership, with the ability to protect sensitive data, avoid over-reliance on AI, and remain accountable for final decisions and deliverables.

Benefits

  • Base salary range of $177,185 to $364,795 USD for US-based applicants.
  • Eligible for equity.
  • Remote-friendly working model via PinFlex.
  • No relocation assistance provided for this role.
  • Access to Pinterest culture and benefits information for the position.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Data Engineer

tvScientific 11-50 Media

tvScientific is hiring a Staff Data Engineer to build and evolve the company’s core data infrastructure and pipelines that support its CTV advertising platform.

Apache Spark AWS Scala SQL
10 minutes ago

Senior Software Engineer - Data Platform

Motional 1K-5K Automotive

Motional is seeking a Data Platform engineer to help design and operate core infrastructure that turns large-scale data into actionable insights for Machine Learning and Autonomy teams.

AWS CI/CD Machine Learning Microservices Python SQL
54 minutes ago

AI Data Engineer

Power Digital is hiring a data engineer to build and own end-to-end data systems that power its internal AI platform and translate raw data into AI-ready outputs across the organization.

AWS CI/CD Git Looker Machine Learning Python Snowflake SQL
1 hour, 10 minutes ago

Senior Data Engineer

Level Access 251-1K Internet Software & Services

Level Access is seeking a Senior Data Engineer to build and operate a unified, governed data platform on Databricks and AWS that supports analytics and AI initiatives.

Apache Spark AWS CI/CD Databricks dbt DynamoDB Git MLflow Python REST API SQL
2 hours, 10 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers