Software Engineer II, Big Data

1 week, 4 days ago
Full-time
Mid Level
Software Development
tvScientific

tvScientific

tvScientific offers a pioneering Connected TV advertising and marketing platform that integrates traditional television's impact with the effectiveness of digital advertising, specifically tailored for performance marketers.

Media
11-50
Founded 2020
$22M raised

Description

  • Design and implement robust data infrastructure in AWS using Spark and Scala.
  • Evolve core data pipelines to scale efficiently with company growth.
  • Store data in optimal engines and formats to balance performance and cost.
  • Collaborate with cross-functional teams to design data solutions that meet business needs.
  • Design and implement knowledge graphs and expose them through batch processing and APIs.
  • Leverage and optimize AWS resources while designing for scale.
  • Work closely with Data Science and Product teams.
  • Implement automated data quality checks with strong attention to detail.
  • Define and implement a strategic vision for data engineering as an individual contributor.

Requirements

  • Production data engineering experience.
  • Proficiency in Spark and Scala, with preferred experience building data infrastructure in Spark using Scala.
  • Experience delivering significant technical initiatives and building reliable, large-scale services.
  • Experience delivering APIs backed by relationship-heavy datasets.
  • Familiarity with data lakes, cloud warehouses, and storage formats.
  • Strong proficiency in AWS services.
  • Expertise in SQL for data manipulation and extraction.
  • Excellent written and verbal communication skills.
  • Bachelor's degree in Computer Science or a related field.
  • Demonstrated ability to use AI to improve speed and quality in day-to-day workflow for relevant outputs.
  • Strong track record of critically evaluating and verifying AI-assisted work through testing, source-checking, data validation, or peer review.
  • High integrity and ownership, including protecting sensitive data and remaining accountable for final decisions and deliverables.
  • Experience in adtech is preferred.
  • Experience implementing data governance practices, including data quality, metadata management, and access controls, is preferred.
  • Strong understanding of privacy-by-design principles and handling sensitive or regulated data is preferred.
  • Familiarity with data table formats like Apache Iceberg or Delta is preferred.
  • Previous experience building out a Data Engineering function is preferred.
  • Proven experience working closely with Data Science teams on machine learning pipelines is preferred.

Benefits

  • Base salary range of $123,696 to $254,667 USD for US-based applicants.
  • Eligible for equity.
  • Remote-friendly working model noted via PinFlex.
  • Relocation assistance is not provided.
  • Additional culture and benefits information is available through Pinterest Life.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior AI & Data Engineer

Orion Innovation 1K-5K IT Services

Orion Innovation is seeking a Web Delivery Lead to drive end-to-end delivery of AEM-based global websites for its digital engagement platforms, with a focus on reliable launches, governance, and AI-enabled web operations.

Agile AWS Azure CDN CI/CD Computer Vision DNS Docker GCP GPT GraphQL Hugging Face Java JavaScript Kubernetes LLM NLP Python PyTorch SEO TensorFlow
5 hours, 10 minutes ago

Data Engineer

Innodata 1K-5K IT Services

Innodata is seeking a Data Engineer to build enterprise data warehouses, data lakes, and pipelines that support data center supply chain and real estate operations, while enabling AI-driven analytics and workflow automation.

AWS ERP GCP Looker MLOps Power BI Python SQL Tableau
5 hours, 25 minutes ago

Data Engineer

Pavago IT Services

Pavago is hiring a remote Data Engineer to build and maintain cloud-based data pipelines, warehouses, and analytics datasets that support reporting, automation, and business intelligence.

Apache Airflow AWS CI/CD Dagster dbt Docker GCP GitHub Actions GitLab CI Jenkins Kafka Kubernetes Looker Luigi Power BI Prefect Python Scala Snowflake SQL Tableau Terraform
5 hours, 40 minutes ago

Senior Data Engineer - Surveillance & Interoperability

inventYOU 1-10 Internet Software & Services

Senior Data Engineer - Surveillance & Interoperability at inventYOU, responsible for designing and delivering large-scale data integration and interoperability solutions that support secure information exchange, surveillance, monitoring, and analytics across complex systems.

AWS Azure GCP Python REST API SQL
5 hours, 55 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers