Engineering Manager- Data Engineering

2 hours, 10 minutes ago
Full-time
Lead
Software Development
GroundTruth

GroundTruth

GroundTruth is a media platform that leverages real-world behavior to deliver marketing solutions that drive tangible business outcomes.

Media
251-1K
Founded 2009

Description

  • Architect and deploy large-scale distributed data processing pipelines using PySpark on Amazon EMR and AWS Glue.
  • Build and evolve core data infrastructure for the audience platform and related analytics services.
  • Write production-ready Python and PySpark code, conduct code reviews, and optimize Spark performance and cost efficiency.
  • Coach and mentor data engineers on Python, Spark optimization, data modeling, and career growth.
  • Partner with stakeholders and engineering leadership to plan and deliver data-first initiatives across advertising, analytics, and reporting.
  • Apply Agile practices such as Scrum to support iterative delivery and cross-team collaboration.
  • Manage team performance through regular 1:1s, feedback, quarterly reviews, recognition, and performance management.
  • Support data quality, pipeline reliability, and modernized delivery through AI-assisted engineering practices.

Requirements

  • Bachelor’s degree in Computer Engineering, Data Science, or equivalent practical experience.
  • 8+ years of experience in technology, with a focus on data engineering, data warehousing, or big data architecture.
  • 2+ years of experience leading a data engineering team.
  • Deep expertise in Python and PySpark for distributed processing, data skew handling, and Spark memory optimization.
  • Strong AWS experience managing Amazon EMR for large-scale processing.
  • Experience building ETL systems with SQL and AWS big data technologies such as S3, EMR, Glue, Athena, and Lambda.
  • Expert-level SQL skills for complex transformations, performance tuning, and analytics.
  • Advanced proficiency with Airflow and Git.
  • Hands-on experience with AI-native tools such as Cursor, Claude, or GitHub Copilot, and using AI to improve data engineering workflows.
  • Preferred experience with debugging complex PySpark and/or Scala jobs, optimizing EMR Instance Fleets/Spot Instances, and event-driven architecture using AWS SQS.
  • AWS certification preferred.
  • Organized, collaborative, detail-oriented, and strong at communicating technical trade-offs to business partners.

Benefits

  • Parental leave for maternity and paternity.
  • Flexible time off, including earned leave, sick leave, birthday leave, bereavement leave, and company holidays.
  • Daily in-office catered breakfast, lunch, snacks, and beverages.
  • Health coverage for hospitalization, including coverage for the employee’s nuclear family and parents.
  • Free tele-med consultations plus discounts on health checkups and medicines.
  • Wellness and gym reimbursement.
  • Pet expense reimbursement.
  • Childcare expenses and reimbursements, including creche reimbursement.
  • Employee referral program and education/skill development reimbursement programs.
  • Cell phone and internet reimbursement options, plus meal card and retirement-related benefits such as provident fund and national pension contributions.
  • Co-working space reimbursement and special benefits on salary account.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Data Engineer: Data Lake (Remote)

Constructor Internet Software & Services

Constructor is hiring a Senior Data Engineer for its Data Lake Team to build and operate the core data platform that powers internal analytics, data science, and ML workflows at high scale.

Apache Spark AWS ClickHouse CloudFormation Databricks FastAPI LLM Machine Learning OpenTelemetry PagerDuty Prometheus Python Terraform Trino
10 minutes ago

AWS Data Engineer

qode Internet Software & Services

AWS Data Engineer at AWS, responsible for building and maintaining scalable cloud data pipelines and modernizing data platforms for secure, high-quality data processing and migration.

AWS CI/CD Databricks Power BI Python Snowflake Tableau
10 minutes ago

Data QA Engineer

Dreamix 51-250 Internet Software & Services

Dreamix is seeking an experienced Data QA Engineer to own quality across its data products by validating pipelines, reporting on risks and defects, and supporting release decisions.

Agile AWS Azure CI/CD JIRA JSON Python Scrum SQL
25 minutes ago

FBS Senior Data Engineer (Airflow)

Capgemini 100K+ Internet Software & Services

FBS – Farmer Business Services is hiring an Airflow-focused data platforms professional to help build and support centralized shared data platform services for Farmers, with responsibility for strategy, implementation, and best practices.

Apache Airflow CI/CD dbt DynamoDB Jenkins Python Snowflake
25 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers