Software Engineer II (Data Platform)

2 hours, 53 minutes ago
Full-time
Mid Level
Software Development
G-P/Globalization Partners

G-P offers Employer of Record (EOR) solutions for global team management in 180+ countries, combining EOR, payroll, and HCM solutions for streamlined, compliant workforce expansion.

Professional Services
1K-5K

Description

  • Design and develop core data platform components, including high-volume event ingestion pipelines and real-time processing workflows.
  • Build reusable data access layers and framework-based ETL/ELT processes that transform business data and event streams into Lakehouse assets.
  • Develop internal APIs and backend services that allow other engineering teams to interact with the data platform.
  • Build robust, fault-tolerant data pipelines with a focus on correctness, consistency, and availability.
  • Implement schema enforcement and validation gates to support data governance and guard against poor data quality.
  • Optimize code and query performance to keep workloads fast and cost-efficient.
  • Build and own observability tooling, including logging, metrics, and alerting for pipeline health and uptime.
  • Proactively troubleshoot production bottlenecks, participate in root-cause analysis, and support incident response.
  • Apply automated testing, including unit, integration, and regression tests, to data code.
  • Participate in design discussions and code reviews while following governance and PII handling standards.

Requirements

  • 3+ years of software engineering experience building production-grade backend or data systems.
  • Strong hands-on experience with Apache Spark and the Databricks/Delta Lake ecosystem.
  • Expert-level Python and SQL skills.
  • Deep understanding of software design principles, debugging, and distributed computing concepts.
  • Experience with CI/CD practices.
  • Working knowledge of Terraform or similar infrastructure-as-code tools.
  • Background in building monitored, observable systems where data quality and validation are first-class concerns.
  • Experience working in an AI-accelerated development environment is preferred.
  • Familiarity with AI-assisted development tools such as GitHub Copilot or Cursor is preferred.

Benefits

  • Competitive compensation with eligibility for additional pay based on role type.
  • Annual bonus eligibility for non-sales roles, based on individual and company performance.
  • Commission structure for sales roles.
  • Generous paid parental leave.
  • Flexible time off.
  • Medical, dental, and vision insurance.
  • Spending accounts.
  • Sabbatical after 5 years.
