Capgemini

Capgemini

Capgemini is a global leader in consulting, technology services, and digital transformation, empowering businesses with innovative solutions and expertise to thrive in a rapidly evolving market.

Internet Software & Services
100K+
Founded 1967
$93M raised

Description

  • Design, build, and manage end-to-end data pipelines across the bronze, silver, and gold layers of the medallion architecture.
  • Ingest and process raw data using Spark and Amazon EMR for scalable distributed computation.
  • Develop and automate data transformations for the base vault using DBT.
  • Support business vault and gold-layer data modeling to produce curated datasets.
  • Work with orchestration tools such as Airflow or AWS Step Functions to run and manage data workflows.
  • Use Amazon S3 and Apache Iceberg to store and manage data assets.
  • Collaborate on data architecture practices that support governance, quality, and reusable data products.
  • Maintain and optimize Elasticsearch-related data engineering components, including indexing, query performance, and administration.

Requirements

  • At least 5 years of experience as an Elasticsearch Data Engineer with ELK stack expertise.
  • Strong experience with Elasticsearch cluster optimization, query development, data modeling, performance tuning, and administration (4–6 years).
  • Deep experience with Spark, Python, ETLs, and Amazon EMR.
  • Hands-on experience with DBT for data transformation and modeling.
  • Experience with Apache Airflow, AWS Step Functions, or similar orchestration tools.
  • Expert knowledge of Amazon S3 and Apache Iceberg for data storage and management.
  • Experience with Kubernetes for container orchestration.
  • Experience with Dremio, Looker, or equivalent semantic layer/business view technologies.
  • Intermediate AWS Cloud experience, including AWS Lambda, Step Functions, IAM, SNS, API Gateway, VPC, and Transit Gateway.
  • BS in Computer Science, Data Engineering, Data Modeling, or a similar field; Big Data or AWS certification is a plus.
  • Full English fluency.
  • Strong analytical, problem-solving, and communication skills.
  • Java Spring Boot and IBM ACE programming experience.

Benefits

  • Competitive salary with performance-based bonuses.
  • Comprehensive benefits package.
  • Private health insurance.
  • Pension plan.
  • Paid time off.
  • Career development, training, and development opportunities.
  • Flexible work arrangements, including remote and/or office-based options.
  • Dynamic and inclusive work culture within a globally recognized group.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Data Conversion Software Engineer

Career TEAM 251-1K Professional Services

Career Team is hiring a Data Conversion Software Engineer to build data transformation and integration software for government-funded workforce development programs across the United States.

Agile Angular CI/CD Docker Express.js JavaScript JSON MongoDB NestJS Next.js Node.js React Scrum TypeScript XML
15 hours, 28 minutes ago

Sr. Associate Data Platform - Remote

TWO95 International 51-250 Internet Software & Services

Sr. Associate Data Platform is a contract role with a Los Angeles-based team supporting Adobe analytics and data platform implementation work across on-site and remote locations.

CSS Digital Marketing HTML JavaScript jQuery Vue.js
15 hours, 43 minutes ago

Freelance Data Scraping Engineer (Python)

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is hiring a part-time remote Python Data Scraping Engineer for the Tendem project to deliver accurate, structured data extraction and processing within a hybrid AI-plus-human workflow.

AJAX GitHub JavaScript JSON LLM Python Selenium
15 hours, 43 minutes ago

Freelance Data Scraping Engineer (Python)

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is hiring a part-time freelance Python Data Scraping Engineer for the Tendem project to build and oversee data extraction workflows in a hybrid AI + human environment.

AJAX GitHub JavaScript JSON LLM Python Selenium
15 hours, 43 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers