Capgemini

Capgemini

Capgemini is a global leader in consulting, technology services, and digital transformation, empowering businesses with innovative solutions and expertise to thrive in a rapidly evolving market.

Internet Software & Services
100K+
Founded 1967
$93M raised

Description

  • Design, build, and manage end-to-end data pipelines across the bronze, silver, and gold layers of the medallion architecture.
  • Ingest and process raw data using Spark and Amazon EMR for scalable distributed computation.
  • Develop and automate data transformations for the base vault using DBT.
  • Support business vault and gold-layer data modeling to produce curated datasets.
  • Work with orchestration tools such as Airflow or AWS Step Functions to run and manage data workflows.
  • Use Amazon S3 and Apache Iceberg to store and manage data assets.
  • Collaborate on data architecture practices that support governance, quality, and reusable data products.
  • Maintain and optimize Elasticsearch-related data engineering components, including indexing, query performance, and administration.

Requirements

  • At least 5 years of experience as an Elasticsearch Data Engineer with ELK stack expertise.
  • Strong experience with Elasticsearch cluster optimization, query development, data modeling, performance tuning, and administration (4–6 years).
  • Deep experience with Spark, Python, ETLs, and Amazon EMR.
  • Hands-on experience with DBT for data transformation and modeling.
  • Experience with Apache Airflow, AWS Step Functions, or similar orchestration tools.
  • Expert knowledge of Amazon S3 and Apache Iceberg for data storage and management.
  • Experience with Kubernetes for container orchestration.
  • Experience with Dremio, Looker, or equivalent semantic layer/business view technologies.
  • Intermediate AWS Cloud experience, including AWS Lambda, Step Functions, IAM, SNS, API Gateway, VPC, and Transit Gateway.
  • BS in Computer Science, Data Engineering, Data Modeling, or a similar field; Big Data or AWS certification is a plus.
  • Full English fluency.
  • Strong analytical, problem-solving, and communication skills.
  • Java Spring Boot and IBM ACE programming experience.

Benefits

  • Competitive salary with performance-based bonuses.
  • Comprehensive benefits package.
  • Private health insurance.
  • Pension plan.
  • Paid time off.
  • Career development, training, and development opportunities.
  • Flexible work arrangements, including remote and/or office-based options.
  • Dynamic and inclusive work culture within a globally recognized group.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Lead Data Platform Engineer

PR Newswire 1K-5K Internet Software & Services

INFOnline, part of saas.group, is seeking a Lead Data Platform Engineer to own and evolve its GCP-native data platform that powers digital audience measurement for the German and Austrian media industry.

CI/CD dbt Docker GCP Go Serverless SQL Terraform
34 minutes ago

OFSAA - Basel Technical Consultant

Unison Group Technology consulting

An experienced OFSAA Basel Technical Consultant is needed to design, develop, and support Basel regulatory reporting solutions for Oracle Financial Services Analytical Applications at a banking environment.

49 minutes ago

Data Engineer for AI Product

Qonto 1K-5K Banks

Qonto is hiring a Data Engineer for AI Product to build the data layer and production infrastructure that powers machine learning products for its finance workspace serving SMEs across Europe.

Apache Airflow Apache Spark CI/CD dbt Machine Learning Python
1 hour, 4 minutes ago

Senior Azure Data Consultant

Trility Consulting 51-250 Internet Software & Services

Trility Consulting is hiring a Senior Azure Data Consultant to work remotely with U.S. clients and lead data architecture and engineering efforts from initial discovery through production delivery.

Agile Azure CI/CD Databricks SQL
1 hour, 4 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers