Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Administer, maintain, and optimize Cloudera Hadoop clusters (CDP or legacy distributions) to ensure high availability and performance.
  • Manage upgrades, patching, monitoring, and troubleshooting of core Cloudera services (HDFS, YARN, Hive, HBase, Impala, Spark, Kafka, etc.).
  • Perform capacity planning, performance tuning, and cluster security hardening.
  • Design, document, and communicate data storage architectures and end-to-end data flow diagrams.
  • Oversee the design, development, and maintenance of ETL/ELT pipelines for batch and streaming workloads using tools such as Spark, Hive, Kafka, Airflow, and NiFi.
  • Review and guide data ingestion, transformation, and storage strategies to ensure data quality, reliability, and scalability.
  • Implement and maintain security and governance controls, including Kerberos, Ranger, data access controls, lineage, and compliance requirements.
  • Lead and mentor data engineering teams, set development/deployment/monitoring/documentation standards, and collaborate with data scientists, analysts, DevOps, and application teams.
  • Translate business requirements into scalable, secure data platform solutions and collaborate with stakeholders on platform architecture decisions.

Requirements

  • 7+ years of experience in data platforms, Hadoop, or big data administration.
  • Deep experience with Cloudera Machine Learning (CML) workspaces, projects, and runtimes.
  • Strong hands-on experience with Cloudera administration (CDH/CDP).
  • Strong experience with containerization technologies (Docker, Kubernetes) and OpenShift.
  • Proven experience managing and mentoring data engineering teams.
  • Solid background in ETL pipeline design and implementation for batch and streaming use cases.
  • Expertise with Spark, Hive, HDFS, YARN, Kafka, and orchestration tools such as Airflow or NiFi.
  • Ability to design and produce architecture diagrams for data storage and data flows, with solid understanding of Linux, networking, and distributed systems.
  • Experience with cloud platforms (AWS, Azure, or GCP) is a strong plus.
  • Nice to have: Cloudera certification(s), exposure to data governance and metadata management tools, and experience in large-scale enterprise or regulated environments.

Benefits

  • Attractive, market-leading salary package.
  • Clear career advancement path with professional development opportunities.
  • One-year contract with Xenon7, with significant opportunity for renewal.


