Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Administer, maintain, and optimize Cloudera Hadoop clusters (CDP or legacy distributions) to ensure high availability and performance.
  • Manage upgrades, patching, monitoring, and troubleshooting of core Cloudera services (HDFS, YARN, Hive, HBase, Impala, Spark, Kafka, etc.).
  • Perform capacity planning, performance tuning, and cluster security hardening.
  • Design, document, and communicate data storage architectures and end-to-end data flow diagrams.
  • Oversee the design, development, and maintenance of ETL/ELT pipelines for batch and streaming workloads using tools such as Spark, Hive, Kafka, Airflow, and NiFi.
  • Review and guide data ingestion, transformation, and storage strategies to ensure data quality, reliability, and scalability.
  • Implement and maintain security and governance controls, including Kerberos, Ranger, data access controls, lineage, and compliance requirements.
  • Lead and mentor data engineering teams, set standards for development, deployment, monitoring, and documentation, and collaborate with data scientists, analysts, DevOps, and application teams.
  • Translate business requirements into scalable, secure data platform solutions and collaborate with stakeholders on platform architecture decisions.

Requirements

  • 7+ years of experience in data platforms, Hadoop, or big data administration.
  • Deep experience with Cloudera Machine Learning (CML) workspaces, projects, and runtimes.
  • Strong hands-on experience with Cloudera administration (CDH/CDP).
  • Strong experience with containerization technologies (Docker, Kubernetes) and OpenShift.
  • Proven experience managing and mentoring data engineering teams.
  • Solid background in ETL pipeline design and implementation for batch and streaming use cases.
  • Expertise with Spark, Hive, HDFS, YARN, Kafka, and orchestration tools such as Airflow or NiFi.
  • Ability to design and produce architecture diagrams for data storage and data flows, backed by a solid understanding of Linux, networking, and distributed systems.
  • Experience with cloud platforms (AWS, Azure, or GCP) is a strong plus.
  • Nice to have: Cloudera certification(s), exposure to data governance and metadata management tools, and experience in large-scale enterprise or regulated environments.

Benefits

  • Attractive, market-leading salary package.
  • Clear career advancement path with professional development opportunities.
  • One-year contract with Xenon7, with significant opportunity for renewal.
