Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Administer, maintain, and optimize Cloudera Hadoop clusters (CDP or legacy distributions) to ensure high availability and performance.
  • Manage upgrades, patching, monitoring, and troubleshooting of core Cloudera services (HDFS, YARN, Hive, HBase, Impala, Spark, Kafka, etc.).
  • Perform capacity planning, performance tuning, and cluster security hardening.
  • Design, document, and communicate data storage architectures and end-to-end data flow diagrams.
  • Oversee the design, development, and maintenance of ETL/ELT pipelines for batch and streaming workloads using tools such as Spark, Hive, Kafka, Airflow, and NiFi.
  • Review and guide data ingestion, transformation, and storage strategies to ensure data quality, reliability, and scalability.
  • Implement and maintain security and governance controls, including Kerberos, Ranger, data access controls, lineage, and compliance requirements.
  • Lead and mentor data engineering teams, set development/deployment/monitoring/documentation standards, and collaborate with data scientists, analysts, DevOps, and application teams.
  • Translate business requirements into scalable, secure data platform solutions and collaborate with stakeholders on platform architecture decisions.

Requirements

  • 7+ years of experience in data platforms, Hadoop, or big data administration.
  • Deep experience with Cloudera Machine Learning (CML) workspaces, projects, and runtimes.
  • Strong hands-on experience with Cloudera administration (CDH/CDP).
  • Strong experience with containerization technologies (Docker, Kubernetes) and OpenShift.
  • Proven experience managing and mentoring data engineering teams.
  • Solid background in ETL pipeline design and implementation for batch and streaming use cases.
  • Expertise with Spark, Hive, HDFS, YARN, Kafka, and orchestration tools such as Airflow or NiFi.
  • Ability to design and produce architecture diagrams for data storage and data flows, with solid understanding of Linux, networking, and distributed systems.
  • Experience with cloud platforms (AWS, Azure, or GCP) is a strong plus.
  • Nice to have: Cloudera certification(s), exposure to data governance and metadata management tools, and experience in large-scale enterprise or regulated environments.

Benefits

  • Attractive, market-leading salary package.
  • Clear career advancement path with professional development opportunities.
  • One-year contract with Xenon7, with significant opportunity for renewal.
