Data Engineer (Python)

1 month, 1 week ago
Full-time
Mid Level
Software Development
Orcrist Technologies

Orcrist Technologies

Orcrist Technologies specializes in providing advanced technology solutions, including data analytics, AI applications, and cybersecurity, aimed at empowering businesses to innovate and transform through the use of artificial intelligence and data-driv...

Internet Software & Services

Description

  • Prototype batch and streaming ingestion and connector patterns using NiFi, Kafka, Kafka Connect/Streams, and CDC approaches.
  • Design schemas and data models that are easy to adopt and support clear semantics and controlled evolution.
  • Build incremental lakehouse datasets using Hudi, Iceberg, or Delta patterns and create queryable outputs for performance and latency evaluation.
  • Incorporate data quality, provenance, metadata, and operability considerations early in prototype development.
  • Containerize and deploy prototypes on Kubernetes and provide minimal runbooks and configuration files to support handoff.
  • Produce adoption artifacts including schemas, reference implementations, technical design notes, and integration backlogs.
  • Generate credible performance and operability readouts for new data initiatives.
  • Collaborate with delivery or foundation teams to enable productization of prototypes.

Requirements

  • 3+ years of data engineering experience, with hands-on pipeline delivery beyond ad hoc scripts.
  • Strong Python and SQL skills for transformations, validation tooling, and pipeline glue code.
  • Practical knowledge of streaming and CDC concepts, including ordering, duplication, replay, and idempotency.
  • Experience with the Kafka ecosystem.
  • Familiarity with lakehouse and storage/query layers such as Hudi, Iceberg, Delta, Trino, Hive, or Postgres.
  • Comfort working in Kubernetes and container-based environments.
  • Ability to document technical decisions clearly.
  • Eligible to work in Germany; EU or NATO citizenship is preferred and export-control screening applies.
  • Experience with Great Expectations or similar data quality tools is preferred.
  • Experience with metadata and lineage platforms such as OpenMetadata, DataHub, or Atlas is preferred.
  • Experience shipping in on-prem or air-gapped environments is preferred.
  • Governance and policy awareness for regulated customers is preferred.
  • German language proficiency at B1+ level is preferred.
  • Experience with OSINT, GEOINT, or multi-INT data shapes is preferred.

Benefits

  • Remote-first work in Germany with regular Berlin prototyping sprints.
  • 30 days of vacation.
  • Equipment and learning budget.
  • Opportunity to work with a modern data stack including Kafka, NiFi, lakehouse technologies, distributed SQL, and Kubernetes.
  • High-leverage work where prototypes become blueprints reused and productized by multiple teams.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Python Data Engineer

Flatgigs Professional Services

FlatGigs is hiring a Python Data Engineer to build and maintain reliable, scalable data pipelines and ETL workflows that support analytics and data-driven decision-making.

Apache Airflow AWS Azure Docker GCP Kubernetes MongoDB NumPy Pandas PostgreSQL Python SQL
1 hour, 45 minutes ago

Data Engineer- Remote

DeepSource 1-10 Internet Software & Services

Data Engineer at the organization responsible for designing and maintaining data architecture and delivering data solutions that support the company’s data needs.

Apache Spark Azure Databricks Pandas Python SQL SQL Server Terraform
2 hours, 39 minutes ago

Senior Data Infrastructure Engineer

Voltus 251-1K Electric Utilities

Voltus is hiring a Senior Data Infrastructure Engineer to own and strengthen the core data systems that power analytics, reporting, and future AI-ready applications for its remote climate-tech platform.

Apache Airflow AWS Dagster Databricks Datadog dbt GCP Go Jupyter Looker Machine Learning Mode Prometheus Python Redash Superset
5 hours, 56 minutes ago

Senior Software Engineer - Data Integration & JVM Ecosystem

ClickHouse 51-250 IT Services

ClickHouse is hiring a Senior Software Engineer for its Connectors team to build and maintain JVM-based data integrations that connect the database to widely used data engineering and visualization platforms.

Apache Airflow Apache Spark dbt Grafana HTTP Java Kafka Metabase Pandas Power BI Python SQL Tableau TCP/IP
7 hours, 35 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers