Dataiku

Dataiku

Dataiku develops a collaborative data science software platform that empowers teams to explore, prototype, build, and deliver data products, facilitating the transformation of business data into impactful insights through Everyday AI.

Internet Software & Services
251-1K
Founded 2013
$847M raised

Description

  • Develop and maintain system integrations, platform automations, and platform configurations within the Dataiku platform.
  • Build and maintain Python and SQL data replication workflows and data pipelines for large, complex datasets.
  • Support data operations, troubleshooting, monitoring, and ongoing platform reliability activities.
  • Develop data quality metrics and observability to improve data quality standards.
  • Learn and work across existing data platform, data engineering, and data governance systems and processes.
  • Perform root cause analysis on complex pipeline and platform issues to help ensure availability.
  • Build and maintain administration tools for monitoring, alerting, observability, access management, platform metrics, and user transparency.
  • Design data models for short-term and long-term use cases to support warehouse scalability.
  • Collaborate with analytics engineers and stakeholders across Infra, Product, Engineering, and embedded analytics teams.
  • Help test new Dataiku and partner tool features, provide feedback, and document internal systems and processes.

Requirements

  • 2+ years of relevant experience in data engineering or data platform engineering.
  • Strong technical skills in SQL and Python.
  • Experience with Dataiku DSS is a strong plus.
  • Prior experience with Snowflake is a plus.
  • Experience with DevOps tools such as GitHub Actions, Azure DevOps, or Jenkins.
  • Experience building data models.
  • Experience building and maintaining replication and data pipelines in a cloud data warehouse or data lake environment.
  • Strong analytical and creative problem-solving skills.
  • Ability to manage multiple projects and time constraints in a high-trust remote environment.
  • Excellent written and verbal communication skills, especially with senior stakeholders, and the ability to create clear, precise documentation.

Benefits

  • Internal, non-client-facing role with the opportunity to work on Dataiku’s own enterprise data platform.
  • Remote-friendly, high-trust work environment.
  • Opportunity to work on scalable, governed data systems that support the broader company.
  • Strong emphasis on collaboration, innovation, and personal growth.
  • Equal opportunity employer committed to dignity, decency, fairness, and inclusion.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Ingeniero de Datos Senior (Informatica CDI)

NEORIS 5K-10K Internet Software & Services

EPAM/NEORIS busca un Ingeniero de Datos Senior para desarrollar y asegurar soluciones de integración y plataforma de datos en un entorno multicultural con tecnologías cloud y de datos empresariales.

Agile Azure Databricks MongoDB Oracle PostgreSQL Power BI SQL Server
56 minutes ago

Senior Data Engineer

phData 251-1K IT Services

phData is building a pipeline of experienced software, data, and analytics professionals to support future client work delivering production-grade solutions across modern cloud data platforms.

Apache Airflow Apache Spark AWS Azure Cassandra Databricks dbt Elasticsearch GCP Hadoop HDFS Java Kafka Luigi Python Scala Snowflake Solr SQL
1 hour, 49 minutes ago

Staff Data Engineer

SecurityScorecard 251-1K IT Services

SecurityScorecard is hiring a Staff Engineer to lead its Data Engineering team in building and maintaining large-scale data infrastructure that powers cybersecurity ratings for companies worldwide.

Apache Airflow Apache Spark AWS ClickHouse Cybersecurity Docker Git Hive Jenkins JSON Kafka PostgreSQL Python Redis Scala Terraform XML
2 hours, 1 minute ago

Data Engineer

Verisian 1-10 Pharmaceuticals

Verisian is hiring a Data Engineer to build the platform and data pipelines that help clinical trial evidence, validation, and regulatory workflows move medical therapies to market faster and more safely.

Docker Git GitHub Actions Java Kubernetes Next.js Python SQL
3 hours, 1 minute ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers