Senior Data Engineer, AI and Systems Engineering

1 hour, 42 minutes ago
Full-time
Lead
Software Development
Dropbox

Dropbox is a technology company that builds simple, powerful products for individuals and businesses. With over 700 million registered users worldwide, Dropbox offers file sync, sharing, online backup, cloud storage, collaboration tools, and more to st...

Internet Software & Services
1K-5K
Founded 2007

Description

  • Design and build scalable data pipelines using Databricks and Spark to ingest, transform, and unify data from multiple enterprise systems.
  • Develop and maintain medallion architecture data models to create reliable and performant golden record datasets.
  • Implement data normalization, mapping, and entity resolution techniques to unify asset data across disparate systems.
  • Build data workflows to detect and surface Shadow IT across financial, identity, endpoint, and network signals and integrate the results into CMDB systems.
  • Partner with IT, Security, Finance, Procurement, and GRC teams to define and enforce data standards for critical CMDB attributes.
  • Develop and maintain data integrations and APIs to synchronize curated datasets into operational systems such as ServiceNow and Jira Assets.
  • Monitor, troubleshoot, and improve data quality, reliability, and observability across the data platform.
  • Help maintain a stable platform and occasionally participate in on-call support for bugs, outages, and other operational issues.

Requirements

  • 9+ years of experience building and maintaining data pipelines and large-scale data platforms.
  • Strong experience with Databricks, Apache Spark, and SQL for distributed data processing and transformation.
  • Experience designing data models and architectures such as medallion architecture, data lakes, or lakehouse systems.
  • Proficiency in Python or similar programming languages for data engineering and ETL development.
  • Experience integrating data from multiple enterprise systems such as SaaS tools, financial systems, and identity systems.
  • Strong understanding of data quality, data governance, and entity resolution techniques across heterogeneous datasets.
  • Excellent collaboration and communication skills, with experience working cross-functionally with technical and non-technical stakeholders.
  • Experience working with CMDB systems such as Jira Assets or ServiceNow (preferred).
  • Familiarity with identity, security, or IT asset management systems such as Okta, Jamf, or Zscaler (preferred).
  • Experience implementing cost-optimized data processing strategies in cloud environments (preferred).
  • Exposure to financial data systems such as Oracle or Concur and spend analytics use cases (preferred).
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field (preferred).

Benefits

  • Poland pay range: 239 700 zł to 324 300 zł (PLN).


