Data Engineering Team Lead (Agentic Search)

5 hours, 32 minutes ago
Full-time
Senior
Data Science and Analytics
Nebius

Nebius enables B2B companies to build local hyperscaling cloud platforms with cost-effective GPUs, InfiniBand networking, and 50% lower compute costs. It offers managed Kubernetes and a launch-ready business model for innovative cloud solutions.

Internet Software & Services
51-250

Description

  • Lead and architect the data platform from real-time ingestion through warehouse medallion layers to consumer-facing datasets and dashboards.
  • Lead, hire, mentor, and grow a team of data engineers while setting standards for code quality, testing, documentation, and on-call practices.
  • Work closely with engineers across the company to ensure batch and streaming pipelines are built correctly.
  • Define and implement observability for the data platform, including data quality checks, freshness monitoring, lineage, schema evolution, and cost controls.
  • Partner with researchers, engineers, analysts, finance, and product managers to deliver trustworthy datasets for product and go-to-market analytics.
  • Define the objects, entities, and relationships that model the search domain and turn that understanding into a clean, queryable data model.
  • Ensure high standards of data quality, integrity, and security across all environments.
  • Stay hands-on by designing and reviewing the systems your team ships.
  • Own the end-to-end data lifecycle, including ingestion from production services and support for the data warehouse architecture.

Requirements

  • 5+ years of data engineering experience, with a focus on scalable, analytics-ready data models and cloud data warehouses such as BigQuery or Snowflake.
  • Hands-on experience with Snowflake or a comparable cloud data warehouse, preferably with medallion schema architecture.
  • Deep knowledge of databases, including schema design and query optimization, with familiarity with NoSQL use cases.
  • Experience with modern data orchestration and transformation frameworks such as Airflow and dbt.
  • Solid understanding of cloud data services such as AWS or GCP and streaming platforms such as Kafka or Pub/Sub.
  • Hands-on experience with the Spark / MapReduce paradigm and judgment on when distributed processing is appropriate.
  • Fluency in Python and SQL for production data work.
  • Experience operating data systems in production, including debugging under pressure, recovering from data incidents, and backfilling corrupted tables.
  • Experience leading, hiring, or mentoring engineers, as expected for a team lead role.
  • Authorization to work in the country where the candidate applies and ability to provide proof of employment eligibility.
  • Experience with data governance, quality, and security practices is preferred.

Benefits

  • Competitive compensation.
  • Career growth and learning opportunities.
  • Flexibility and work-life balance.
  • Collaborative and innovative culture.
  • Opportunity to work on impactful AI projects.
  • International environment with talented teams.


