Staff Data Engineer

1 hour, 19 minutes ago
Full-time
Lead
Software Development
Robots & Pencils

Robots & Pencils

Robots & Pencils is a digital innovation firm that assists organizations in leveraging mobile, web, and AI technologies to modernize their operations and create efficient, user-centered solutions that enhance productivity and decision-making.

IT Services
51-250
Founded 2009

Description

  • Define data architecture and platform strategy across pipelines, warehouses, and data lakes.
  • Build and optimize scalable data pipelines for batch and real-time processing.
  • Define and enforce data governance, quality standards, and compliance frameworks.
  • Build monitoring, logging, and alerting for data pipelines and services, and contribute to CI/CD workflows.
  • Drive data platform modernization with a focus on performance, cost, and scalability.
  • Design and implement data contracts and event flows with backend, platform, and engineering teams.
  • Lead the design and implementation of data pipelines for production AI/ML systems, including embeddings, vector stores, RAG preparation, feature stores, and training/inference data flows.
  • Integrate data services with APIs, middleware, and third-party systems to support end-to-end data consumption.
  • Partner with leadership on data strategy and translate technical decisions into actionable direction.
  • Collaborate with engineering, analytics, AI, and product teams to align data platforms with broader goals.
  • Mentor junior and mid-level engineers and establish standards that improve team-wide quality and consistency.

Requirements

  • 7+ years of professional data engineering experience, including leadership on complex data platform initiatives.
  • Strong system architecture background with expertise in distributed data systems.
  • Expert proficiency in Python, Scala, and SQL.
  • Deep experience with cloud-native data platforms and enterprise data warehousing.
  • Strong expertise in data pipeline orchestration and processing.
  • Strong experience with streaming platforms and real-time data processing, such as Kafka, Kinesis, or Pub/Sub.
  • Strong data modeling expertise and experience with data transformation.
  • Strong experience with data quality, governance, and compliance frameworks.
  • Strong experience with container orchestration and CI/CD for data systems.
  • Strong experience building data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows.
  • Demonstrated leadership and technical mentoring experience across a team or organization.
  • Strong stakeholder communication skills and the ability to translate technical depth across audiences.
  • Demonstrable day-to-day usage and expert knowledge of AI-forward coding tools such as Claude and Cursor.
  • Excellent problem-solving skills and the ability to navigate highly ambiguous technical and business challenges.
  • Experience with data mesh or data fabric concepts, lakehouse architectures, or governance framework implementation is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Data Engineer - Remote, Latin America

Bluelight Consulting 11-50 Internet Software & Services

Bluelight Consulting is hiring a remote Data Engineer in Latin America to build and support data integration and reporting systems that improve data accessibility, quality, and business decision-making.

Agile AWS EC2 Git Linux MySQL Oracle PostgreSQL Power BI REST API Salesforce SOAP SQL SQL Server Tableau XML
1 hour, 7 minutes ago

Revenue Intelligence Engineer

Greenhouse Software 251-1K Professional Services

Greenhouse is hiring a Revenue Intelligence Engineer to join its Revenue Operations team and build internal web applications and automation that improve productivity across the Customer Growth & Success lifecycle.

Firebase LLM Microservices Next.js Node.js Python Railway React Render Salesforce Supabase TypeScript Vercel
1 hour, 34 minutes ago

Software Engineer - Backend & Data (Eastern Europe)

SPATE 1-10 Diversified Consumer Services

Spate is hiring a remote backend engineer in Eastern Europe to help design and implement the next iteration of the data pipeline, storage, and API for its SaaS platform serving major beauty brands.

AWS Azure GCP Power BI Python SQL Tableau
1 hour, 34 minutes ago

Data Engineer - Remote, Latin America

Bluelight Consulting 11-50 Internet Software & Services

Bluelight Consulting is hiring a remote Data Engineer in Latin America to build and support data integration, pipeline, and reporting solutions for client projects in a fast-growing software consultancy.

Agile AWS EC2 Git Linux MySQL Oracle PostgreSQL Power BI REST API Salesforce SOAP SQL SQL Server Tableau XML
1 hour, 43 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers