Data Engineer

1 hour, 19 minutes ago
Full-time
Lead
Software Development
Robots & Pencils

Robots & Pencils

Robots & Pencils is a digital innovation firm that assists organizations in leveraging mobile, web, and AI technologies to modernize their operations and create efficient, user-centered solutions that enhance productivity and decision-making.

IT Services
51-250
Founded 2009

Description

  • Define data architecture and platform strategy across pipelines, warehouses, and data lakes.
  • Build and optimize scalable batch and real-time data pipelines.
  • Define and enforce data governance, quality standards, and compliance frameworks.
  • Build monitoring, logging, alerting, CI/CD workflows, and automation for data platforms.
  • Drive data platform modernization with a focus on performance, cost, and scalability.
  • Design and implement data contracts and event flows with backend, platform, and engineering teams.
  • Lead the design and implementation of data pipelines for production AI/ML systems, including embeddings, vector stores, RAG preparation, feature stores, and training/inference flows.
  • Integrate data services with APIs, middleware, and third-party systems.
  • Partner with leadership on data strategy and translate technical decisions into actionable direction.
  • Mentor junior and mid-level engineers and establish engineering standards across the team.

Requirements

  • 7+ years of professional data engineering experience, including leading complex data platform initiatives.
  • Strong system architecture background with expertise in distributed data systems.
  • Expert proficiency in Python, Scala, and SQL.
  • Deep expertise with cloud-native data platforms and enterprise data warehousing.
  • Strong expertise in data pipeline orchestration and processing.
  • Strong experience with streaming platforms and real-time data processing such as Kafka, Kinesis, or Pub/Sub.
  • Strong data modeling expertise and experience with data transformation.
  • Strong experience with data quality, governance, and compliance frameworks.
  • Strong experience with container orchestration and CI/CD for data systems.
  • Strong experience building data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows.
  • Demonstrated leadership and technical mentoring experience across a team or organization.
  • Strong stakeholder communication skills with the ability to translate technical depth across audiences.
  • Demonstrable day-to-day usage and expert knowledge of AI-forward coding tools such as Claude and Cursor.
  • Excellent problem-solving skills and the ability to navigate ambiguous technical and business challenges with sound judgment.
  • Experience with data mesh or data fabric concepts, lakehouse architectures, or governance framework implementation is a plus.
  • Experience with handling and modeling data in the healthcare industry is a plus.
  • AWS certifications, such as Certified Data Engineer – Associate, are strongly preferred.

Benefits

  • The role is based at Robots & Pencils, an applied AI engineering firm working across enterprise clients and industries.
  • The company is all in on AWS and focused on getting AI into production quickly.
  • Teams average 15+ years of experience, offering a highly senior working environment.
  • Delivery centers span Canada, the United States, Eastern Europe, and Latin America.
  • The role supports meaningful, real-world enterprise work with measurable client impact.
  • Employment may be contingent on successful completion of a background check, in accordance with local legislation.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Data Engineer - Remote, Latin America

Bluelight Consulting 11-50 Internet Software & Services

Bluelight Consulting is hiring a remote Data Engineer in Latin America to build and support data integration and reporting systems that improve data accessibility, quality, and business decision-making.

Agile AWS EC2 Git Linux MySQL Oracle PostgreSQL Power BI REST API Salesforce SOAP SQL SQL Server Tableau XML
1 hour, 7 minutes ago

Revenue Intelligence Engineer

Greenhouse Software 251-1K Professional Services

Greenhouse is hiring a Revenue Intelligence Engineer to join its Revenue Operations team and build internal web applications and automation that improve productivity across the Customer Growth & Success lifecycle.

Firebase LLM Microservices Next.js Node.js Python Railway React Render Salesforce Supabase TypeScript Vercel
1 hour, 34 minutes ago

Software Engineer - Backend & Data (Eastern Europe)

SPATE 1-10 Diversified Consumer Services

Spate is hiring a remote backend engineer in Eastern Europe to help design and implement the next iteration of the data pipeline, storage, and API for its SaaS platform serving major beauty brands.

AWS Azure GCP Power BI Python SQL Tableau
1 hour, 34 minutes ago

Data Engineer - Remote, Latin America

Bluelight Consulting 11-50 Internet Software & Services

Bluelight Consulting is hiring a remote Data Engineer in Latin America to build and support data integration, pipeline, and reporting solutions for client projects in a fast-growing software consultancy.

Agile AWS EC2 Git Linux MySQL Oracle PostgreSQL Power BI REST API Salesforce SOAP SQL SQL Server Tableau XML
1 hour, 43 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers