Staff Machine Learning Infrastructure Engineer, Simulation

4 weeks, 1 day ago
Full-time
Senior
DevOps and Infrastructure

Waymo

Waymo is an autonomous driving technology company building the Waymo Driver and operating Waymo One, its fully autonomous ride-hailing service.

Autonomous vehicles, robotics, AI, ride-hailing / mobility tech
Founded 2009
$21600M raised

Description

  • Advance ultra-realistic multi-agent simulations using foundation models as part of a high-performing research engineering team.
  • Collaborate with Google DeepMind, Waymo Realism Modeling in London, and Waymo Oxford to improve simulation realism.
  • Provide technical leadership on large-scale ML model architectures for autonomous vehicle models.
  • Work across data engineering, model development, and deployment while guiding architectural decisions and technical direction.
  • Own large, complex systems and drive architectures that meet technical and business objectives.
  • Design and scale distributed systems across the ML lifecycle for planet-scale dataset generation and model training.
  • Partner cross-functionally to define performance and system-level requirements for large ML systems.
  • Translate product and business goals into measurable technical deliverables.
  • Mentor junior engineers and help foster a collaborative engineering culture.

Requirements

  • BS in Computer Science, Robotics, a similar technical field, or equivalent practical experience.
  • 5+ years of professional software engineering experience.
  • At least 3 years of experience in machine learning infrastructure, including developing, scaling, training, deploying, and optimizing large-scale ML systems from data to model.
  • MS in Computer Science, Robotics, a similar technical field, or equivalent practical experience (preferred).
  • 10+ years of professional software engineering experience (preferred).
  • At least 5 years of experience in machine learning infrastructure, including developing, designing, scaling, training, deploying, and optimizing large-scale ML systems from data to model (preferred).
  • Experience with ML infrastructure tools such as DeepSpeed, PyTorch, TensorFlow, or similar frameworks.
  • Strong expertise in distributed training techniques, including gradient sharding and optimization strategies for scaling large models.
  • Experience using ML accelerator profiling tools to uncover performance bottlenecks.
  • Deep understanding of modern ML models such as auto-regressive transformers and familiarity with custom kernels for hardware efficiency.
  • Practical familiarity with autonomous driving, simulations, and ML accelerators is a plus.

Benefits

  • Base salary range of £155,000 to £163,000 GBP.
  • Eligibility for Waymo’s discretionary annual bonus program.
  • Eligibility for Waymo’s equity incentive plan.
  • Access to Waymo’s generous company benefits program, subject to eligibility requirements.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Software Engineer - Machine Learning (Behaviors)

Motional 1K-5K Automotive

Motional’s Behaviors team is hiring an engineer to develop machine learning models that help autonomous vehicles understand and predict traffic behavior in complex real-world driving scenarios.

C++ Computer Vision Deep Learning Machine Learning Neural Networks Python PyTorch
1 day, 13 hours ago

PyTorch & MLOps AI Specialist

Weekday 11-50 Construction & Engineering

A leading AI lab’s Generative AI team is hiring an MLOps and ML Systems Engineer to support the development and evaluation of next-generation large language models and the training data that powers them.

Generative AI LLM MLOps PyTorch
1 day, 14 hours ago

Junior Python Developer - AI & Innovation Team

Adzuna 51-250 Internet Software & Services

Adzuna is hiring a Junior Python Developer to help build and maintain AI-powered jobseeker products and the production systems behind them for a remote team working in London hours.

Apache Spark AWS CSS EC2 Git GitHub HTML LLM Machine Learning MySQL Playwright PostgreSQL Python React Selenium Solr SQL Tailwind CSS
1 day, 14 hours ago

Staff AI/ML Engineer

Burq 11-50 Air Freight & Logistics

Burq is hiring a Staff AI/ML Engineer to build the core AI systems that automate logistics operations and improve real-time decision-making for the company’s delivery platform.

Computer Vision FastAPI MLOps Python Reinforcement Learning
1 day, 14 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers