Senior Machine Learning Infrastructure Engineer, Simulation

1 month ago
Full-time
Senior
Software Development

Waymo

Waymo is an autonomous driving technology company building the Waymo Driver and operating Waymo One, its fully autonomous ride-hailing service.

Autonomous vehicles, robotics, AI, ride-hailing / mobility tech
Founded 2009
$21.6B raised

Description

  • Provide technical leadership on large-scale machine learning model architectures for autonomous vehicle systems.
  • Design and scale distributed infrastructure across the ML lifecycle, including planet-scale dataset generation and model training.
  • Collaborate with Google DeepMind, Waymo Realism Modeling, and Waymo Oxford teams to improve simulation realism.
  • Work at the intersection of data engineering, model development, and deployment to guide architectural decisions and technical direction.
  • Translate product and business goals into measurable technical requirements and system-level deliverables.
  • Own large, complex systems and drive architectures that meet technical and business objectives.
  • Mentor junior engineers and help foster a collaborative engineering culture.

Requirements

  • BS in Computer Science, Robotics, a similar technical field, or equivalent practical experience.
  • 5+ years of professional software engineering experience.
  • At least 3 years of experience in machine learning infrastructure, including developing, scaling, training, deploying, and optimizing large-scale machine learning systems from data to model.
  • Preferred: 10+ years of professional software engineering experience.
  • Preferred: At least 5 years of experience in machine learning infrastructure.
  • Preferred: Experience with DeepSpeed, PyTorch, TensorFlow, or similar machine learning frameworks.
  • Preferred: Strong expertise in distributed training techniques, including gradient sharding and optimization strategies.
  • Preferred: Experience using ML accelerator profiling tools to identify performance bottlenecks.
  • Preferred: Deep understanding of state-of-the-art models such as autoregressive transformers, and familiarity with custom kernels for hardware efficiency.
  • Preferred: Excellent verbal and written communication skills.
  • Preferred: Practical familiarity with autonomous driving, simulations, and ML accelerators.

Benefits

  • Base salary range of $213,000 to $263,000 USD.
  • Eligibility for Waymo’s discretionary annual bonus program.
  • Eligibility for Waymo’s equity incentive plan.
  • Access to Waymo’s generous company benefits program, subject to eligibility requirements.
