Research Scientist, RL for Autonomous Planning & World Modeling

3 hours, 13 minutes ago
Full-time
Senior
Software Development

Waymo

Waymo is an autonomous driving technology company building the Waymo Driver and operating Waymo One, its fully autonomous ride-hailing service.

Autonomous vehicles, robotics, AI, ride-hailing / mobility tech
Founded 2009
$21600M raised

Description

  • Participate in Waymo’s Foundation World Model post-training and evaluation.
  • Research and develop reinforcement learning and distillation techniques for autonomous vehicle trajectory planning.
  • Integrate emerging research from the broader AI community into Waymo’s internal reinforcement learning infrastructure.
  • Run rigorous ablation studies to identify and scale the most promising methods.
  • Partner with engineering and research teams across Waymo to share recipes, techniques, and post-training best practices.

Requirements

  • PhD or Master’s in Computer Science, Machine Learning, Robotics, or a similar technical field.
  • 3+ years of industry or post-doc research experience in Reinforcement Learning or Foundation Models.
  • Demonstrated original contributions through high-impact publications, technical blog posts, or significant open-source contributions.
  • Proficiency implementing model training flows in a scalable, distributed, and performant manner, including data parallel, FSDP, and other sharding approaches.
  • Willingness to work with the complexity of globally distributed inference infrastructure.
  • Preferred: PhD in Computer Science, Machine Learning, or Robotics with a research focus on Reinforcement Learning, Foundation Models, or Multi-Modal learning.
  • Preferred: Extensive experience designing and deploying reinforcement learning infrastructure, especially for on-policy learning or alignment with human preferences.
  • Preferred: First-author publications at top-tier venues such as NeurIPS, ICLR, or ICRA, or significant open-source ML projects.
  • Preferred: Experience with large-scale many-machine training infrastructure and inference techniques for large models such as model sharding or tensor-parallel.
  • Remote work is possible, but Waymo may not be able to employ remotely in all locations.

Benefits

  • Base salary range of $204,000 to $259,000 USD.
  • Eligibility for Waymo’s discretionary annual bonus program.
  • Eligibility for Waymo’s equity incentive plan.
  • Access to Waymo’s generous company benefits program, subject to eligibility requirements.
  • Remote work may be available depending on location.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Quantitative Researcher

Binance 5K-10K Capital Markets

Binance is hiring a Quantitative Researcher in its Quantitative Strategy team to develop and implement pricing, risk management, and agency trading strategies for its global trading platform.

Agile Blockchain Machine Learning Python Statistics
1 hour, 29 minutes ago

Senior Research Scientist - Personalization

Spotify Media

Spotify is hiring a Senior Research Scientist on its Personalization team to advance machine learning and AI research that shapes long-term recommendation and discovery experiences for millions of listeners.

Generative AI Machine Learning
1 hour, 43 minutes ago

Staff AI Forward Deployed Engineer

Databricks 1K-5K IT Services

Databricks is seeking a Staff AI Engineer / Staff Forward Deployed Engineer to work directly with customers on designing and productionizing advanced GenAI applications while shaping product direction and internal expertise.

Apache Spark AWS Azure GCP Generative AI Hugging Face Machine Learning MLflow Pandas PyTorch Scikit-learn
2 hours, 29 minutes ago

Research Software Engineer- ML for Manufacturing (Computational Design & Geometry Processing)

Foundation EGI 11-50 Technology, Information and Internet

Research Software Engineer at an MIT-born, venture-backed startup building an AI copilot for design and manufacturing, focused on applying machine learning to real engineering problems involving CAD, CAE, CAM, geometry, and simulation data.

LLM Machine Learning Python Reinforcement Learning
3 hours, 13 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers