Senior/Staff Deep Reinforcement Learning Engineer

22 hours, 35 minutes ago
Full-time
Senior
Software Development
DoorDash

DoorDash

DoorDash empowers small business owners by providing an affordable and convenient platform for local delivery services, primarily focusing on restaurant food delivery.

Air Freight & Logistics
10K-50K
Founded 2012

Description

  • Formulate complex driving tasks as reinforcement learning problems with well-shaped rewards and expressive state/action representations.
  • Design and train model-based deep reinforcement learning agents using GPU-accelerated simulation at massive scale.
  • Improve and maintain the simulator used for reinforcement learning training.
  • Build and maintain distributed training infrastructure in JAX across large compute clusters.
  • Develop agentic optimization systems that automate code improvement, experimentation, metric analysis, and policy iteration.
  • Own the full lifecycle of learned policies from problem definition and reward design to large-scale distributed training and deployment on the vehicle.
  • Help define how learned components integrate with the broader autonomy stack to produce robust, shippable behavior.

Requirements

  • BS, MS, or PhD in CS, EE, Robotics, or a related field.
  • Strong foundation in reinforcement learning and deep learning.
  • Hands-on experience training RL agents at scale, ideally in robotics, autonomous driving, or other real-time decision-making domains.
  • Proficiency in JAX or a similar functional ML framework.
  • Comfort with JIT compilation, vectorized environments, and GPU-accelerated simulation.
  • Deep understanding of core RL concepts, including policy gradients, value functions, exploration-exploitation, model-based RL, reward shaping, and sim-to-real transfer.
  • Data-driven mindset with experience building experiment pipelines and analyzing training runs.
  • Preferred: publications at top venues such as NeurIPS, ICML, ICLR, CoRL, RSS, or ICRA on RL or learned planning.
  • Preferred: experience building or working with GPU-accelerated simulators for RL training.
  • Preferred: track record of shipping a learned component in a production robotics or autonomous vehicle stack.

Benefits

  • Base salary range of $168,000 to $247,000 USD.
  • Opportunities for equity grants.
  • 401(k) plan with employer matching.
  • 16 weeks of paid parental leave.
  • Medical, dental, and vision benefits.
  • 11 paid holidays.
  • Disability and basic life insurance.
  • Flexible paid time off/vacation for salaried roles, plus 80 hours of paid sick time per year.
  • Wellness benefits, commuter benefits match, and a mental health program.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Scientist - Optical Signal Processing

STR 251-1K Aerospace & Defense

STR’s Maritime Domain Awareness Group is hiring a Senior Scientist to develop airborne passive optical signal processing algorithms for detecting and classifying low-SNR targets in cluttered national security environments.

Computer Vision MATLAB Python
2 hours, 11 minutes ago

Full-Stack AI Engineer

Pavago IT Services

Our client is hiring a Full-Stack AI Engineer to build and deploy production-ready AI applications that combine software engineering, machine learning, and cloud infrastructure across the full product lifecycle.

AWS Azure CI/CD Dagster Docker Elasticsearch FastAPI Flask GCP Git GitHub HIPAA Hugging Face Kubeflow Kubernetes LLM Machine Learning Microservices MLflow MLOps Next.js Node.js Prefect Prometheus Python PyTorch React REST API SageMaker Serverless Snowflake SQL TensorFlow TypeScript Vue.js
2 hours, 30 minutes ago

Data/ ML Solution Architect (GenAI, AWS)

Provectus 251-1K Professional Services

Provectus is hiring a Data/ML Solution Architect in a remote role to design and lead cloud and on-premise data and AI/ML solutions that support customer transformations and business outcomes.

Agile AWS AWS CDK Azure Docker GCP Generative AI Java Kubernetes Machine Learning Microservices MLflow MLOps Neo4j Python PyTorch Terraform TypeScript
3 hours, 48 minutes ago

Robotics & Simulation Engineer, Discovery

Anduril Industries 1K-5K Aerospace & Defense

Anduril Industries is hiring a Robotics & Simulation Engineer to build and own the simulation, training, deployment, and safety infrastructure that supports autonomous robotic systems for defense applications.

C++ Python
6 hours, 40 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers