ML Systems Engineer, ML Acceleration

1 month ago
Full-time
Mid Level
Software Development
Motional

Motional is a leading company in driverless technology and autonomous vehicles, leveraging decades of industry expertise to develop and deploy safe and reliable autonomous vehicles. With a powerful DNA combining Aptiv's automotive technology and Hyunda...

Automotive
1K-5K
Founded 2020
$20M raised

Description

  • Profile training runs and resolve bottlenecks in data loading, gradient computation, and communication using tools such as Nsight and PyTorch Profiler.
  • Implement performance improvements including kernel fusion, sharding, and tiling to reduce step time.
  • Optimize distributed training pipelines using frameworks such as PyTorch Distributed.
  • Design, develop, and maintain high-performance GPU kernels in Triton or CUDA for machine learning workloads.
  • Improve and maintain data loading pipelines to maximize training throughput.
  • Work at the intersection of machine learning research and high-performance systems engineering to support large-scale distributed training.
  • Contribute to reducing time-to-convergence for next-generation model training.
  • Focus on improving speed, cost, reliability, and throughput of core ML systems.

Requirements

  • Bachelor’s degree, Master’s degree, or PhD in Computer Science, Computer Engineering, or a related technical discipline.
  • Strong proficiency in Python.
  • Extensive hands-on experience with PyTorch.
  • Experience optimizing machine learning model execution during training and inference.
  • Strong understanding of fundamental machine learning concepts, architectures, and processes.
  • Exceptional analytical and problem-solving skills.
  • Bias for action and a data-driven approach to technical challenges.
  • Experience with profiling tools such as Nsight or PyTorch Profiler (preferred).
  • Experience with Triton, CUDA, or distributed training frameworks such as PyTorch Distributed (preferred).

Benefits

  • This posting does not list any specific benefits or perks.
  • Motional emphasizes a progressive, global, diverse, and inclusive workplace.
  • Headquartered in Boston with operations in the U.S. and Asia.
  • Motional participates in E-Verify for newly hired employees.
