Machine Learning Systems Engineer

22 minutes ago
Full-time
Mid Level
DevOps and Infrastructure
Motional

Motional

Motional is a leading company in driverless technology and autonomous vehicles, leveraging decades of industry expertise to develop and deploy safe and reliable autonomous vehicles. With a powerful DNA combining Aptiv's automotive technology and Hyunda...

Automotive
1K-5K
Founded 2020
$20M raised

Description

  • Profile and optimize training performance by identifying bottlenecks in data loading, gradient computation, and communication.
  • Implement system-level optimizations such as kernel fusion, sharding, and tiling to improve step time.
  • Optimize distributed training pipelines using frameworks such as PyTorch Distributed.
  • Design and maintain high-performance GPU kernels in Triton or CUDA for machine learning workloads.
  • Engineer robust data loading pipelines that maximize training throughput.
  • Work at the intersection of machine learning research and high-performance systems engineering to support large-scale distributed training.
  • Improve core infrastructure that enables researchers to train frontier models at scale.

Requirements

  • Bachelor’s, Master’s degree, or PhD in Computer Science, Computer Engineering, or a related technical discipline.
  • Strong proficiency in Python.
  • Extensive hands-on experience with PyTorch.
  • Experience optimizing machine learning model execution during training and inference.
  • Strong understanding of fundamental machine learning concepts, architectures, and processes.
  • Exceptional analytical and problem-solving skills with a bias for action and a data-driven approach.
  • Experience with profiling tools such as Nsight or PyTorch Profiler (preferred).
  • Experience with Triton or CUDA for GPU kernel development (preferred).
  • Experience with distributed training frameworks such as PyTorch Distributed (preferred).

Benefits

  • Hybrid schedule with in-office time in Boston, Pittsburgh, or Las Vegas, or the option to work fully remote.
  • Base salary range of $144,000 to $192,000 USD.
  • Additional compensation may include a bonus or company equity.
  • Medical, dental, and vision insurance.
  • 401(k) with company match.
  • Health savings accounts.
  • Life insurance and pet insurance.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Associate AI/ML Engineer

66degrees 251-1K IT Services

66degrees is hiring an Associate in AI/ML to build and support Google Cloud–based Generative AI and LLM systems that turn business requirements into scalable, production-ready solutions.

GCP Generative AI LLM MLOps Python SQL Vertex AI
8 minutes ago

Machine Learning Engineer I, Personalization , Minesweeper

Spotify Media

Spotify is hiring a Machine Learning Engineer I for its Personalization team to build and improve content-enrichment systems that understand music, podcasts, and audiobooks for recommendations and listening experiences.

Agile Apache Spark AWS GCP Java LLM Machine Learning Python PyTorch Scala SQL TensorFlow
8 minutes ago

Machine Learning Engineer

CloudWalk 51-250 Diversified Financial Services

CloudWalk is hiring a Machine Learning Engineer to develop real-time, edge-deployed security models that detect and block attacks across high-volume network traffic for its cybersecurity and risk & compliance environment.

AWS CDN Cloudflare GCP HTTP LLM Machine Learning Python PyTorch Rust Scikit-learn SQL TensorFlow TypeScript WAF
8 minutes ago

Staff Machine Learning Engineer, Credit Products (Square Financial Services)

Block 10K-50K Capital Markets

Block is hiring a Machine Learning Engineer on its Credit and Lending team to own and evolve the credit decisioning systems that support regulated banking products and expand access to credit for underserved customers.

Machine Learning Neural Networks
53 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers