Sr. Staff AI Research TLM - AI Systems

2 weeks, 1 day ago
Full-time
Lead
DevOps and Infrastructure
Databricks

Databricks is the pioneering data intelligence platform, empowering organizations worldwide to solve complex data challenges with AI-driven analytics solutions.

IT Services
1K-5K
Founded 2013
$4450M raised

Description

  • Lead and grow a multidisciplinary research team focused on LLM scaling, efficiency, and systems performance.
  • Define the scaling research roadmap aligned with Databricks’ strategic objectives.
  • Drive algorithmic innovations for large-scale training and inference, including optimizers, low-precision techniques, and model adaptation methods.
  • Design and run large-scale experiments and benchmark new methods against state-of-the-art approaches.
  • Optimize distributed training, parallelism, memory efficiency, and compute efficiency in collaboration with systems and infrastructure teams.
  • Work hands-on in Python and PyTorch to prototype research ideas and integrate them into production systems.
  • Establish metrics, evaluation protocols, and best practices for scaling-focused research across Databricks AI.
  • Partner with product and engineering leaders to translate research breakthroughs into customer-facing platform capabilities.
  • Champion responsible deployment, ensuring reliability, safety, and model behavior remain first-class considerations.
  • Mentor and develop researchers and engineers through technical guidance and career support.

Requirements

  • Proven ability to lead a research team developing novel techniques for foundation model efficiency with strong industry impact.
  • Deep expertise in at least one of: generative AI, LLMs, distributed ML systems, model optimization, or responsible AI.
  • Strong programming skills with demonstrated ability to write high-quality, efficient code in Python and PyTorch.
  • Experience translating research innovations into scalable product capabilities in partnership with product and engineering teams.
  • Excellent communication, leadership, and stakeholder management skills.
  • Prior work at the intersection of systems and ML, such as distributed training frameworks, compiler and kernel optimization, or memory-/compute-efficient model design (preferred).
  • Strong industry and academic network in large-scale ML, with collaborations or service roles at top conferences, such as program committee (PC) member or area chair (preferred).
  • A strong record of research impact, such as first-author publications at ICLR, ICML, NeurIPS, or MLSys, influential open-source contributions, or widely used deployed systems (preferred).

Benefits

  • Annual performance bonus eligibility.
  • Equity eligibility.
  • Competitive local pay range of $270,000 to $340,000 USD.
  • Comprehensive benefits and perks package.
  • Region-specific benefits details provided by Databricks.
  • Commitment to a diverse and inclusive workplace.
