Sr. Staff AI Research TLM - AI Systems

2 hours, 41 minutes ago
Full-time
Lead
DevOps and Infrastructure
Databricks

Databricks is the pioneering data intelligence platform, empowering organizations worldwide to solve complex data challenges with AI-driven analytics solutions.

IT Services
1K-5K
Founded 2013
$4450M raised

Description

  • Lead and grow a multidisciplinary research team focused on LLM scaling, efficiency, and systems performance.
  • Define the scaling research roadmap aligned with Databricks’ strategic objectives.
  • Drive algorithmic innovations for large-scale training and inference, including optimizers, low-precision techniques, and model adaptation methods.
  • Design and run large-scale experiments and benchmark new methods against state-of-the-art approaches.
  • Optimize distributed training, parallelism, memory efficiency, and compute efficiency in collaboration with systems and infrastructure teams.
  • Work hands-on in Python and PyTorch to prototype research ideas and integrate them into production systems.
  • Establish metrics, evaluation protocols, and best practices for scaling-focused research across Databricks AI.
  • Partner with product and engineering leaders to translate research breakthroughs into customer-facing platform capabilities.
  • Champion responsible deployment, ensuring reliability, safety, and model behavior remain first-class considerations.
  • Mentor and develop researchers and engineers through technical guidance and career support.

Requirements

  • Proven ability to lead a research team that develops novel techniques for foundation model efficiency and delivers strong industry impact.
  • Deep expertise in at least one of: generative AI, LLMs, distributed ML systems, model optimization, or responsible AI.
  • Strong programming skills with demonstrated ability to write high-quality, efficient code in Python and PyTorch.
  • Experience translating research innovations into scalable product capabilities in partnership with product and engineering teams.
  • Excellent communication, leadership, and stakeholder management skills.
  • Prior work at the intersection of systems and ML, such as distributed training frameworks, compiler and kernel optimization, or memory-/compute-efficient model design (preferred).
  • Strong industry and academic network in large-scale ML, with collaborations or service at top conferences, such as program committee (PC) or area chair roles (preferred).
  • A strong record of research impact, such as first-author publications at ICLR, ICML, NeurIPS, or MLSys, influential open-source contributions, or widely used deployed systems (preferred).

Benefits

  • Annual performance bonus eligibility.
  • Equity eligibility.
  • Competitive local pay range of $270,000 to $340,000 USD.
  • Comprehensive benefits and perks package.
  • Region-specific benefits details provided by Databricks.
  • Commitment to a diverse and inclusive workplace.
