JetBrains

JetBrains

JetBrains provides cutting-edge development tools like IntelliJ IDEA and Kotlin, automating tasks to boost productivity and foster innovation.

Internet Software & Services
1K-5K
Founded 2000

Description

  • Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
  • Train and adapt large language models for planning, tool use, and multi-step interactions in JetBrains IDEs.
  • Build simulation and evaluation environments where coding agents can perform and be measured on realistic developer tasks.
  • Design evaluation frameworks and metrics for agent behavior, and use traces and logs to improve training, data, and reward design.
  • Analyze training and evaluation results to improve model architectures, training recipes, and datasets.
  • Work with distributed GPU clusters and MapReduce-style infrastructure for training and data processing.
  • Collaborate with research, product, and infrastructure teams to turn product goals into models, experiments, and shipped features.

Requirements

  • Extensive hands-on experience training LLMs in pre-training, fine-tuning, or post-training settings.
  • Deep expertise in PyTorch and specialized LLM training stacks such as Megatron, NeMo, or verl.
  • Strong understanding of LLM fundamentals, including architectures, tokenization, data pipelines, batching, mixed precision, distributed training, and debugging unstable runs.
  • Ability to own projects end to end from problem definition through design, experimentation, implementation, and iteration.
  • Product-aware mindset with the ability to translate developer needs and failure modes into modeling and evaluation work.
  • At least 3 years of Python experience writing clean, maintainable code in modern ML codebases.
  • Experience with ML orchestrators or workflow tools such as Kubeflow, Dagster, Airflow, or ZenML, or schedulers like Kubernetes or SLURM (preferred).
  • Experience with large-scale data and training pipelines, including MapReduce-style clusters, multi-node GPU training, or workloads around 1M+ CPU/GPU hours (preferred).
  • Experience designing and maintaining evaluation pipelines for LLMs or agents, including metrics, dashboards, experiment tracking, and automated regression checks (preferred).
  • Experience with AI agent development, including tool-using agents, planners, or multi-step coding workflows (preferred).

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Biology & Biophysics Researchers (India, Part-time)

Weekday 11-50 Construction & Engineering

An AI lab client is hiring part-time life science researchers to help train and evaluate frontier AI systems on advanced biological and biophysical reasoning.

Machine Learning
15 hours, 23 minutes ago

Senior Research Engineer, Threat Intelligence

SecurityScorecard 251-1K IT Services

SecurityScorecard is hiring an engineering-focused Threat Intelligence team member to turn research findings into production-ready detections, feeds, and platform capabilities for STRIKE.

AWS CI/CD Cybersecurity Go Node.js Python Splunk SQL TypeScript
16 hours, 8 minutes ago

Principal ML Scientist, Multimodal Biological Reasoning

Flagship Pioneering 251-1K Biotechnology

Flagship Pioneering’s Pioneering Intelligence is seeking a Lead to shape and advance AI-driven biological reasoning systems that support scientific discovery across the company and its portfolio.

LLM Machine Learning
1 day, 15 hours ago

Senior Modeling and Simulation Engineer, Space

Anduril Industries 1K-5K Aerospace & Defense

Anduril Industries is hiring a Senior Modeling and Simulation Engineer to support its Space team in developing analysis, models, and simulations that inform U.S. Department of Defense space mission decisions.

GitHub GitLab Machine Learning MATLAB Python Reinforcement Learning SAP
2 days, 15 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers