JetBrains

JetBrains provides cutting-edge development tools like IntelliJ IDEA and Kotlin, automating tasks to boost productivity and foster innovation.

Internet Software & Services
1K-5K employees
Founded 2000

Description

  • Design, implement, and maintain supervised fine-tuning (SFT) and reinforcement learning (RL) post-training pipelines for multi-step coding agents.
  • Train and adapt large language models for planning, tool use, and multi-step interactions in JetBrains IDEs.
  • Build simulation and evaluation environments where coding agents can perform and be measured on realistic developer tasks.
  • Design evaluation frameworks and metrics for agent behavior, and use traces and logs to improve training, data, and reward design.
  • Analyze training and evaluation results to improve model architectures, training recipes, and datasets.
  • Work with distributed GPU clusters and MapReduce-style infrastructure for training and data processing.
  • Collaborate with research, product, and infrastructure teams to turn product goals into models, experiments, and shipped features.

Requirements

  • Extensive hands-on experience training LLMs in pre-training, fine-tuning, or post-training settings.
  • Deep expertise in PyTorch and specialized LLM training stacks such as Megatron, NeMo, or verl.
  • Strong understanding of LLM fundamentals, including architectures, tokenization, data pipelines, batching, mixed precision, distributed training, and debugging unstable runs.
  • Ability to own projects end to end from problem definition through design, experimentation, implementation, and iteration.
  • Product-aware mindset with the ability to translate developer needs and failure modes into modeling and evaluation work.
  • At least 3 years of Python experience writing clean, maintainable code in modern ML codebases.
  • Experience with ML orchestrators or workflow tools such as Kubeflow, Dagster, Airflow, or ZenML, or schedulers like Kubernetes or SLURM (preferred).
  • Experience with large-scale data and training pipelines, including MapReduce-style clusters, multi-node GPU training, or workloads of 1M+ CPU/GPU hours (preferred).
  • Experience designing and maintaining evaluation pipelines for LLMs or agents, including metrics, dashboards, experiment tracking, and automated regression checks (preferred).
  • Experience with AI agent development, including tool-using agents, planners, or multi-step coding workflows (preferred).
