Senior Research Engineer (Code World Models)

8 hours, 5 minutes ago
JetBrains

JetBrains

JetBrains provides cutting-edge development tools like IntelliJ IDEA and Kotlin, automating tasks to boost productivity and foster innovation.

Internet Software & Services
1K-5K
Founded 2000

Description

  • Design and run pre-training, continued pre-training, and mid-training experiments for code models.
  • Build and improve large-scale data pipelines for model training, including filtering, deduplication, mixture design, and dataset quality checks.
  • Work with code corpora, repositories, tests, execution traces, and synthetic data.
  • Develop evaluations for complex repository-level code reasoning tasks.
  • Collaborate with researchers and engineers working on machine learning for code and AI developer tools.

Requirements

  • Hands-on experience with model pre-training, continued training, or mid-training.
  • Strong engineering skills in Python and experience with modern machine learning frameworks.
  • Understanding of large-scale ML training workflows, including data processing, distributed training, checkpointing, evaluation, experiment tracking, and debugging.
  • Experience working with large datasets and attention to data quality, contamination, sampling, and reproducibility.
  • Background in NLP, ML for software engineering, or a similar domain.
  • Ability to work on high-uncertainty research problems and turn ideas into working experiments.
  • Experience training or adapting models for code generation, code understanding, software agents, program repair, test generation, or repository-level reasoning is a plus.
  • Experience with execution-based data such as unit tests, traces, logs, compiler feedback, runtime states, or sandboxed code execution is a plus.
  • Experience with large-scale distributed training of models with 70B+ parameters is a plus.
  • Understanding of evaluation challenges for code models, including benchmark contamination, flaky tests, execution-based scoring, and long-horizon task evaluation is a plus.
  • Contributions to ML infrastructure, open-source projects, or research systems is a plus.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior/Staff Applied Research Software Engineer

Twilio 5K-10K Diversified Telecommunication Services

Twilio is hiring remote Senior or Staff Applied Software Research Engineers in India to build and prototype new full-stack products and emerging technology solutions for its incubation team.

Angular AWS Azure Java JavaScript LLM Machine Learning Node.js Python React Spring Boot SQL
7 hours, 50 minutes ago

Senior / Staff Applied Research Software Engineer - Emerging Tech

Twilio 5K-10K Diversified Telecommunication Services

Twilio is hiring Senior or Staff Applied Software Research Engineers for its remote U.S.-based Emerging Technologies incubation team to help prototype and build new AI- and communications-driven product ideas.

Angular AWS Azure Java JavaScript LLM Machine Learning Node.js Python React Spring Boot SQL
8 hours, 5 minutes ago

Computational Materials Scientist (Contract)

subsense Biotechnology

Subsense is hiring a remote computational materials scientist to perform first-principles simulations that evaluate candidate nanoparticle core and core/shell designs for its brain-computer interface research and help prioritize the most promising materials for experimental follow-up.

Python
8 hours, 35 minutes ago

Quant Researcher

EXANTE Capital Markets

EXANTE is hiring a Quantitative Researcher to develop and implement automated crypto trading strategies and support the firm’s production trading systems within its global wealth-tech platform.

Blockchain Machine Learning NumPy Pandas Python Statistics
8 hours, 50 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers