Reinforcement Learning Engineer

1 month ago
Full-time
Junior
Data Science and Analytics

Code Metal

Code Metal builds verifiable code translation software for mission-critical industries. Its platform translates code across languages and hardware architectures, then verifies correctness, compliance, and performance for use cases like edge development, code portability, and code modernization.

Software Development
51-200
Founded 2023
$178M raised

Description

  • Build and maintain robust distributed training systems using PyTorch.
  • Design and implement scalable data curation and quality assurance pipelines for training datasets.
  • Develop orchestration tools to manage complex workflows across large-scale model training and evaluation.
  • Research and develop evaluation frameworks for AI model performance.
  • Build reinforcement learning solutions, including approaches using Reinforcement Learning with Human Feedback (RLHF).
  • Apply RLHF to large language models, with an emphasis on code generation tasks.
  • Engage with frontier research through open-source work and potential publications.
  • Support practical AI projects that address real-world challenges for leading chip manufacturers.

Requirements

  • 2+ years of experience in distributed training, preferably with PyTorch.
  • Strong background in reinforcement learning.
  • Recent experience with RLHF is highly preferred.
  • Proven ability to build data curation and quality assurance pipelines.
  • Experience developing evaluation frameworks.
  • Experience across both data pipeline and orchestration work is ideal.
  • Eligibility for TS/SCI clearance.
  • Contributions to open-source AI or ML projects are nice to have.
  • Published work or demonstrable research experience in related fields is nice to have.
  • Hands-on experience applying RLHF to LLMs, especially for code generation, is nice to have.
  • Experience with large-scale synthetic data generation is nice to have.

Benefits

  • 100% employer-paid health care coverage, including medical, dental, and vision.
  • 401(k) with 5% matching.
  • Uncapped vacation plus sick leave and public holidays.
  • Flexible hybrid work arrangement.
  • Relocation assistance for qualifying employees.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

AI Security Researcher

Wiz 251-1K IT Services

Wiz is hiring an AI Security Researcher to investigate high-impact risks in cloud- and AI-native environments and turn those findings into product capabilities.

AWS Azure GCP Go Kubernetes Python SQL
10 hours, 29 minutes ago

Senior Manipulation Engineer (Imitation Learning), Discovery

Anduril Industries 1K-5K Aerospace & Defense

Anduril Industries is hiring a Senior Manipulation Engineer to develop and deploy learned robot manipulation policies for real hardware in unstructured environments.

Computer Vision Deep Learning Python PyTorch
1 day, 9 hours ago

AI Research Engineer

Nice Côte d'Azur Hotels, Restaurants & Leisure

NiCE is hiring an AI Research Engineer to support the development and validation of data solutions for NICE Enlighten AI’s customer experience analytics platform.

1 day, 12 hours ago

Sr. Staff AI Research TLM - AI Systems

Databricks 1K-5K IT Services

Databricks is seeking a Principal Research Scientist to lead its AI Scaling team in advancing large-scale machine learning and LLM efficiency research that improves how customers train, serve, and adapt models in production.

Apache Spark Generative AI LLM MLflow Python PyTorch
1 day, 22 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers