AI Benchmark & Datasets Engineer/Researcher Internship

15 hours, 21 minutes ago
Internship
Entry Level
Data Science and Analytics
Pathway Genomics

Pathway Genomics is a global leader in genetic testing and personalized healthcare, integrating AI and deep learning for actionable precision health information worldwide.

Health Care Providers & Services
51-250
Founded 2008
$40M raised

Description

  • Identify, prioritize, and curate public and client-driven benchmarks for target use cases and markets.
  • Evaluate candidate benchmarks for clarity, data quality, evaluation methodology, and fit with the model roadmap.
  • Run benchmarks with baseline models to validate setup, uncover edge cases, and reduce risk in R&D runs.
  • Prepare benchmark-ready packages for R&D, including specs, data, evaluation scripts, expected metrics, and constraints.
  • Maintain shared documentation and vocabulary around benchmarks, datasets, and evaluation formats for GTM and R&D.
  • Track and organize benchmark results, model leaderboards, and customer-specific definitions of "what good looks like."
  • Contribute to demos and public-facing proof points based on benchmark outcomes.
  • Support the broader benchmarking process for AI model evaluation and help shape how the company communicates results.

Requirements

  • A cover letter with 2-3 lines of introduction.
  • One of the following: ICPC World Finalist; IOI, IMO, IOAI, or IPhO medalist in high school; published paper at an A-rated or A*-rated venue; coding projects with a GitHub repository; prior internship at a leading ML research center such as Google Brain, DeepMind, Apple, Meta, Anthropic, Nvidia, or MILA; or a warm recommendation from a university faculty member.
  • Experience with ML/LLM evaluation, data science, or technical product roles, ideally involving benchmarks or experimentation.
  • Comfort reading papers, leaderboards, and GitHub repos and turning them into clear, repeatable benchmark specs.
  • Ability to communicate with both engineers and customers and translate between technical detail and business value.
  • Strong attention to high-quality data, reproducible experiments, and crisp documentation.
  • Respectful interpersonal skills and fluent English.
  • Preferred start date of June 2026.
  • Ability to work from or meet with team members in Paris or Wroclaw, with permanent residence generally required in the EU, UK, US, or Canada.

Benefits

  • Intellectually stimulating work environment.
  • Opportunity to collaborate on a cutting-edge research project during the internship.
  • Chance to work on new "Live AI" challenges.
  • Experience at an early-stage AI startup focused on impactful research and foundational change.
  • Internship duration of 3-6 months.
  • Compensation based on profile and location.
  • Possibility to work or meet with team members in Paris, France or Wroclaw, Poland.
