AI Benchmark & Datasets Engineer/ Researcher Internship

15 hours, 21 minutes ago
Internship
Entry Level
Data Science and Analytics
Pathway Genomics

Pathway Genomics

Pathway Genomics is a global leader in genetic testing and personalized healthcare, integrating AI and deep learning for actionable precision health information worldwide.

Health Care Providers & Services
51-250
Founded 2008
$40M raised

Description

  • Proactively identify, prioritize, and curate relevant public and client-driven benchmarks across target use cases and markets.
  • Evaluate candidate benchmarks for clarity, data quality, evaluation methodology, and fit with the company’s model roadmap.
  • Run benchmarks with baseline models to validate setups, uncover edge cases, and de-risk R&D experiments.
  • Prepare and hand off benchmark-ready packages to R&D, including specs, data, evaluation scripts, expected metrics, and constraints.
  • Maintain shared vocabulary and documentation around benchmarks, datasets, and evaluation formats for use by GTM and R&D teams.
  • Track and organize benchmark results, maintain model leaderboards, and define “what good looks like” for different customers and scenarios.
  • Contribute to demos and public-facing proof points based on benchmark outcomes and measurements.
  • Help define and drive the overall benchmarking process and standards for AI model evaluation.

Requirements

  • Include a brief cover letter (2–3 lines) with your application.
  • Must meet at least one of the following: ICPC World Finalist or IOI/IMO/IOAI/IPhO medalist in high school; a published research paper at an A or A* venue (ICORE); completed coding projects (ideally with a public GitHub repo); internship experience at a leading ML research center (e.g., Google Brain, DeepMind, Apple, Meta, Anthropic, Nvidia, MILA); or a warm recommendation from a university faculty member.
  • Experience with ML/LLM evaluation, data science, or technical product roles, ideally focused on benchmarks or experimentation.
  • Ability to read research papers, leaderboards, and GitHub repos and convert them into clear, repeatable benchmark specifications.
  • Ability to communicate comfortably with engineers and customers and translate technical details into business value.
  • Strong emphasis on high-quality data, reproducible experiments, and clear, concise documentation.
  • Respectful collaboration style and fluency in English.
  • Permanent residence in the EU, UK, US, or Canada is generally required.

Benefits

  • Collaborate on a cutting-edge research project within an intellectually stimulating, early-stage AI startup.
  • Opportunity to work on pioneering “Live AI” challenges and influence model development and public-facing claims.
  • Internship duration of 3–6 months with a preferred start date in June 2026.
  • Compensation commensurate with profile and location.
  • Possibility to work remotely or meet with team members in Pathway offices (Paris, France or Wroclaw, Poland).

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Shape the Future of AI - Galician Talent Hub

Welocalize 1K-5K Professional Services

Welo Data, part of Welocalize, is building a global network of Galician contributors for flexible remote AI data projects focused on annotation, evaluation, and prompt creation.

LLM
2 hours, 21 minutes ago

AI/RAG engineer

CoinMarketCap 11-50 IT Services

CMC Tech is hiring a full-time remote AI/RAG engineer to build and operate retrieval-augmented search and agent systems for global teams across Hong Kong, Singapore, and Dubai.

Machine Learning OpenSearch Python PyTorch React TensorFlow
2 hours, 21 minutes ago

AI Data Annotator - French (Canada)

Welocalize 1K-5K Professional Services

Welocalize is hiring a freelance, remote AI Data Annotator for French (Canada) to complete rating and annotation work that supports AI training data development on a temporary part-time basis.

Machine Learning NLP
2 hours, 36 minutes ago

Shape the Future of AI — Danish Talent Hub

Welocalize 1K-5K Professional Services

Welo Data, part of Welocalize, is seeking Danish-speaking contributors worldwide to join a remote talent network for flexible AI data projects involving annotation, evaluation, and prompt creation.

LLM
2 hours, 51 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers