Scientific AI Evaluation & Computational Problem Designer

1 hour, 36 minutes ago
Part-time
Senior
Artificial Intelligence and Machine Learning
Weekday

Weekday helps companies hire engineers who are vouched for by other software engineers, which lets the engineers who vouch earn passive income. It offers services such as drafting outreach messages, shortlisting candidates, and conducting reference checks. Backed by Y Combin...

Construction & Engineering
11-50
Founded 2020

Description

  • Design advanced computational problems using domain-specific scientific software.
  • Create tasks that test precise execution through multi-step workflows, simulations, and related computations.
  • Create tasks that test strategic reasoning through experiment design and inference from partial data.
  • Develop problem setups, solution pathways, and validation mechanisms.
  • Calibrate and refine tasks based on model performance to hit target difficulty levels.
  • Ensure problems emphasize reasoning strategy over brute-force computation.
  • Iterate on benchmark problems in response to feedback and evaluation results.
  • Work within scientific domains such as bioinformatics, chemistry, physics, engineering, geophysics, and systems biology.

Requirements

  • Graduate-level expertise in a relevant STEM field; MS or PhD preferred.
  • Hands-on experience using scientific software libraries for real research problems.
  • Strong Python programming skills, including building computational workflows and validators.
  • Ability to design challenging problems that require deep reasoning rather than surface-level solutions.
  • Familiarity with edge cases, limitations, and practical challenges of scientific tools.
  • Demonstrated proficiency with at least one relevant scientific library through research, open-source work, or industry experience.
  • Ability to work independently and iterate based on feedback.
  • Comfort working in Linux/terminal environments and remote compute setups.
  • Availability of at least 15–20 hours per week.
  • Experience across multiple domains or tools is preferred.
  • Background in evaluation frameworks or benchmarking is preferred.
  • Experience in teaching, pedagogy, or problem-set design is preferred.
  • Familiarity with reproducible research practices and containerized environments is preferred.

Benefits

  • Compensation of $45–$100 per hour based on expertise and domain specialization.
  • Weekly payments via supported global payment platforms.
  • Fully remote work with flexible scheduling.
  • Independent contractor role.
  • Project scope may evolve based on performance and research needs.
