Weekday

Weekday helps companies hire engineers who are vouched by other software engineers, enabling passive income for engineers. They offer services like drafting outreach messages, shortlisting candidates, and conducting reference checks. Backed by Y Combin...

Construction & Engineering

Industrials

11-50 (45)

Founded 2020

38 open positions

Links

View All Jobs

Scientific AI Evaluation & Computational Problem Designer

1 hour, 36 minutes ago

United States

Part-time

Senior

AI (Artificial Intelligence)

Artificial Intelligence and Machine Learning

Linux Python

Apply Now

Weekday

Construction & Engineering

11-50

Founded 2020

View All Jobs 38

Description

Design advanced computational problems using domain-specific scientific software.
Create tasks that test precise execution through multi-step workflows, simulations, and related computations.
Create tasks that test strategic reasoning through experiment design and inference from partial data.
Develop problem setups, solution pathways, and validation mechanisms.
Calibrate and refine tasks based on model performance to hit target difficulty levels.
Ensure problems emphasize reasoning strategy over brute-force computation.
Iterate on benchmark problems in response to feedback and evaluation results.
Work within scientific domains such as bioinformatics, chemistry, physics, engineering, geophysics, and systems biology.

Requirements

Graduate-level expertise in a relevant STEM field; MS or PhD preferred.
Hands-on experience using scientific software libraries for real research problems.
Strong Python programming skills, including building computational workflows and validators.
Ability to design challenging problems that require deep reasoning rather than surface-level solutions.
Familiarity with edge cases, limitations, and practical challenges of scientific tools.
Demonstrated proficiency with at least one relevant scientific library through research, open-source work, or industry experience.
Ability to work independently and iterate based on feedback.
Comfort working in Linux/terminal environments and remote compute setups.
Availability of at least 15–20 hours per week.
Experience across multiple domains or tools is preferred.
Background in evaluation frameworks or benchmarking is preferred.
Experience in teaching, pedagogy, or problem-set design is preferred.
Familiarity with reproducible research practices and containerized environments is preferred.

Benefits

Compensation of $45–$100 per hour based on expertise and domain specialization.
Weekly payments via supported global payment platforms.
Fully remote work with flexible scheduling.
Independent contractor role.
Project scope may evolve based on performance and research needs.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Maps Personalization Relevance Rater - Portuguese (Brazil)

Welo Global Professional Services

Welo Data is hiring freelance remote Portuguese (Brazil) raters in Brazil to evaluate the relevance and usefulness of personalized search and location recommendations for Google Maps-related tasks.

Brazil Freelance Entry Level AI (Artificial Intelligence)

$37k-$37k

54 minutes ago

Apply

54 minutes ago

AI Evaluation Engineer (Knowledge & Research)

Gramian Consultancy Group Professional Services

Gramian Consultancy is hiring an AI Evaluation Engineer to design and evaluate multi-agent benchmark tasks and datasets that test AI systems on reading, reasoning, and extracting knowledge from large unstructured research sources.

Brazil Colombia Egypt Turkey Bangladesh India Indonesia Vietnam Ghana Kenya Nigeria Contract Senior AI (Artificial Intelligence)

Docker JSON Python

1 hour, 46 minutes ago

Apply

1 hour, 46 minutes ago

Statistics & Python Expert - Freelance AI Trainer

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is seeking statistics specialists for project-based AI work focused on creating and validating computational math problems for leading tech companies.

India Part-time Junior AI (Artificial Intelligence)

$23k-$23k

C MATLAB NumPy Pandas Python R SciPy SQL

1 hour, 49 minutes ago

Apply

1 hour, 49 minutes ago

Statistics & Python Expert - Freelance AI Trainer

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is seeking statistics specialists for project-based AI evaluation work focused on creating and validating computational mathematics problems for leading tech companies.

United States Part-time Junior AI (Artificial Intelligence)

Up to $0k

C MATLAB NumPy Pandas Python R SciPy SQL

2 hours, 13 minutes ago

Apply

2 hours, 13 minutes ago

Weekday

Tags

Links

Scientific AI Evaluation & Computational Problem Designer

Weekday

Description

Requirements

Benefits

Similar Roles

Maps Personalization Relevance Rater - Portuguese (Brazil)

AI Evaluation Engineer (Knowledge & Research)

Statistics & Python Expert - Freelance AI Trainer

Statistics & Python Expert - Freelance AI Trainer

You're on a roll! Sign up now to keep applying.