Chinese-Speaking AI Evaluation Specialist

2 hours, 14 minutes ago
Full-time
Junior
Artificial Intelligence and Machine Learning
Blueprint Technologies

Blueprint Technologies specializes in delivering tailored business management and IT solutions that optimize cloud spending, enhance productivity, and drive innovation across various industries, including manufacturing, retail, finance, and healthcare.

Internet Software & Services
251-1K employees
Founded 2013

Description

  • Evaluate and compare AI-generated responses in Chinese.
  • Perform side-by-side analysis of outputs from different AI systems.
  • Assess responses for accuracy, relevance, clarity, and instruction-following.
  • Identify nuances in meaning, tone, and cultural context.
  • Apply structured annotation guidelines consistently.
  • Work with datasets of real user queries.

Requirements

  • Native or professional fluency in Chinese.
  • Strong English reading comprehension.
  • Analytical thinking and attention to detail.
  • Experience with structured evaluation or guidelines.
  • Background in linguistics, translation, or localization is preferred.
  • Experience with data annotation, AI evaluation, or search relevance is preferred.
  • Ability to work remotely in the USA.

Benefits

  • Medical, dental, and vision coverage.
  • Flexible Spending Account.
  • 401(k) program.
  • Competitive PTO offerings.
  • Parental leave.
  • Opportunities for professional growth and development.
  • Remote work in the USA.
