German-Speaking AI Evaluation Specialist

1 hour, 56 minutes ago
Full-time
Junior
Artificial Intelligence and Machine Learning
Blueprint Technologies

Blueprint Technologies delivers tailored business management and IT solutions that optimize cloud spending, enhance productivity, and drive innovation in industries including manufacturing, retail, finance, and healthcare.

Internet Software & Services
251-1K employees
Founded 2013

Description

  • Evaluate and compare AI-generated responses in German.
  • Perform side-by-side analysis of outputs from different AI systems.
  • Assess responses for accuracy, relevance, clarity, and instruction-following.
  • Identify nuances in meaning, tone, and cultural context.
  • Apply structured annotation guidelines consistently.
  • Work with datasets of real user queries.

Requirements

  • Native or professional fluency in German.
  • Strong English reading comprehension.
  • Analytical thinking and attention to detail.
  • Experience with structured evaluation or guidelines.
  • Background in linguistics, translation, or localization is preferred.
  • Experience with data annotation, AI evaluation, or search relevance is preferred.
  • Ability to work remotely from within the USA.

Benefits

  • Medical, dental, and vision coverage.
  • Flexible Spending Account.
  • 401(k) program.
  • Competitive PTO offerings.
  • Parental leave.
  • Opportunities for professional growth and development.
  • Remote work within the USA.
