Xenon7

Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Collaborate with product, data science, and engineering teams to define AI test strategies and use cases.
  • Design and execute test cases for LLM-based features including summarization, classification, conversational flows, and content generation.
  • Validate model outputs for factual accuracy, tone, relevance, and compliance with domain-specific standards.
  • Develop automated test scripts and frameworks for prompt-response validation and regression testing.
  • Evaluate ML model performance using metrics such as precision, recall, F1-score, and domain-specific thresholds.
  • Conduct integration testing across cloud platforms such as AWS, Azure, and GCP, as well as data environments like Snowflake and Databricks.
  • Document test results, anomalies, and improvement recommendations in a structured format.
  • Ensure ethical AI practices and compliance with data governance policies.

Requirements

  • 5+ years of experience in software testing, QA, or data validation roles.
  • 2–3 years of hands-on experience testing Generative AI models such as GPT, LLaMA, or Claude.
  • Strong proficiency in Python and experience with ML testing tools or frameworks.
  • Familiarity with cloud platforms including AWS, Azure, and GCP.
  • Experience with data engineering environments such as Snowflake and Databricks.
  • Understanding of ML workflows including time series, regression, classification, and neural networks.
  • Excellent communication skills and ability to drive cross-functional discussions with ownership.
  • Exposure to domain-specific AI applications such as pharma, healthcare, or finance is preferred.
  • Experience with prompt engineering and LLM evaluation frameworks is preferred.
  • Knowledge of ethical AI principles and regulatory compliance such as HIPAA and GDPR is preferred.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Korean Music Content Specialist in Japan (Remote)

Welocalize 1K-5K Professional Services

Welo Data is hiring a remote Korean Music Content Specialist in Japan to transcribe, proofread, and quality-check music lyrics and metadata for local-market content.

macOS
0 minutes ago

Shape the Future of AI - Uzbek Talent Hub

Welocalize 1K-5K Professional Services

Welo Data, part of Welocalize, is building a global contributor network of Uzbek speakers for flexible remote AI data projects focused on annotation, evaluation, and prompt creation.

LLM
0 minutes ago

Evaluators for AI training (English language)

TSMG Professional Services

An Ivano-Frankivsk-based Field Projects team is hiring a remote Evaluator to review and rate AI agent replies for content moderation and safety.

0 minutes ago

Shape the Future of AI — Polish Talent Hub

Welocalize 1K-5K Professional Services

Welo Data, part of Welocalize, is building a global network of Polish-language contributors to support remote AI data projects in annotation, evaluation, and prompt creation.

LLM
0 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers