Xenon7

Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Collaborate with product, data science, and engineering teams to define AI test strategies and use cases.
  • Design and execute test cases for LLM-based features including summarization, classification, conversational flows, and content generation.
  • Validate model outputs for factual accuracy, tone, relevance, and compliance with domain-specific standards.
  • Develop automated test scripts and frameworks for prompt-response validation and regression testing.
  • Evaluate ML model performance using metrics such as precision, recall, F1-score, and domain-specific thresholds.
  • Conduct integration testing across cloud platforms such as AWS, Azure, and GCP, as well as data environments like Snowflake and Databricks.
  • Document test results, anomalies, and improvement recommendations in a structured format.
  • Ensure ethical AI practices and compliance with data governance policies.

Requirements

  • 5+ years of experience in software testing, QA, or data validation roles.
  • 2–3 years of hands-on experience testing Generative AI models such as GPT, LLaMA, or Claude.
  • Strong proficiency in Python and experience with ML testing tools or frameworks.
  • Familiarity with cloud platforms including AWS, Azure, and GCP.
  • Experience with data engineering environments such as Snowflake and Databricks.
  • Understanding of ML workflows including time series, regression, classification, and neural networks.
  • Excellent communication skills and ability to drive cross-functional discussions with ownership.
  • Exposure to domain-specific AI applications such as pharma, healthcare, or finance is preferred.
  • Experience with prompt engineering and LLM evaluation frameworks is preferred.
  • Knowledge of ethical AI principles and regulatory compliance such as HIPAA and GDPR is preferred.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Mortgage Underwriter - Freelance AI Trainer

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is seeking mortgage underwriting and loan origination professionals for project-based AI evaluation work focused on testing and improving mortgage-related AI outputs and compliance decisions.

14 hours, 58 minutes ago

Claims Processing Agent - Freelance AI Trainer

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is seeking part-time project-based insurance and claims specialists to evaluate and improve AI systems for auto insurance decision-making, fraud detection, and subrogation testing.

14 hours, 58 minutes ago

Record Your Daily Routine & Get Paid - AI Training (Remote)

Toloka 251-1K Internet Software & Services

Project-based freelance opportunity with an AI training platform recording first-person videos of everyday household activities to help train AI systems and robots.

14 hours, 58 minutes ago

Freelance Agent Evaluation Engineer

Mindrift.ai: Be the “I” in AI Internet Software & Services

Mindrift is seeking a project-based software specialist to create realistic coding evaluation tasks and tests for AI agents in simulated development environments.

Docker FastAPI JavaScript Kafka PostgreSQL Python React Redis TypeScript
14 hours, 58 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers