Xenon7

Xenon7 provides advanced AI solutions and consultancy services, leveraging a team of highly qualified experts and a strong emphasis on research and innovation to address complex industry challenges and enhance operational efficiency.

Internet Software & Services
Founded 2014

Description

  • Collaborate with product, data science, and engineering teams to define AI test strategies and use cases.
  • Design and execute test cases for LLM-based features including summarization, classification, conversational flows, and content generation.
  • Validate model outputs for factual accuracy, tone, relevance, and compliance with domain-specific standards.
  • Develop automated test scripts and frameworks for prompt-response validation and regression testing.
  • Evaluate ML model performance using metrics such as precision, recall, F1-score, and domain-specific thresholds.
  • Conduct integration testing across cloud platforms such as AWS, Azure, and GCP, as well as data environments like Snowflake and Databricks.
  • Document test results, anomalies, and improvement recommendations in a structured format.
  • Ensure ethical AI practices and compliance with data governance policies.

Requirements

  • 5+ years of experience in software testing, QA, or data validation roles.
  • 2–3 years of hands-on experience testing Generative AI models such as GPT, LLaMA, or Claude.
  • Strong proficiency in Python and experience with ML testing tools or frameworks.
  • Familiarity with cloud platforms including AWS, Azure, and GCP.
  • Experience with data engineering environments such as Snowflake and Databricks.
  • Understanding of ML workflows including time series, regression, classification, and neural networks.
  • Excellent communication skills and the ability to lead cross-functional discussions and take ownership of outcomes.
  • Exposure to domain-specific AI applications such as pharma, healthcare, or finance is preferred.
  • Experience with prompt engineering and LLM evaluation frameworks is preferred.
  • Knowledge of ethical AI principles and regulatory compliance such as HIPAA and GDPR is preferred.
