Xsolla

Xsolla

Xsolla is an international payment solution provider for online games, offering tools to launch, monetize, and scale games worldwide with local payment methods and fraud prevention.

Internet Software & Services
251-1K
Founded 2005

Description

  • Design and implement AI/ML-powered infrastructure solutions for predictive autoscaling, anomaly detection, cost optimization, and automated remediation across GCP and multi-cloud environments.
  • Build and maintain AI-driven monitoring and observability systems that correlate logs, metrics, and traces to identify root causes, predict bottlenecks, and reduce MTTR.
  • Develop automated incident response workflows using AI-powered playbooks to diagnose, contain, and resolve infrastructure issues with minimal manual intervention.
  • Integrate AI tools into CI/CD pipelines to improve deployment reliability, predict test outcomes, score release health, and support rollback automation.
  • Contribute to internal AI agents and virtual assistants embedded in developer workflows such as Slack, IDEs, and Confluence.
  • Implement AI/ML-based anomaly detection and automated vulnerability management workflows to strengthen infrastructure security.
  • Prototype and productionize generative AI solutions for infrastructure automation, including IaC generation, runbooks, and change documentation.
  • Collaborate with senior engineers and leadership to define and execute the infrastructure AI strategy across implementation phases.
  • Maintain clear documentation for AI tools, integrations, and automated workflows, and share best practices with the team.

Requirements

  • 5–7 years of experience in infrastructure engineering, DevOps, SRE, or a related field.
  • Hands-on experience with GCP, preferably, and/or AWS, with understanding of cloud resource management, scaling, and cost structures.
  • Practical experience building or integrating AI/ML-powered tools in an operational context.
  • Experience with infrastructure-as-code tools such as Terraform, Puppet, Ansible, or equivalent.
  • Proficiency in Python for scripting, automation, and AI/ML integration; Bash or Go is a plus.
  • Working knowledge of Kubernetes and production container orchestration.
  • Familiarity with observability and monitoring stacks such as Prometheus, Grafana, ELK, Datadog, or similar.
  • Familiarity with LLM APIs such as OpenAI or Anthropic, and prompt engineering for operational use cases.
  • Strong problem-solving mindset with a bias toward automation and eliminating toil.
  • Fluent in English, both written and verbal.
  • Experience with AI workflow orchestration frameworks such as LangChain, LlamaIndex, n8n, or similar (nice to have).
  • Exposure to AIOps platforms such as Dynatrace, Datadog AI, Moogsoft, or BigPanda (nice to have).
  • Background in FinOps or AI-driven cloud cost optimization (nice to have).
  • Familiarity with vector databases such as Weaviate, Pinecone, or Qdrant (nice to have).
  • Experience with VMware or hybrid cloud environments (nice to have).
  • GCP and/or AWS cloud certifications (nice to have).
  • Prior experience in gaming, high-growth tech, or SaaS platform environments (nice to have).

Benefits

  • $120,000 - $160,000 annual salary.
  • Medical, dental, and vision coverage.
  • PTO.
  • Personalized career roadmap for each employee.
  • Training and educational opportunities for professional development.
  • A supportive environment focused on physical, mental, and emotional well-being.
  • An inclusive workplace committed to diversity and equal opportunity.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Intermediate Software Engineer - Artificial Intelligence (AI)

Tucows 251-1K Diversified Telecommunication Services

Tucows Domains is hiring a remote Intermediate Software Engineer specializing in Artificial Intelligence to help build AI-powered systems for domain services and related tools.

Go Hugging Face LLM Machine Learning Python REST API TensorFlow
54 minutes ago

AI Full Stack Engineer - KS001

An AI engineer at an Amazon brand management company will build and scale production AI infrastructure and workflows across communication, sales intelligence, content quality, lead qualification, and executive assistant functions.

Linux LLM Node.js OAuth PostgreSQL React REST API SSH TypeScript
1 hour, 37 minutes ago

AI Tech Lead - Staff Machine Learning Engineer

Sumo Logic 251-1K Internet Software & Services

Sumo Logic is hiring a Staff Machine Learning Engineer – AI Tech Lead to lead the design and production delivery of agentic AI systems for Security Operations Center use cases at global scale.

Apache Airflow AWS Azure Docker GCP Kubernetes LLM Machine Learning MLflow Python PyTorch System Design Vertex AI
1 hour, 40 minutes ago

AI Native Engineer, Growth Marketing

CookUnity 251-1K Hotels, Restaurants & Leisure

CookUnity is hiring an AI Native Engineer, Growth Marketing to embed AI across its Growth organization, building automation and AI-powered workflows that improve marketing efficiency, personalization, and conversion.

Affiliate Marketing dbt Email Marketing Google Ads JIRA Notion Python SEM Snowflake SQL Tableau TypeScript
1 hour, 47 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers