Xsolla

Xsolla is an international payment solution provider for online games, offering tools to launch, monetize, and scale games worldwide with local payment methods and fraud prevention.

Internet Software & Services
251-1K
Founded 2005

Description

  • Design and implement AI/ML-powered infrastructure solutions for predictive autoscaling, anomaly detection, cost optimization, and automated remediation across GCP and multi-cloud environments.
  • Build and maintain AI-driven monitoring and observability systems that correlate logs, metrics, and traces to identify root causes, predict bottlenecks, and reduce MTTR.
  • Develop automated incident response workflows using AI-powered playbooks to diagnose, contain, and resolve infrastructure issues with minimal manual intervention.
  • Integrate AI tooling into CI/CD pipelines to improve deployment reliability, automate test prediction, score release health, and support rollback automation.
  • Contribute to internal AI agents and virtual assistants integrated into developer workflows such as Slack, IDEs, and Confluence for self-service provisioning, troubleshooting, and guidance.
  • Implement AI/ML-based anomaly detection and automated vulnerability management workflows to strengthen infrastructure security.
  • Prototype and productionize generative AI solutions for infrastructure automation, including auto-generation of Terraform or Puppet modules, IaC configurations, runbooks, and change documentation.
  • Collaborate with senior engineers and leadership to define and execute the infrastructure AI strategy across implementation phases.
  • Maintain clear documentation of AI tools, integrations, and automated workflows, and share knowledge and best practices with the team.

Requirements

  • 5–7 years of experience in infrastructure engineering, DevOps, SRE, or a related field.
  • Hands-on experience with GCP (preferably as the primary cloud platform) and/or AWS.
  • Practical experience building or integrating AI/ML-powered tools in operational contexts such as anomaly detection, predictive models, or LLM-based automation.
  • Experience with infrastructure-as-code tools such as Terraform, Puppet, Ansible, or equivalent.
  • Proficiency in Python for scripting, automation, and AI/ML integration; Bash or Go is a plus.
  • Working knowledge of Kubernetes and container orchestration in production environments.
  • Familiarity with observability and monitoring stacks such as Prometheus, Grafana, ELK, Datadog, or similar.
  • Familiarity with LLM APIs such as OpenAI or Anthropic and prompt engineering for operational use cases.
  • Strong problem-solving mindset with a bias toward automation and eliminating toil.
  • Fluent English communication skills, both written and verbal.

Nice to Have

  • Experience with AI workflow orchestration frameworks such as LangChain, LlamaIndex, n8n, or similar.
  • Exposure to AIOps platforms such as Dynatrace, Datadog AI, Moogsoft, or BigPanda.
  • Background in FinOps or AI-driven cloud cost optimization.
  • Familiarity with vector databases such as Weaviate, Pinecone, or Qdrant for knowledge retrieval systems.
  • Experience with VMware or hybrid cloud environments.
  • GCP and/or AWS cloud certifications.
  • Prior experience in gaming, high-growth tech, or SaaS platform environments.

Benefits

  • Unlimited Flexible Time Off.
  • A personalized career roadmap for each employee.
  • Professional development through training and educational opportunities.
  • A comprehensive benefits program supporting physical, mental, and emotional well-being.
  • Remote full-time work arrangement.
  • Opportunity to work on AI-driven infrastructure at a global gaming company.
