Technical Service Operations Lead (TSO Lead)

8 hours, 51 minutes ago
Full-time
Senior
Customer and Technical Support
Xsolla

Xsolla

Xsolla is an international payment solution provider for online games, offering tools to launch, monetize, and scale games worldwide with local payment methods and fraud prevention.

Internet Software & Services
251-1K
Founded 2005

Description

  • Serve as incident commander for major incidents, coordinating cross-functional response teams and driving resolution within SLA targets.
  • Own incident communications, including executive updates and customer-facing status page updates throughout the incident lifecycle.
  • Facilitate blameless post-incident reviews, identify root causes, assign corrective actions, and track them to closure.
  • Analyze incident trends, recurring issues, and production bugs to create problem tickets and recommendations for product and engineering teams.
  • Enforce the incident management framework, including severity levels, priority matrix, SLA targets, escalation procedures, and deployment readiness gates.
  • Mentor and oversee the Operations Engineer on shift, including triage, investigation, runbook execution, documentation quality, and knowledge transfer.
  • Produce shift handoff reports and regular operational reporting on incident trends, KPI performance, SLA adherence, proactive detection, and repeat incidents.
  • Audit service catalogue completeness and govern JIRA Service Management workflows for incident, PIR, and problem management.
  • Cover the Operations Engineer role during absences, breaks, and surge incidents.
  • Participate in a rotating weekend on-call schedule for major incidents.

Requirements

  • 6+ years of experience in incident management, SRE, NOC leadership, or technical operations in a production environment.
  • Experience supporting high-availability, high-transaction platforms such as payments, e-commerce, SaaS, or gaming.
  • Proven ability to coordinate multi-team incident response and make real-time escalation decisions under pressure.
  • Excellent written and verbal communication skills in English for executive, technical, customer, and partner audiences.
  • Strong ITIL foundation with practical experience in incident, problem, and change management workflows.
  • Ability to read and interpret logs, traces, and metrics in Datadog or equivalent tools such as Grafana, Splunk, or New Relic.
  • Hands-on experience with Datadog, PagerDuty or OpsGenie, JIRA or JIRA Service Management, Slack, and Confluence.
  • Understanding of APM, SLOs, error budgets, burn-rate alerting, and synthetic monitoring.
  • Analytical mindset with the ability to identify trends from incident data and turn them into actionable recommendations.
  • Comfort with 24x7 shift-based operations in a follow-the-sun model and rotating weekend on-call coverage.
  • Experience with or strong interest in AI/ML-assisted operations such as anomaly detection, alert correlation, predictive alerting, automated remediation, or self-healing automation.
  • Experience in gaming, payments, or fintech is preferred.
  • JIRA Service Management administration experience is preferred.
  • Familiarity with Datadog Service Catalog, scorecards, SLOs, burn-rate alerts, and multi-window SLOs is preferred.
  • Experience building an operations function from scratch is preferred.
  • Background in Kubernetes, cloud infrastructure, GCP, microservices architecture, or distributed systems is preferred.
  • ITIL certification (Foundation or higher) is a plus but not required.

Benefits

  • $120,000 - $150,000 annual salary in British Columbia, depending on location and experience.
  • Medical, dental, and vision coverage.
  • PTO.
  • Personalized career roadmap for each employee.
  • Training and educational opportunities for professional development.
  • Supportive environment focused on employees’ physical, mental, and emotional well-being.
  • Equal opportunity and inclusive workplace environment.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Merchandise Manager

Level99 51-250 Media

Level99 is seeking a remote Merchandise Manager in Lake Buena Vista to run its merchandise program end to end for a growing entertainment venue brand and ensure products launch, replenish, and perform effectively across locations.

E-commerce Shopify
22 minutes ago

Middleware Operations Lead

Qualysoft 251-1K Internet Software & Services

Qualysoft Group is seeking a Middleware Operations Lead in Budapest to oversee enterprise middleware platform operations and drive reliable service delivery across client environments.

Apache Bash Python
37 minutes ago

Director of E-Commerce - Health & Wellness - REMOTE

Stonehenge Health is hiring a remote, California-based Director of E-Commerce to own and scale its Shopify Plus direct-to-consumer storefronts, driving revenue growth, conversion, and the overall shopping experience.

Google Analytics Power BI SEO Shopify
52 minutes ago

ブラウンダスト2 日本ファンダム事業 Operations PM(未経験/経験者)

NEOWIZ 251-1K Internet Software & Services

NEOWIZ GameOn is hiring an Operations PM to lead Japanese market marketing and fandom initiatives for Brown Dust 2, with a focus on planning and executing community-driven campaigns for anime-style collectible games.

52 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers