Xsolla

Xsolla

Xsolla is an international payment solution provider for online games, offering tools to launch, monetize, and scale games worldwide with local payment methods and fraud prevention.

Internet Software & Services
251-1K
Founded 2005

Description

  • Serve as the primary dashboard monitor during shifts and continuously watch production health signals in Datadog.
  • Detect anomalies by correlating APM, logs, metrics, synthetic tests, and Real User Monitoring data.
  • Triage and investigate production incidents, create incident tickets in JIRA Service Management, and route issues to the correct team.
  • Own lower-severity incidents end-to-end from detection through resolution, including diagnosis and runbook execution.
  • Support the Technical Shift Operations Lead during major incidents as a technical partner in the war room.
  • Draft internal and customer-facing incident communications, including Slack updates and status page posts.
  • Analyze incident trends, recurring issues, and production bugs and contribute findings to reports and post-incident reviews.
  • Compile incident timelines, draft initial PIR documents, and track action items after reviews.
  • Build and maintain operational automation, incident templates, Slack workflows, dashboard widgets, and runbooks.
  • Conduct structured shift handoffs and participate in knowledge transfer sessions to improve independent resolution capability.
  • Cover for the TSO Lead when needed, including severity classification, escalation decisions, and basic incident commander functions.
  • Publish periodic health reports for critical applications.

Requirements

  • 4+ years of experience in SRE, DevOps, production operations, NOC, or technical operations in a high-availability environment.
  • Experience supporting payments, e-commerce, SaaS, or gaming workloads is preferred.
  • Strong troubleshooting and investigation skills across logs, traces, metrics, databases, and network paths.
  • Hands-on experience with Datadog or a similar observability platform such as Grafana, Splunk, New Relic, or Elastic.
  • Proficiency in at least one scripting language: Python, Go, or Bash.
  • Clear written and verbal communication skills in English.
  • Working knowledge of Kubernetes and cloud infrastructure; GCP is preferred, while AWS or Azure are acceptable.
  • Understanding of SLOs, error budgets, and burn-rate alerting.
  • Experience with JIRA or JIRA Service Management, PagerDuty or OpsGenie, Slack, and Confluence.
  • Interest in or experience with AI/ML-assisted operations such as anomaly detection, alert correlation, predictive monitoring, or automated remediation.
  • Comfort with 24x7 shift-based operations in a follow-the-sun model, including weekend on-call rotation.
  • Experience in gaming, payments, or fintech environments is a plus.
  • Familiarity with Datadog Service Catalog, synthetic monitoring, and RUM is a plus.
  • Exposure to database and platform tools such as MySQL, PostgreSQL, Redis, Kafka, GitLab CI, ArgoCD, and Helm is a plus.
  • JIRA Service Management administration experience or ITIL Foundation certification is preferred.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Partner Operations Specialist

Plusgrade 251-1K Consumer Services

Plusgrade is hiring a Partner Operations Specialist to build scalable processes, data foundations, and automation that help Partner Success onboard, support, and grow partners more efficiently across the lifecycle.

JSON Salesforce SQL
1 hour, 23 minutes ago

Senior Transportation Specialist

ShipBob 251-1K Air Freight & Logistics

ShipBob is hiring a remote Australia-based Senior Transportation Specialist to coordinate carrier, final mile, and freight operations, ensuring reliable execution and timely issue resolution across assigned sites.

Power BI
1 hour, 57 minutes ago

Experienced Heavy Body Technician

Carvana 10K-50K Automotive

Carvana is hiring an Experienced Heavy Body Technician to perform extensive autobody repair work on multiple panels at its vehicle inspection and reconditioning centers.

2 hours, 41 minutes ago

Estimator (Civil Infrastructure) - 217

D2B Professional Services

Estimator (Civil Infrastructure) at an Australian client, responsible for preparing accurate construction cost estimates and bid documentation while coordinating project details across operations, engineering, and subcontractor teams.

Salesforce
3 hours, 4 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers