Incident Commander

2 hours, 13 minutes ago
Full-time
Senior
DevOps and Infrastructure
Caseware

Caseware

CaseWare International Inc. provides cutting-edge software solutions for accounting firms, corporations, and governments, enabling users worldwide to work smarter and transform insights into impact.

Internet Software & Services
251-1K
Founded 1988

Description

  • Initiate and oversee incident response efforts as the primary point of coordination after an incident is detected.
  • Act as the authoritative voice during incidents and drive teams toward rapid resolution.
  • Collaborate with engineers, product management, support, and other cross-functional teams during active incidents.
  • Use and integrate tools such as JIRA, PagerDuty, New Relic, AWS, and Microsoft Teams to monitor and coordinate incident handling.
  • Ensure the right stakeholders are engaged to support recovery and resolution efforts.
  • Communicate timely updates, resolution plans, and incident status to internal and external audiences.
  • Track and report uptime metrics to promote transparency in system reliability and performance.
  • Lead post-mortem sessions and produce PIR and RCA documentation, including timelines, impact, root cause, remediation, and preventive actions.
  • Follow up on action items from post-incident reviews to help prevent recurrence.
  • Implement proactive strategies and tools to reduce operational risk and strengthen system resilience.

Requirements

  • 5+ years of experience managing critical incidents in SaaS environments.
  • Experience in a similar role, preferably within a software or technology company.
  • Prior knowledge of cloud environments, AWS, DevOps practices, or related technical operations.
  • Strong technical background in incident management and response.
  • Proven ability to lead teams through rapid incident resolution.
  • Solid understanding of the modern software landscape.
  • Familiarity with JIRA and PagerDuty integrations.
  • Excellent written and verbal communication skills.
  • Strong English communication and collaboration skills.
  • Ability to perform well under pressure and manage competing priorities effectively.

Benefits

  • Remote, full-time permanent role.
  • Flexible work options.
  • Generous time-off policies.
  • Competitive salary.
  • Comprehensive benefits, including health insurance and retirement plans.
  • Performance bonuses and recognition programs.
  • Opportunities for career growth.
  • Opportunity to work on international projects with a global team.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer

Zeta Global 1K-5K Media

Zeta Global is hiring a Senior Site Reliability Engineer to help build and operate scalable observability and reliability systems for high-throughput distributed services processing millions of transactions daily.

Argo CD AWS Docker GitOps Go Grafana Honeycomb Jenkins Kubernetes Microservices OpenTelemetry Prometheus Python Terraform
13 minutes ago

Senior SRE Engineer / DevOps

Margo Bank Professional Services

Senior SRE Engineer / DevOps position at a consulting team in Warsaw focused on developing an internal developer platform and establishing CI/CD standards across multiple teams.

Bash CI/CD DevSecOps Git Kubernetes Python
13 minutes ago

Senior Site Reliability Engineer (SRE)

KOMOJU Internet Software & Services

KOMOJU is hiring a Site Reliability Engineer to own the reliability, performance, and developer experience of its cloud-based payment platform supporting merchants across cross-border integrations.

AWS CI/CD CircleCI Datadog GitHub Actions Go Jenkins Python Ruby Ruby on Rails Shopify TCP/IP Terraform
28 minutes ago

DevOps & Site Reliability Engineer

Oowlish 51-250 Internet Software & Services

Oowlish is hiring a DevOps & Site Reliability Engineer to support an AI-focused SaaS startup by maintaining, optimizing, and scaling the infrastructure behind its platform for high availability, performance, and reliability.

AWS Azure Azure Pipelines Bash CI/CD CircleCI Datadog Docker GCP Grafana Helm Jenkins Kubernetes New Relic Prometheus
43 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers