Operations Team Lead (Production & Reliability)

6 hours, 6 minutes ago
Full-time
Lead
DevOps and Infrastructure
Complexio

Complexio connects your data, people, and systems into one intelligence layer. Ask questions in natural language, get answers from real operational data.

Description

  • Own production stability and availability across all live systems.
  • Lead operational readiness for new releases and manage safe production access and change coordination.
  • Own the full incident management lifecycle, including detection, response, communication, and postmortems.
  • Design and maintain sustainable on-call rotations, escalation paths, severity levels, and runbooks.
  • Define SLIs and SLOs for critical systems and improve visibility into reliability signals.
  • Track reliability metrics such as MTTR, incident frequency, and escalation trends.
  • Drive reliability roadmap initiatives and systemic fixes that prevent recurring incidents.
  • Lead and grow the Operations team by setting standards, KPIs, ownership, and accountability.
  • Raise the bar on operational discipline across both systems and team performance.

Requirements

  • Strong experience in SRE, DevOps, Infrastructure, or Production Engineering.
  • Prior experience leading technical teams.
  • Deep hands-on incident management experience.
  • Strong observability and reliability mindset.
  • Calm under pressure and clear in communication.
  • Systems thinker who fixes root causes rather than symptoms.
  • Experience building structured incident response and escalation processes is highly relevant.
  • Experience defining SLIs/SLOs, runbooks, or on-call practices is preferred.
