Senior AIOps Engineer, Incident Response [Remote-US]

2 weeks, 2 days ago
Full-time
Senior
DevOps and Infrastructure
Quanata

Quanata

Quanata is a software development company based in San Francisco, specializing in context-based insurance solutions. The company leverages AI, real-time telematics, and data science to enhance risk prediction, promote safer driving behaviors, and create modern insurance products. Quanata aims to transform the insurance industry by fostering positive behaviors and advancing digital experiences. The company develops a range of software platforms and tools for insurers. Their offerings include AI-powered risk assessment, telematics for driver monitoring, and claims solutions that optimize and automate processes. Quanata also focuses on customer engagement through personalized products and retention tools, supporting insurtech modernization with big data analytics and cloud-native platforms. With a team of around 26 professionals, Quanata draws on talent from Silicon Valley to drive innovation in the insurance sector.

information technology & services
201-500

Description

  • Own production health, reliability, and operational support processes across critical systems and services.
  • Lead incident response efforts, stakeholder communication, root cause analysis, and post-incident reviews.
  • Identify patterns in production issues and drive improvements to reduce recurring incidents and operational overhead.
  • Design and implement AI-driven agents and workflows that automate support and operational tasks.
  • Partner with engineering, product, and AI orchestration teams to improve system resilience and operational efficiency.
  • Build and maintain operational runbooks, documentation, and knowledge base content for human and AI-assisted workflows.
  • Support observability, monitoring, and troubleshooting across cloud-based production environments.
  • Participate in on-call rotations and continuously improve operational readiness and response processes.

Requirements

  • 6–8 years of experience in production operations, site reliability engineering, technical support engineering, or similar operational roles.
  • Strong background in incident management, root cause analysis, and production system troubleshooting.
  • Experience working within modern SDLC, DevOps, and change management environments.
  • Familiarity with operational tooling such as Jira, Confluence, and observability/monitoring platforms.
  • Strong analytical and problem-solving skills with the ability to identify trends and drive operational improvements.
  • Comfortable working cross-functionally with engineering, product, operations, and leadership teams.
  • Strong communication skills and ability to operate effectively in fast-moving technical environments.
  • Bachelor’s degree in Computer Science, Engineering, or equivalent relevant experience.
  • Experience building or working with AI/LLM-powered systems, intelligent agents, or workflow automation tools (bonus point).
  • Familiarity with cloud platforms such as AWS and modern observability ecosystems (bonus point).
  • Experience with event-driven architectures, orchestration frameworks, or operational automation platforms (bonus point).
  • Background leading operational transformation or reliability improvement initiatives (bonus point).

Benefits

  • Salary range of $215,000 to $280,000.
  • Medical, dental, vision, life insurance, and supplemental income plans for employees and dependents.
  • Headspace app subscription and a monthly wellness allowance.
  • 401(k) plan with company match.
  • One-time $2,000 home office equipment allowance for remote work.
  • Four weeks of PTO in the first year.
  • Twelve weeks of fully paid parental leave for new parents.
  • Up to $5,000 per year for professional learning, continuing education, and career development, plus LinkedIn Learning and BetterUp access.
  • Remote-first work environment within the U.S., with occasional travel not required for most positions.
  • Core meeting hours from 9 AM to 2 PM Pacific time.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Principal Forward Deployed Engineer

Unity 5K-10K Internet Software & Services

Unity is hiring a Principal Forward Deployed Engineer to work across internal business functions and build AI-powered agents, workflows, and integrations that reduce manual effort and improve how teams operate.

LLM
12 hours, 7 minutes ago

Lead Backend Engineer - Piktochart

ThriveCart 11-50 Internet Software & Services

Piktochart is hiring a Lead Backend Engineer to own the backend architecture for its web-based visual design platform, supporting scalable and reliable product growth.

CI/CD Ruby on Rails TypeScript
12 hours, 22 minutes ago

Manager, Software Engineering - Storage Platform

Figma 1K-5K Internet Software & Services

Figma is hiring an Engineering Manager to lead its Databases team, which owns the core data layer behind the company’s product and platform as it scales.

LLM MySQL PostgreSQL
12 hours, 22 minutes ago

Développeuse ou développeur backend sénior, Outils IA internes / Senior Backend Developer, Internal AI Tooling

Unity 5K-10K Internet Software & Services

Unity is hiring a Senior Backend Developer for its Engine AI team to build and operate internal AI infrastructure and agentic tools that improve developer productivity across the organization.

CI/CD Docker GCP GitHub Go Grafana Kubernetes Microservices Node.js Prometheus Python
12 hours, 37 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers