SRE Technical Project Manager

3 weeks, 1 day ago
Full-time
Mid Level
DevOps and Infrastructure
HHAeXchange

HHAeXchange

HHAeXchange is a premier homecare management software connecting providers, payers, and caregivers for proactive care, efficiency, and transparency in the industry.

Health Care Providers & Services
251-1K
Founded 2008

Description

  • Partner with managers and leads to prioritize and drive strategic SRE projects, including milestones, acceptance criteria, and progress reporting.
  • Facilitate collaboration among SRE, development, and technical customer-facing teams.
  • Lead kanban planning, roadmap meetings, and retrospectives using a kanban framework.
  • Refactor and improve the incident management process to support better incident response, work/life balance, and on-call orchestration.
  • Manage MS Teams-integrated bots for incident creation, stakeholder communication, and work-intake management.
  • Drive the use of AI bots to automate incident scribing and initial timeline generation.
  • Own the weekly uptime report and communicate system health metrics to executive stakeholders.
  • Track SLIs, SLOs, and error budgets using Datadog and translate technical data into clear updates.
  • Lead the post-incident review process, including RCA creation, approval, and post-incident action item ownership.
  • Perform other duties as assigned by the supervisor or company leader.

Requirements

  • Bachelor’s degree.
  • Software and operations agile certifications.
  • 3-5 years of project management experience with globally distributed teams.
  • 1-2 years of experience working with IT Operations and Site Reliability Engineering teams.
  • Experience working in a healthcare data environment is a bonus.
  • Proficiency with Jira.
  • Experience with on-call rotation management tools such as OpsGenie, Jira Service Management, or PagerDuty.
  • Experience with MS Teams integrations.
  • Experience leading agile technical teams; kanban experience is preferred.
  • Professional English writing skills.
  • Demonstrated hands-on experience leveraging AI for operational efficiency.
  • Executive- or customer-facing communication experience.
  • Must be able to work fully remote from the EST or CST time zones within the United States.
  • Travel up to 10%, including overnight travel.

Benefits

  • Base salary range of $100,000-$110,000 per year, not including variable compensation.
  • Benefits-eligible position.
  • Competitive health plans.
  • Paid time off.
  • Company-paid holidays.
  • 401(k) retirement program with a company elected match.
  • Access to other company-sponsored programs.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

Kaseya 1K-5K IT Services

Kaseya is hiring a Site Reliability Engineer to own the reliability, automation, and production stability of its AWS-based services used by thousands of MSPs worldwide.

Ansible AWS Chef CloudFormation Datadog DevSecOps Elasticsearch Kibana Kubernetes MySQL PostgreSQL Puppet Secrets Management Serverless Terraform
55 minutes ago

SRE - DevOps Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure that supports its global digital solutions.

Argo CD CI/CD Flux GitHub Actions GitOps Helm Jenkins Kubernetes OpenShift Terraform
4 hours ago

Site Reliability Engineer

Alpaca 51-250 Capital Markets

Alpaca is hiring a Site Reliability Engineer to keep its brokerage platform reliable and operable across cloud, Kubernetes, observability, messaging, and database systems, with a strong focus on PostgreSQL reliability on the trading-critical path.

DNS GitOps Go Kafka Kubernetes Linux Load Balancing PostgreSQL Python RabbitMQ Secrets Management TLS
4 hours, 52 minutes ago

DevOps - SRE Engineer - Argentina

Coderio 51-250 Internet Software & Services

Coderio is hiring a remote DevOps/SRE Engineer in Argentina to ensure the stability, scalability, and efficient operation of the infrastructure supporting its digital platforms.

Argo CD Flux GitHub Actions Helm Jenkins Kubernetes OpenShift Terraform
5 hours, 6 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers