Senior Site Reliability Engineer

1 week, 2 days ago
Senior
DevOps and Infrastructure

Lyrebird Health

Lyrebird Health is an AI medical scribe platform for healthcare practitioners that listens during consultations, generates clinical notes and documents, and transfers them into medical records and EMRs. It is built for Australian GPs and specialists and also serves a range of other healthcare professionals.

healthtech
11-50
Founded 2023
$12M raised

Description

  • Keep production systems online and restore them quickly when failures occur.
  • Lead and manage incidents, making high-quality decisions under pressure.
  • Design and implement scalable infrastructure and deployment patterns.
  • Build and improve CI/CD pipelines and release systems.
  • Improve monitoring, telemetry, and observability across the stack.
  • Own cloud infrastructure, security, and access controls.
  • Work closely with engineers to ensure systems are designed to scale from day one.
  • Prevent incidents by creating systems and standards that improve reliability and reduce risk.

Requirements

  • 5–7 years of experience in SRE, platform engineering, or DevOps roles.
  • Strong experience with AWS, including ECS/Fargate, EC2, Lambda, SQS, and IAM.
  • Experience running and scaling production systems.
  • Strong understanding of distributed systems and scaling approaches.
  • Hands-on experience with Docker and containerised environments.
  • Experience with Kubernetes or ECS.
  • Comfort operating with ambiguity and taking ownership end to end.
  • Ability to stay calm and make good decisions during incidents.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Contract: Senior Site Reliability Engineer

Newsela 251-1K Diversified Consumer Services

Newsela is hiring a Senior Site Reliability Contractor to improve and automate infrastructure, monitoring, and release operations for its cloud-based education platform.

Agile AWS CI/CD Datadog Docker GCP GitHub Actions JIRA MySQL Neo4j PostgreSQL Prefect Python Redis SQL Terraform
2 minutes ago

Principal Site Reliability Engineer

Zscaler 1K-5K Internet Software & Services

Zscaler is hiring a Principal Site Reliability Engineer to join its Infrastructure Services and Architecture team, owning cloud and infrastructure reliability for customer-facing systems in a hybrid or remote role.

Agile Ansible CI/CD Git Go HashiCorp Vault Kubernetes Linux OpenID Connect Python Terraform
32 minutes ago

Senior Site Reliability Engineer

OfficeSpace Software 251-1K Internet Software & Services

OfficeSpace Software is hiring a Senior Site Reliability Engineer to own the performance, reliability, and cost efficiency of its production platform at scale while helping modernize operations with AI-assisted reliability engineering.

Ansible Apache Argo CD CI/CD Datadog GitOps Grafana Kubernetes Linux MariaDB Microservices MySQL Nginx PostgreSQL Prometheus Puppet Python Redis Ruby Ruby on Rails Sidekiq Terraform
2 hours, 17 minutes ago

Manager, Software Engineering (Resilience Engineering)

Affirm 1K-5K Diversified Financial Services

Affirm is hiring an Engineering Manager to lead its Resilience Engineering team in building production load testing and chaos engineering capabilities that improve the safety and reliability of its production systems.

AWS Java Kotlin Kubernetes Python
2 hours, 54 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers