Senior Site Reliability Engineer (Calgary, Canada)

1 month, 1 week ago
Full-time
Senior
DevOps and Infrastructure
Syndio

Syndio

Syndio provides expert-backed technology that helps companies measure, achieve, and sustain workplace equity. Their Workplace Equity Analytics Platform identifies and addresses pay gaps, compliance issues, and trust-building within organizations. With ...

Professional Services
51-250
Founded 2009
$83M raised

Description

  • Design, implement, maintain, and evolve solutions that improve application and system reliability and availability.
  • Own the operational and architectural state of production infrastructure and applications.
  • Build and operate production systems using automation, monitoring, and observability best practices.
  • Collaborate with developers, SREs, and other engineers to support smooth deployments and reduce downtime.
  • Experiment with cloud infrastructure environments and services to improve system performance and resilience.
  • Participate in the 24/7 on-call rotation and help with emergency response and incident management.
  • Contribute to developer tooling, infrastructure, capacity planning, and continuous improvement initiatives.
  • Apply software engineering principles to reduce single points of failure and improve failure recovery.
  • Work across areas that may include platform, data, security, and software engineering.

Requirements

  • 5+ years of experience in Site Reliability Engineering or a similar role operationalizing and maintaining cloud services.
  • Strong experience with Infrastructure as Code tools such as Terraform.
  • Strong experience with Linux, Kubernetes, Helm, and public cloud platforms such as GCP.
  • Experience with monitoring and alerting tools such as Datadog.
  • Experience with CI/CD pipelines and GitOps deployment models.
  • Ability to diagnose technical problems, debug code, and automate routine tasks.
  • Python and/or Go programming experience is a plus.
  • Experience with security best practices for cloud deployments is a plus.
  • Availability and willingness to respond to emergencies and participate in incident management.
  • Strong sense of ownership, urgency, self-discipline, and collaboration.
  • Relevant experience managing Kubernetes applications in an SRE role.

Benefits

  • Competitive base salary targeted at $130k–145k CAD, with final offer based on experience and expertise.
  • Syndio equity so employees can share in the company’s success.
  • 20 days of annual PTO, plus paid sick and safe time, compassion leave, and voting leave.
  • Pension contribution.
  • Remote-first work with opportunities for local meet-ups in Calgary.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer (SRE)

The Investigo Group Professional Services

The Investigo Group is hiring a Senior Site Reliability Engineer to operate and mature its production Kubernetes and OpenShift platforms across secure on-premises and hybrid environments.

Ansible Argo CD CI/CD Flux GitHub Actions GitOps Go Grafana Helm Juniper Kubernetes Linux Load Balancing Machine Learning OpenID Connect OpenShift OpenTelemetry Palo Alto Prometheus Python SAML Shell Scripting Terraform
6 hours, 36 minutes ago

Senior DevOps Engineer - Cloud Operations

Black Duck Inn 1K-5K Internet Software & Services

Black Duck Software is hiring a Sr. DevOps Engineer, Cloud Operations to own and operate global customer-facing SaaS and hosted infrastructure on Google Cloud Platform for enterprise applications.

Argo CD Bash CI/CD DevSecOps DNS GCP GitHub Actions GitOps Go HashiCorp Vault Helm Java Kubernetes Load Balancing Microservices Python Terraform TLS
8 hours, 1 minute ago

Site Reliability Engineer (Hosted Infra) - Platform

Elastic 1K-5K Internet Software & Services

Elastic is hiring a Cloud Infrastructure SRE to help build and operate large-scale multi-cloud infrastructure that powers Elastic Cloud across globally distributed regions.

Ansible Argo CD Docker Go Kubernetes Linux Prometheus Puppet Terraform Ubuntu
10 hours, 14 minutes ago

Senior AIOps Engineer, Incident Response [Remote-US]

Quanata 201-500 information technology & services

Quanata is hiring an experienced production operations and reliability leader to oversee production health, incident response, and operational support for its AI-driven insurance technology platform.

AWS Confluence JIRA
17 hours, 37 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers