Senior Site Reliability Engineer

2 weeks, 2 days ago
Full-time
Senior
DevOps and Infrastructure
Calendly

Calendly

Calendly offers a modern scheduling platform that simplifies the process of finding meeting times, allowing users to eliminate the hassle of back-and-forth communication and enhance productivity through automated scheduling features.

Internet Software & Services
251-1K
Founded 2013
$351M raised

Description

  • Design, build, maintain, and operate Calendly’s next-generation infrastructure platform.
  • Build tools and applications that extend the infrastructure platform.
  • Evaluate and deploy cloud-native open-source tools.
  • Implement resilient infrastructure using infrastructure as code.
  • Improve infrastructure observability and provide patterns for application teams to improve application observability.
  • Participate in an on-call rotation to support the infrastructure platform.
  • Define standard practices and tooling for new services, changes, incidents, postmortems, and capacity management.
  • Collaborate with application engineering teams to adopt infrastructure standards and best practices.
  • Enable and evangelize monitoring best practices and advise teams on optimal infrastructure use.
  • Foster a collaborative environment for learning and knowledge sharing.

Requirements

  • Experience designing, building, and running highly available production infrastructure.
  • Strong technical knowledge of cloud infrastructure, especially Google Cloud Platform (GCP).
  • Experience with distributed systems and reliability practices.
  • Strong Golang or Python development experience, especially for writing APIs to build, orchestrate, and manage cloud infrastructure.
  • Solid working knowledge of cloud-native application design on Kubernetes, including Controllers and Operators.
  • Strong understanding of Linux.
  • Robust knowledge of computer networking principles and cloud networking technologies.
  • Extensive experience with software and infrastructure monitoring tools, especially Datadog.
  • Comfort working directly with internal customers to understand requirements and collaborate on solutions.
  • Eagerness to learn, share knowledge, and mentor others.
  • Authorized to work legally in the United States, as Calendly does not offer immigration sponsorship at this time.

Benefits

  • Base salary range of $198,025 to $287,952 USD depending on location tier.
  • Top Performer Bonus program or Sales incentive for full-time employees working 30 hours per week or more.
  • Equity awards.
  • Competitive benefits package.
  • Reasonable accommodation support during the application and recruiting process.
  • Occasional travel for company events, team collaboration, or offsites may be part of the role.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer (Senior or Staff), Atlas

MongoDB 1K-5K Internet Software & Services

MongoDB is hiring a Senior Site Reliability Engineer for its Atlas team to help support, maintain, and grow a multi-cloud platform for customer-facing production workloads.

AWS Azure DNS GCP Go HTTP Linux Python Ruby TLS
50 minutes ago

Intermediate Site Reliability Engineer - OP02119

Dev.Pro 251-1K Internet Software & Services

Dev.Pro is hiring an IT Specialist for its SRE team to support company and client environments by maintaining infrastructure, monitoring services, and automating operations across cloud and on-premises systems.

Ansible Apache AWS Bash CI/CD DHCP DNS Docker ELK Stack GCP Git Grafana Jenkins Linux MySQL Nginx PostgreSQL Prometheus Puppet Python SQL SQL Server SSH TCP/IP TeamCity Terraform TLS Ubuntu Windows Server Zabbix
2 hours, 52 minutes ago

Manager, Software Engineering (Resilience Engineering)

Affirm 1K-5K Diversified Financial Services

Affirm is hiring an Engineering Manager to lead its Resilience Engineering team in building production load testing and chaos engineering capabilities that improve the safety and reliability of its production systems.

AWS Java Kotlin Kubernetes Python
3 hours, 51 minutes ago

Manager, Software Engineering (Resilience Engineering)

Affirm 1K-5K Diversified Financial Services

Affirm is seeking an Engineering Manager to lead its Resilience Engineering team, building production load testing and chaos engineering capabilities that improve the safety and reliability of production systems.

AWS Java Kotlin Kubernetes Microservices Python
11 hours, 48 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers