Senior Site Reliability Engineer

3 days, 11 hours ago
Full-time
Senior
DevOps and Infrastructure
Calendly

Calendly

Calendly offers a modern scheduling platform that simplifies the process of finding meeting times, allowing users to eliminate the hassle of back-and-forth communication and enhance productivity through automated scheduling features.

Internet Software & Services
251-1K
Founded 2013
$351M raised

Description

  • Design, build, maintain, and operate Calendly’s next-generation infrastructure platform.
  • Build tools and applications that extend the infrastructure platform.
  • Evaluate and deploy cloud-native open source tools and cloud services.
  • Implement resilient infrastructure using Infrastructure as Code.
  • Improve infrastructure observability and provide observability patterns for application teams.
  • Support the infrastructure platform through an on-call rotation.
  • Define and promote standard practices for new services, changes, incidents, postmortems, and capacity management.
  • Collaborate with application engineering teams to adopt reliability and operational best practices.
  • Advise application teams on optimal infrastructure use and monitoring practices.
  • Foster a collaborative environment for learning and knowledge sharing.

Requirements

  • Strong understanding of the Linux operating system.
  • Strong technical knowledge of cloud infrastructure, especially GCP, distributed systems, and reliability practices.
  • Deep experience designing, building, and running highly available production infrastructure.
  • Strong Golang or Python development experience, especially writing APIs to build, orchestrate, and manage cloud infrastructure.
  • Working knowledge of designing and implementing cloud-native applications on Kubernetes, including Controllers and Operators.
  • Strong knowledge of computer networking principles and cloud networking technologies for scalable and secure environments.
  • Extensive working experience with software and infrastructure monitoring tools, especially Datadog.
  • Comfort working directly with internal customers to understand requirements and collaborate on solutions.
  • Creative problem-solving ability, strong attention to detail, and comfort working through complex issues.
  • Authorized to work lawfully in the United States, as Calendly does not offer immigration sponsorship at this time.

Benefits

  • Annual base salary range of $198,025 to $287,952 USD, depending on location tier.
  • Top Performer Bonus program for full-time employees working 30 hours per week or more.
  • Equity awards as part of the total rewards package.
  • Competitive benefits package.
  • Opportunity to work during a period of strong product and company growth.
  • Occasional travel for company events, team collaboration, or offsites.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Database Reliability Engineer

Sporty Group 51-250 Media

Sporty is seeking a Database Reliability Engineer to own and improve its database infrastructure supporting multiple platforms and international expansion.

Ansible Argo CD Elasticsearch GitHub Actions Go Grafana Helm Jenkins Kubernetes MongoDB MySQL PostgreSQL Prometheus Python RabbitMQ Terraform
1 day, 5 hours ago

Senior Site Reliability Engineer

Moniepoint 1K-5K Diversified Financial Services

Moniepoint is hiring an experienced Site Reliability Engineer to improve the reliability, scalability, and observability of its highly distributed financial platform serving emerging markets.

AWS Azure Datadog GCP Go Java Kafka Kubernetes Microservices MySQL New Relic OpenTelemetry PostgreSQL Prometheus Python RabbitMQ Rust
1 day, 5 hours ago

Senior Site Reliability Engineer, Identity Platform

Coinbase 1K-5K Capital Markets

Coinbase is hiring an experienced Site Reliability Engineer to build and scale identity and access management tooling for its IT Operations Corporate Engineering team supporting cloud-based, security-first systems.

Ansible AWS Azure C# CI/CD Docker GCP Go Java Kubernetes Python Ruby Secrets Management Terraform
1 day, 6 hours ago

Database Reliability Engineer - Core Team

ClickHouse 51-250 IT Services

ClickHouse is hiring a Site Reliability Engineering team member for ClickHouse Core to improve the reliability, availability, scalability, and performance of ClickHouse Cloud for customers worldwide.

AWS Azure C++ ClickHouse GCP Python SQL
1 day, 6 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers