OpenTable

OpenTable

OpenTable is a leading provider of free, real-time online restaurant reservations for diners and reservation and guest management solutions for restaurants. With millions of diners and tens of thousands of restaurants, OpenTable empowers the dining exp...

Consumer Services
1K-5K
Founded 1998
$48M raised

Description

  • Act as the primary SRE partner for the DBA team across managed database platforms.
  • Own the end-to-end observability stack for databases, including metrics, logs, traces, dashboards, and alerts.
  • Design, implement, and improve monitoring and alerting for database reliability signals such as replication lag, query latency, and backup health.
  • Lead and participate in on-call support and incident response for database and system incidents.
  • Build and maintain automation and self-service workflows for cluster provisioning, configuration rollout, user and role management, backup and restore, and failover procedures.
  • Develop and maintain runbooks, playbooks, and standard operating procedures for database operations.
  • Promote an automation-first culture and reduce manual toil through tooling and platform improvements.
  • Participate in a weekly 24/7 on-call rotation.
  • Collaborate effectively with DBAs and cross-functional teams while also working independently.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.
  • 4–7 years of total experience with a strong focus on SRE, Production Engineering, or Platform Engineering.
  • Solid experience running Linux-based production systems at scale.
  • Proficiency with infrastructure and configuration management tools, especially Puppet.
  • Experience with PostgreSQL or MongoDB in an operational context.
  • Familiarity with containerization and orchestration technologies such as Docker and Kubernetes is a plus.
  • Strong scripting or programming skills in Python, Go, Shell (bash), or similar.
  • Solid experience with Git and GitHub workflows, including branching, pull requests, code reviews, and automation via GitHub Actions or similar CI systems.
  • Experience building and maintaining CI/CD pipelines and integrating operational checks and tests.
  • Practical experience with monitoring and metrics tools such as Prometheus, CloudWatch, or Grafana.
  • Practical experience with alerting tools such as PagerDuty.
  • Excellent communication, collaboration, and problem-solving skills.
  • Must be able to work independently and without direct supervision.
  • Comfortable partnering with DBAs and focusing on systems, automation, and observability around database platforms rather than schema or query design.
  • Interest in learning more about database behavior in production, including replication, failover, backups, and performance from a reliability perspective.

Benefits

  • 100% remote across India.
  • Work from (almost) anywhere for up to 20 days per year.
  • Company-paid therapy sessions through SpringHealth.
  • Company-paid Headspace subscription.
  • Annual company-wide week off each year.
  • Paid parental leave.
  • Generous paid vacation plus time off for your birthday.
  • Paid volunteer time.
  • Development Dollars, leadership development, and access to thousands of on-demand e-learnings.
  • Travel discounts, employee resource groups, and quarterly team offsites.
  • Tax optimization options, generous health insurance, and a pension fund.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Intermediate Site Reliability Engineer, Environment Automation

GitLab 1K-5K Internet Software & Services

GitLab is hiring a Site Reliability Engineer for its Dedicated Environment Automation team to help build and operate automated, isolated customer environments across cloud platforms.

Ansible GitLab Go Helm Kubernetes Terraform
21 minutes ago

Senior Site Reliability Engineer

Sezzle 251-1K Diversified Financial Services

Sezzle is hiring a Senior Site Reliability Engineer to own and improve the reliability, scalability, and automation of its U.S.-focused infrastructure and distributed systems while supporting a rapidly growing fintech platform.

AWS CI/CD Datadog Elasticsearch Git GitLab Go Grafana Kubernetes Microservices MySQL New Relic PostgreSQL Prometheus Python React React Native REST API SQL TypeScript
1 hour, 6 minutes ago

Site Reliability Engineering (SRE)

Riskified 251-1K Internet Software & Services

Riskified is hiring a Site Reliability Engineer to own the cloud infrastructure behind its real-time fraud and risk decisioning platform, ensuring scalability, reliability, and fast delivery at global transaction volumes.

Argo CD AWS CI/CD Cloudflare Go Helm Kubernetes Microservices Node.js
1 hour, 6 minutes ago

Site Reliability Engineer

TextNow 51-250 Wireless Telecommunication Services

TextNow is hiring a remote Site Reliability Engineer in Canada to own infrastructure, monitoring, logging, CI/CD, and reliability for the systems supporting its free phone service platform.

Ansible AWS CI/CD GitHub System Design Terraform
8 hours, 51 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers