Senior SRE/DevOps Engineer

1 month, 2 weeks ago
Full-time
Senior
DevOps and Infrastructure
Metabase

Metabase provides open-source business intelligence, dashboard, and data visualization tools that let users run fast analytics without writing SQL. It simplifies enterprise data management with user-friendly tools for data exploration and sharing.

IT Services
51-250
Founded 2014

Description

  • Own and operate the application stack and AWS infrastructure that runs hosted customer instances of Metabase.
  • Debug runtime issues across the application stack and hosting stack.
  • Develop internal tooling and automation for the lifecycle of hosted Metabase installations, from purchase through deployment and zero-downtime upgrades.
  • Improve automated deployments and testing to increase reliability and operational quality.
  • Build and maintain infrastructure automation for Kubernetes and cloud environments.
  • Collaborate with core application developers on changes that improve metrics, deployment speed, and CI integration.
  • Work on multi-region hosting and EKS cluster provisioning.
  • Extend CRDs and operators to support the hosted platform.
  • Improve RDS sharding strategy for the multi-tenant platform.
  • Maintain SOC 2 compliance and security posture.

Requirements

  • At least 5 years of experience building and operating production infrastructure, ideally on public cloud.
  • Strong Kubernetes experience.
  • Strong AWS experience.
  • Strong experience with infrastructure as code and Terraform.
  • Ability to write high-quality, readable code in a modern language such as Python or Go.
  • Experience with modern monitoring stacks such as Prometheus, Grafana, or Datadog.
  • Thoughtful and careful approach to operations work.
  • Ability to make solid technical judgments and explain them clearly.
  • Compulsive automation and documentation habits.

Benefits

  • Flexible work-from-anywhere arrangement with the ability to define your own schedule.
  • Fully distributed global team with asynchronous work and plenty of uninterrupted time.
  • Autonomy and a growth-oriented environment that supports learning and development.
  • Opportunity to work on a rapidly growing hosted product at a company used by tens of thousands of organizations.
  • Long-term funding support, including a $30M Series B.
  • Supportive team culture focused on getting things done collaboratively.
