Team Lead, Site Reliability Engineering - Fleet Management

3 weeks ago
Full-time
Lead
Software Development
MongoDB

MongoDB

MongoDB provides a developer data platform that simplifies data management and accelerates application development, enabling businesses to leverage modern database technology for innovative solutions across various industries.

Internet Software & Services
1K-5K
Founded 2007

Description

  • Manage a team of 6-8 engineers, supporting career growth, performance conversations, and blocker removal.
  • Develop a technical vision and roadmap for the runtime environment that balances long-term strategy with immediate engineering needs.
  • Provide light hands-on technical leadership through architectural design reviews, PR reviews, and operational guidance on complex issues.
  • Serve as the primary liaison for the Fleet Management team and coordinate with other engineering leaders on platform alignment and stakeholder expectations.
  • Oversee the end-to-end lifecycle of the Kubernetes fleet and the reliability and security components that support it.
  • Help drive the migration from Terraform-based infrastructure-as-code to an operator-driven lifecycle management approach.
  • Guide the team through infrastructure and operational challenges across multi-cloud Kubernetes environments.
  • Optimize team workflows and promote automation to reduce toil and manual operational work.

Requirements

  • 10+ years of experience working on software and operating distributed systems.
  • 2+ years of experience managing engineering teams.
  • Deep technical familiarity with Kubernetes ecosystems and containerization technologies.
  • Experience with modern infrastructure-as-code tooling such as Terraform, Crossplane, or Operators.
  • Ability to translate complex business and engineering requirements into actionable, phased technical roadmaps.
  • Customer-focused mindset with internal developers treated as primary users.
  • Track record of improving efficiency in processes and operations.
  • High empathy, responsibility, ownership, and accountability.
  • Excellent verbal and written technical communication skills.
  • Preferred experience leading major architectural shifts from traditional IaC to operator-driven lifecycle management.
  • Preferred experience managing and scaling infrastructure across multi-cloud environments such as AWS, GCP, or Azure.
  • Preferred experience designing secure, multi-tenant runtime environments at scale.

Benefits

  • Base salary range of $151,000 to $297,000 USD for U.S.-based candidates.
  • Equity as part of total compensation for eligible employees.
  • Employee stock purchase program.
  • Flexible paid time off.
  • 20 weeks of fully paid gender-neutral parental leave.
  • Fertility and adoption assistance, including fertility support.
  • 401(k) plan.
  • Mental health counseling and transgender-inclusive health insurance coverage.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Principal Software Engineer - Distributed

MariaDB.org 11-50 Internet Software & Services

MariaDB is hiring a Principal Software Engineer (Distributed) to help build and optimize its distributed database systems for high-scale use across on-prem and cloud environments.

Bash C++ Git Grafana Linux MariaDB MySQL PostgreSQL Prometheus Python SQL Unix YAML
1 hour, 16 minutes ago

Tech Lead, Web Core Product & Chrome Extension - Milwaukee, WI, USA

Speechify 51-250 Internet Software & Services

Speechify is hiring a web product engineer to help build and ship its text-to-speech products for a global user base in a fully distributed company.

Firebase JavaScript React TypeScript
2 hours, 32 minutes ago

Lead Software Engineer: Java Full Stack (Remote)

LegalMatch 251-1K Specialized Consumer Services

Lead Software Engineer at an unspecified company, responsible for hands-on technical leadership and delivery of critical software systems across cloud, web, mobile, and desktop applications.

Agile AWS Azure BDD C# C++ CI/CD Docker GCP Git Java JavaScript Kubernetes Microservices Python Scrum SQL TDD
4 hours, 24 minutes ago

Tech Lead, Web Core Product & Chrome Extension - Munich, Germany

Speechify 51-250 Internet Software & Services

Speechify is hiring a web product engineer to help ship and shape user-facing text-to-speech experiences for millions of users in a fully distributed environment.

Firebase JavaScript React TypeScript
8 hours, 30 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers