Senior Site Reliability Engineer Team Lead - OP02087

18 hours ago
Full-time
Lead
DevOps and Infrastructure
Dev.Pro

Dev.Pro

Dev.Pro is a globally distributed software development partner, specializing in custom outsourced software development for innovative technology companies to scale their businesses efficiently.

Internet Software & Services
251-1K
Founded 2011

Description

  • Lead the Cloud/SRE Support team with coaching, prioritization, and day-to-day oversight.
  • Drive team performance to ensure high-quality support, SLA compliance, and continuous improvement.
  • Coordinate with India-based and cross-functional teams to maintain alignment and 24/7 coverage.
  • Translate complex operational issues into actionable plans and scalable solutions.
  • Design and improve support processes and operational frameworks for the team.
  • Identify operational gaps and risks, and help improve team engagement and effectiveness.
  • Collaborate with cross-functional stakeholders to define priorities and communicate progress, risks, and solutions.
  • Oversee MDM operations, cloud and user access management, monitoring, incident handling, and root cause analysis.
  • Maintain documentation, runbooks, and escalation procedures.
  • Promote reliability best practices and customer-focused operational support.

Requirements

  • Based in Chile.
  • Upper-Intermediate English level.
  • 5+ years of experience in cloud operations, platform support, or IT operations.
  • 2+ years of experience leading technical support or SRE teams.
  • Strong operational support mindset, including incident handling, user requests, and escalation management.
  • Solid understanding of cloud technologies, monitoring, and observability tools.
  • Knowledge of incident management best practices and access management concepts.
  • Ability to break down complex problems into structured, actionable plans.
  • Strategic thinking to evaluate options, weigh tradeoffs, and design processes.
  • Experience collaborating with global, multi-time zone teams.
  • Strong communication skills for both technical and non-technical stakeholders.
  • Preferred experience leading MDM support teams such as Esper, MobileIron, or Workspace ONE.
  • Familiarity with cloud platforms such as Azure, GCP, or AWS.
  • Basic understanding of CI/CD pipelines, Docker, and Kubernetes.
  • Experience working with onshore/offshore teams, ideally in the U.S. and India.
  • Experience in 24/7 or follow-the-sun operational models.
  • Strong analytical and documentation skills, including process mapping and root cause analysis.

Benefits

  • 99.9% remote work with the ability to work from anywhere in the world.
  • 30 paid days off per year for vacation, holidays, or personal time.
  • 5 paid sick days, up to 60 days of medical leave, and up to 6 paid days off for major family events.
  • Partially covered health insurance after probation.
  • Wellness bonus for gym memberships, sports nutrition, and similar needs after 6 months.
  • Salary paid in U.S. dollars with all approved overtime covered.
  • English lessons and access to Dev.Pro University programs.
  • Online activities and team-building events.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

TextNow 51-250 Wireless Telecommunication Services

TextNow is hiring a remote Site Reliability Engineer in Canada to own infrastructure, monitoring, logging, CI/CD, and reliability for the systems supporting its free phone service platform.

Ansible AWS CI/CD GitHub System Design Terraform
4 hours, 45 minutes ago

Senior Application Engineer

Warner Music Group is hiring a Senior Application Engineer to support, improve, and modernize the software systems behind its global music operations.

Angular AWS CI/CD GitHub Actions Java Oracle PostgreSQL Python React SQL
5 hours ago

Site Reliability Engineer - Backstage

Spotify Media

Site Reliability Engineer for Spotify’s Backstage team in New York City, focused on building and operating cloud infrastructure for an external developer portal and internal AI-driven coding workflows.

AWS GCP Go Java LLM Microservices Python React Terraform TypeScript
6 hours, 15 minutes ago

Blockchain Site Reliability Engineer

InfStones 51-250 Internet Software & Services

InfStones is hiring a remote Blockchain Site Reliability Engineer in Dallas to ensure the reliability, availability, and performance of its blockchain node infrastructure.

Docker Ethereum Go Grafana JavaScript Kubernetes Linux Prometheus Python Rust Solana
7 hours ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers