Remote

Global HR Solutions & Employment Tools for Distributed Teams | Remote Hire international talent in minutes. Remote is the most disruptive global payroll, tax, HR and compliance solution for distributed teams. The easier way to employ internationally 🌍....

Professional Services

Industrials

251-1K (1000)

Founded 2019

$496M raised

46 open positions

Links

View All Jobs

Senior Site Reliability Engineer

1 month ago

Europe

Full-time

Senior

Site Reliability Engineer (SRE)

DevOps and Infrastructure

AWS Bash CI/CD Docker Elixir GitHub Actions GitLab CI Go Grafana Kubernetes Linux Node.js OpenTelemetry Prometheus Python Terraform

Apply Now

Remote

Professional Services

251-1K

Founded 2019

$496M raised

View All Jobs 46

Description

Lead the discovery and delivery of reliability and infrastructure solutions for complex, ambiguous problems.
Own planning and execution of features and projects within the SRE/Platform domain.
Contribute to platform architecture, tooling, and roadmap decisions.
Define and operate reliability practices such as SLOs, SLIs, error budgets, alerting, and observability.
Resolve cross-team requests, identify systemic issues, and turn recurring issues into reusable fixes and runbooks.
Build and operationalize AI-native workflows, reusable prompts, skills, and tooling for the team.
Establish secure-by-default patterns, CI protections, and AI-assisted review practices.
Mentor less-senior engineers and provide timely, actionable feedback.
Participate in hiring, onboarding, and RFC discussions.
Collaborate with Security on platform hardening, threat mitigation, capacity, and cost-efficiency.
Participate in incident response and on-call rotations to maintain system reliability.

Requirements

Solid professional experience in SRE, DevOps, or Platform Engineering.
Hands-on experience operating and scaling Kubernetes production clusters and Docker/container tooling.
Experience building and managing cloud infrastructure on AWS or a similar cloud provider.
Strong infrastructure-as-code experience with Terraform.
Experience with reliability frameworks including SLOs, SLIs, error budgets, and alerting strategies.
Solid observability experience with OpenTelemetry, Grafana, Prometheus, or similar tools.
Experience with CI/CD and deployment automation, such as GitLab CI or GitHub Actions.
Comfort with Golang and Bash/scripting; broader programming experience is a plus.
Practical, embedded use of AI in infrastructure, operations, or development work with observable results.
Clear communication skills in an async-first, global environment.
Proactive, curious, and comfortable taking ownership of challenges.
Collaborative and respectful across cultures, time zones, and backgrounds.
Experience with one backend programming language such as Elixir, Node.js, or Python is preferred.
Experience running and configuring Linux systems in a non-cloud environment is preferred.
Security knowledge from both defensive and offensive perspectives is preferred.
Must submit application and CV in English.
Must upload a PDF CV or provide an up-to-date LinkedIn profile.

Benefits

Annual salary range of $53,300 to $119,850 USD.
Fair, unbiased compensation with equity pay.
Stock options.
Work from anywhere with a fully remote setup.
Flexible paid time off.
Flexible working hours in an async work environment.
16 weeks of paid parental leave.
Mental health support services.
Learning budget.
Home office budget and IT equipment.
Budget for local in-person social events or co-working spaces.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

SITE RELIABILITY ENGINEER III

Harford County Public Library 51-250 Diversified Consumer Services

Site Reliability Engineer na Stone, atuando no time de Foundation Platform para fortalecer a plataforma interna de tecnologia com foco em observabilidade, automação e estabilidade dos sistemas.

Brazil Full-time Senior Site Reliability Engineer (SRE)

Ansible Argo CD AWS Azure Datadog Docker GCP GitHub Actions Go Grafana Kubernetes Linux Node.js OpenTelemetry Prometheus Python Splunk Terraform

23 minutes ago

Apply

23 minutes ago

Sr. Site Reliability Engineer (Starlink)

SpaceX 10K-50K Aerospace & Defense

SpaceX is hiring a Sr. Site Reliability Engineer for Starlink to improve the reliability, scalability, and performance of the systems supporting its satellite internet service.

United States Full-time Senior Site Reliability Engineer (SRE)

$165k-$265k

Apache Spark C# CI/CD Flink Git Go HDFS Java Kafka Kubernetes Linux Python Scala

38 minutes ago

Apply

38 minutes ago

Head of Platform Engineering

dLocal 251-1K Diversified Financial Services

dLocal is seeking a senior leader to own its engineering platform, reliability posture, and AI-assisted development transformation across a global payments business serving emerging markets.

Argentina Spain Uruguay Brazil Full-time Lead Platform Engineer Site Reliability Engineer (SRE)

CI/CD Microservices

53 minutes ago

Apply

53 minutes ago

Database Reliability Engineer

Alex Staff Agency 11-50 Professional Services

Senior Database Reliability Engineer for an infrastructure DBA team, responsible for keeping production database services reliable and automating operational work across a multi-database environment.

Poland Armenia Serbia Georgia Spain Greece Full-time Senior Database Administrator Site Reliability Engineer (SRE)

Ansible ClickHouse DNS Grafana Linux MongoDB OpsGenie PostgreSQL Redis Terraform TLS

1 hour, 23 minutes ago

Apply

1 hour, 23 minutes ago

Remote

Tags

Links

Senior Site Reliability Engineer

Remote

Description

Requirements

Benefits

Similar Roles

SITE RELIABILITY ENGINEER III

Sr. Site Reliability Engineer (Starlink)

Head of Platform Engineering

Database Reliability Engineer

You're on a roll! Sign up now to keep applying.