Obsidian Security

Obsidian Security is a Southern California-based company at the forefront of cybersecurity, artificial intelligence, and hybrid cloud environments. They offer a comprehensive security solution for businesses, including advanced threat protection, insid...

Internet Software & Services

Information Technology

51-250 (150)

Founded 2017

$30M raised

31 open positions

Links

View All Jobs

Site Reliability Engineer

2 weeks, 3 days ago

United Kingdom

Full-time

Junior

Site Reliability Engineer (SRE)

DevOps and Infrastructure

Argo CD AWS Datadog GCP GitHub Actions GitOps Grafana Helm Kubernetes Microservices Prometheus

Apply Now

Obsidian Security

Internet Software & Services

51-250

Founded 2017

$30M raised

View All Jobs 31

Description

Improve the reliability, availability, and resiliency of production systems and distributed services.
Build and maintain monitoring, alerting, dashboards, and observability tooling to improve system visibility and reduce operational noise.
Support incident response, on-call operations, troubleshooting, and postmortem processes.
Partner with engineering teams to implement SLI/SLO practices, operational standards, and reliability-focused workflows.
Automate infrastructure operations, deployment workflows, and platform tooling across Kubernetes, cloud infrastructure, and data pipelines.
Help detect and resolve production issues quickly and contribute to continuous operational improvement.
Collaborate closely with DevOps, Platform Engineering, and product teams to improve service resilience across the platform.

Requirements

2–5 years of experience in Site Reliability Engineering, DevOps, Production Engineering, or related roles.
Experience operating and supporting production systems in AWS and/or GCP.
Familiarity with Kubernetes and Helm in cloud-native environments.
Experience with observability and monitoring tools such as Prometheus, Grafana, Datadog, or similar platforms.
Exposure to CI/CD systems such as GitLab CI/CD, GitHub Actions, ArgoCD, or equivalent.
Strong troubleshooting and debugging skills across distributed systems and microservices.
Experience writing automation or infrastructure tooling using scripting or programming languages.
Strong systems thinking and a collaborative engineering mindset.
AI Agent development experience is preferred.
Experience supporting SaaS platforms in production environments is preferred.
Familiarity with incident management and postmortem practices is preferred.
Exposure to infrastructure-as-code and GitOps workflows is preferred.
Understanding of SLI/SLO concepts and operational metrics is preferred.
Experience with enterprise-scale monitoring or customer-facing production systems is preferred.

Benefits

Competitive salary in the £85,000–£103,000 GBP base pay range.
Competitive compensation with equity and 401(k).
Comprehensive healthcare with dental and vision coverage.
Flexible paid time off plus paid holiday time off.
12 weeks of new parent or family leave.
Personal and professional development resources.
Potential eligibility for equity awards.
Potential eligibility for sales commission or incentive compensation, depending on role or function.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Site Reliability Engineer (SRE)

The Investigo Group Professional Services

The Investigo Group is hiring a Senior Site Reliability Engineer to operate and mature its production Kubernetes and OpenShift platforms across secure on-premises and hybrid environments.

United Kingdom Full-time Senior Site Reliability Engineer (SRE)

Ansible Argo CD CI/CD Flux GitHub Actions GitOps Go Grafana Helm Juniper Kubernetes Linux Load Balancing Machine Learning OpenID Connect OpenShift OpenTelemetry Palo Alto Prometheus Python SAML Shell Scripting Terraform

6 hours, 15 minutes ago

Apply

6 hours, 15 minutes ago

Senior DevOps Engineer - Cloud Operations

Black Duck Inn 1K-5K Internet Software & Services

Black Duck Software is hiring a Sr. DevOps Engineer, Cloud Operations to own and operate global customer-facing SaaS and hosted infrastructure on Google Cloud Platform for enterprise applications.

United States Full-time Lead DevOps Engineer Site Reliability Engineer (SRE)

$136k-$168k

Argo CD Bash CI/CD DevSecOps DNS GCP GitHub Actions GitOps Go HashiCorp Vault Helm Java Kubernetes Load Balancing Microservices Python Terraform TLS

7 hours, 41 minutes ago

Apply

7 hours, 41 minutes ago