Obsidian Security

Obsidian Security

Obsidian Security is a Southern California-based company at the forefront of cybersecurity, artificial intelligence, and hybrid cloud environments. They offer a comprehensive security solution for businesses, including advanced threat protection, insid...

Internet Software & Services
51-250
Founded 2017
$30M raised

Description

  • Improve the reliability, availability, and resiliency of production systems and distributed services.
  • Build and maintain monitoring, alerting, dashboards, and observability tooling to improve system visibility and reduce operational noise.
  • Support incident response, on-call operations, troubleshooting, and postmortem processes.
  • Partner with engineering teams to implement SLI/SLO practices, operational standards, and reliability-focused workflows.
  • Automate infrastructure operations, deployment workflows, and platform tooling across Kubernetes, cloud infrastructure, and data pipelines.
  • Help detect and resolve production issues quickly and contribute to continuous operational improvement.
  • Collaborate closely with DevOps, Platform Engineering, and product teams to improve service resilience across the platform.

Requirements

  • 2–5 years of experience in Site Reliability Engineering, DevOps, Production Engineering, or related roles.
  • Experience operating and supporting production systems in AWS and/or GCP.
  • Familiarity with Kubernetes and Helm in cloud-native environments.
  • Experience with observability and monitoring tools such as Prometheus, Grafana, Datadog, or similar platforms.
  • Exposure to CI/CD systems such as GitLab CI/CD, GitHub Actions, ArgoCD, or equivalent.
  • Strong troubleshooting and debugging skills across distributed systems and microservices.
  • Experience writing automation or infrastructure tooling using scripting or programming languages.
  • Strong systems thinking and a collaborative engineering mindset.
  • AI Agent development experience is preferred.
  • Experience supporting SaaS platforms in production environments is preferred.
  • Familiarity with incident management and postmortem practices is preferred.
  • Exposure to infrastructure-as-code and GitOps workflows is preferred.
  • Understanding of SLI/SLO concepts and operational metrics is preferred.
  • Experience with enterprise-scale monitoring or customer-facing production systems is preferred.

Benefits

  • Competitive salary in the £85,000–£103,000 GBP base pay range.
  • Competitive compensation with equity and 401(k).
  • Comprehensive healthcare with dental and vision coverage.
  • Flexible paid time off plus paid holiday time off.
  • 12 weeks of new parent or family leave.
  • Personal and professional development resources.
  • Potential eligibility for equity awards.
  • Potential eligibility for sales commission or incentive compensation, depending on role or function.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Manager, Software Engineering

Anduril Industries 1K-5K Aerospace & Defense

Anduril Industries is seeking a Senior Manager to lead CorpTech Platform software teams that build and operate AI-enabled production systems and improve how internal engineering work is designed, shipped, and maintained.

CI/CD Computer Vision ERP LLM Microservices
38 minutes ago

Staff Site Reliability Engineer

Puck 1-10 Internet Software & Services

Domino is hiring a senior Site Reliability Engineer to build AI-assisted reliability systems and strengthen the operational resilience of its cloud-based data science platform.

Go Kubernetes Linux LLM Python
1 hour ago

DevOps Engineer / SRE

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a DevOps Engineer/SRE to own on-premise infrastructure and keep its global fundraising platform stable, fast, and secure.

Ansible Bash CI/CD ClickHouse Elasticsearch Git GitOps HAProxy HashiCorp Vault Jenkins Kafka Koa Kubernetes Linux MongoDB NestJS Nginx Node.js Prometheus Python React Redis Terraform TypeScript Ubuntu Vue.js
1 hour, 54 minutes ago

Senior Database Reliability Engineer

Sezzle 251-1K Diversified Financial Services

Sezzle is hiring a Senior Database Reliability Engineer to design and scale the database platform behind its applications, with a focus on making database usage safer, more reliable, and easier for developers across the company.

AWS CI/CD Datadog Elasticsearch Encryption Git GitLab Go Grafana Helm Kubernetes Microservices MySQL New Relic OpenTelemetry PostgreSQL Prometheus Python React React Native Secrets Management Terraform TypeScript
2 hours, 40 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers