Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring | United Kingdom | Remote

1 hour, 29 minutes ago
Full-time
Lead
Software Development
Grafana

Grafana is the open observability platform providing analytics, monitoring, and visualization solutions with a focus on user control and cost efficiency.

IT Services
1K-5K
Founded 2014
$535M raised

Description

  • Design and implement scalable integrations for infrastructure components, applications, and data ingestion pipelines.
  • Build middleware components and libraries that simplify development and maintenance of observability solutions.
  • Own backend services for opinionated applications such as Cloud Provider Observability, Database Observability, and Kubernetes Monitoring.
  • Develop dashboards, alerts, documentation, and infrastructure that support the Cloud Observability stack.
  • Collaborate with product, design, docs, sales, and support teams to deliver features and a cohesive customer experience.
  • Lead technical direction and contribute to strategic decisions for the team’s observability solutions.
  • Estimate, plan, coordinate, and deliver large cross-system initiatives.
  • Mentor and coach other team members while helping resolve technology and product process issues.
  • Represent Grafana Labs in open source forums, working groups, and events when needed.
  • Contribute to open source projects and community efforts such as Alloy, Prometheus, OpenTelemetry, and Beyla.

Requirements

  • 8+ years of experience with at least one major programming language such as Python, .NET, Java, Go, or Rust.
  • Experience operating high-scale production systems on Kubernetes, including monitoring, on-call participation, incident response, and postmortems.
  • Familiarity with observability tooling such as Grafana.
  • Strong understanding of time-series data, metrics cardinality challenges, and observability cost/performance tradeoffs.
  • Experience in a hands-on technical leadership role influencing architecture and setting technical direction.
  • Deep knowledge of distributed systems concepts including scalability, consistency, high availability, and failure modes.
  • Experience writing clean, maintainable, robust, and performant software.
  • Demonstrated ability to deliver projects from start to finish in a self-driven manner.
  • Excellent problem-solving and debugging skills.
  • Strong mentoring and leadership skills.
  • Experience operating or scaling Prometheus in high-cardinality, multi-tenant environments is preferred.
  • Experience with OpenTelemetry Collector pipelines or similar telemetry ingestion systems is preferred.
  • A Kubernetes certification such as CKA, CKAD, or another CNCF Kubernetes certification is preferred.
  • Experience developing Kubernetes operators, controllers, or custom resources is preferred.
  • Experience contributing to or maintaining open source projects is preferred.
  • Experience designing and building observability backends for various systems and applications is preferred.

Benefits

  • Remote-first, fully remote global work environment.
  • Compensation in the UK range of GBP 103,958 to GBP 124,750, depending on level, experience, and skillset.
  • Restricted Stock Units (RSUs) for all roles.
  • Global annual leave policy of 30 days per year.
  • Three annual leave days reserved for Grafana Shutdown Days.
  • In-person onboarding for new hires.
  • Access to modern AI coding assistants with a company-funded usage budget.
  • Access to frontier models for development work, subject to security guidelines.
