Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring | Spain | Remote

2 hours, 26 minutes ago
Full-time
Lead
Software Development
Grafana

Grafana

Grafana is the open observability platform providing analytics, monitoring, and visualization solutions with a focus on user control and cost efficiency.

IT Services
1K-5K
Founded 2014
$535M raised

Description

  • Design and implement scalable integrations for infrastructure components, applications, and data ingestion pipelines.
  • Create middleware components and libraries that simplify observability solution development and maintenance.
  • Build and maintain backend systems for Cloud Provider Observability, Database Observability, and Kubernetes Monitoring.
  • Develop dashboards, alerts, documentation, and infrastructure for observability products.
  • Collaborate with product, design, docs, Sales, and Support teams to deliver aligned customer experiences.
  • Lead technical direction and contribute to strategic discussions about observability solutions.
  • Estimate, plan, coordinate, and deliver large cross-system technical initiatives.
  • Coach and mentor team members and help identify process or technology issues.
  • Represent Grafana Labs in open source forums, working groups, and events when needed.
  • Contribute to open source projects and communities, including Alloy, Prometheus, OpenTelemetry, Beyla, and related efforts.

Requirements

  • 8+ years of experience with at least one major programming language such as Python, .NET, Java, Go, or Rust.
  • Experience operating high-scale production systems on Kubernetes, including monitoring, on-call participation, incident response, and postmortem practices.
  • Familiarity with observability tooling such as Grafana.
  • Strong understanding of time-series data, metrics cardinality challenges, and cost/performance tradeoffs in observability systems.
  • Experience in a hands-on technical leadership role, including setting technical direction and influencing architecture beyond your immediate team.
  • Deep understanding of distributed systems concepts, including scalability, consistency, high availability, and failure modes.
  • Experience writing clean, maintainable, robust, and performant software.
  • Experience delivering projects end-to-end in a self-driven manner.
  • Excellent problem-solving and debugging skills.
  • Strong mentoring and leadership skills.
  • Passion for observability and willingness to write documentation and blog posts.
  • Relevant open source experience, ideally in the observability domain.
  • Willingness to become an active member of the OpenTelemetry and Prometheus communities.
  • Comfort operating production services and organizing on-call, preferred.
  • Experience with Prometheus in high-cardinality, multi-tenant environments, preferred.
  • Experience with OpenTelemetry Collector pipelines or similar telemetry ingestion systems, preferred.
  • Kubernetes certification such as CKA, CKAD, or another CNCF Kubernetes certification, preferred.
  • Experience developing Kubernetes operators, controllers, or custom resources, preferred.
  • Strong understanding of metrics collection, visualization, and alerting concepts, preferred.
  • Experience contributing to or maintaining open source projects, preferred.
  • Experience designing and building observability backends for various systems and applications, preferred.

Benefits

  • Competitive compensation in Spain of EUR 94,025 to EUR 112,830, depending on level, experience, and skillset.
  • Restricted Stock Units (RSUs) included for all roles.
  • 100% remote, global work environment.
  • Global annual leave policy of 30 days per year.
  • 3 days of annual leave reserved for Grafana Shutdown Days.
  • In-person onboarding with fellow new hires.
  • Defined career growth pathways.
  • Access to company-funded AI coding assistant usage budget and frontier models, within security guidelines.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Software Engineer - Python and Data Ecosystem

ClickHouse 51-250 IT Services

ClickHouse is hiring a Senior Software Engineer to own and evolve Python integrations that connect its real-time analytics platform with orchestration, transformation, and AI/data tooling used by data practitioners.

Apache Airflow Apache Spark ClickHouse Dagster dbt Flink LLM Machine Learning Metabase NumPy Pandas Power BI Prefect Python SQL Superset Tableau
11 minutes ago

Backend Developer (Node.js)

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a Backend Developer to build and scale the high-load infrastructure behind its global nonprofit fundraising platform.

Bull ClickHouse Datadog Elasticsearch Grafana Kafka Koa MongoDB NestJS Node.js Prometheus RabbitMQ React Redis REST API TypeScript Vue.js
41 minutes ago

Engineering Team Leader, DX Platform

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a Team Lead for its Donor Experience Platform to guide the development of an internal API powering online donation products for global nonprofit clients.

Bull ClickHouse Elasticsearch Kafka Koa MongoDB NestJS Node.js React Redis REST API SPA TypeScript Vue.js
1 hour, 11 minutes ago

Backend Developer (Node.js)

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a Backend Developer to help build and scale the high-load systems behind its global nonprofit fundraising platform.

Bull ClickHouse Datadog Elasticsearch Grafana Kafka Koa MongoDB NestJS Node.js Prometheus RabbitMQ React Redis REST API TypeScript Vue.js
1 hour, 41 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers