Senior Software Engineer - Grafana Databases, Managed Services | Spain | Remote

1 hour, 14 minutes ago
Full-time
Senior
DevOps and Infrastructure
Grafana

Grafana is the open observability platform providing analytics, monitoring, and visualization solutions with a focus on user control and cost efficiency.

IT Services
1K-5K
Founded 2014
$535M raised

Description

  • Operate and evolve 100+ multi-cloud streaming clusters and related database infrastructure in production.
  • Diagnose and eliminate cross-layer failure modes affecting latency, scalability, and reliability.
  • Design safe upgrade and rollout strategies across large production environments.
  • Improve observability, automation, and operational ergonomics for shared infrastructure.
  • Partner with database and platform teams on scaling, partitioning, consumer fan-out, and query performance.
  • Work directly with distributed systems, Kubernetes scheduling, storage engines, and compression trade-offs.
  • Serve as a primary escalation point and participate in on-call incident response.
  • Own vendor relationships with system providers such as WarpStream Labs.
  • Define and review SLOs, and reduce error budget consumption through monitoring, automation, and system design improvements.
  • Participate in design reviews, PR reviews, automation, tooling, code improvements, and post-incident reviews.

Requirements

  • 6+ years of engineering experience, including time in SRE, platform engineering, production engineering, infrastructure engineering, or distributed systems roles.
  • Experience operating distributed systems in production, such as streaming systems, analytical databases, or large-scale storage backends.
  • Strong Kubernetes experience in AWS, GCP, or Azure.
  • Familiarity with infrastructure-as-code tools such as Helm, Terraform, or Jsonnet.
  • Solid understanding of distributed systems design and large-scale system trade-offs.
  • Proficiency in at least one programming language; Go is preferred but not required.
  • Working knowledge of Linux internals, networking, cloud storage, and performance/scaling behavior.
  • Experience participating in blameless incident response and writing high-quality post-incident reviews.
  • Clear communication skills and the ability to collaborate across teams while working autonomously.
  • Curious, pragmatic, action-oriented, and kind.
  • Experience with systems such as Kafka, Redpanda, WarpStream, Postgres, ClickHouse, Snowflake, or Cassandra is a plus.
  • At this time, remote applicants must be located in a Spain time zone.

Benefits

  • Base salary range in Spain: EUR 82,988 to EUR 99,586.
  • Equity in the form of Restricted Stock Units (RSUs).
  • Bonus eligibility, if applicable.
  • Remote-first, 100% global work environment.
  • Company-funded access to modern AI coding assistants and frontier models.
  • 30 days of annual leave, including 3 Grafana Shutdown Days.
  • In-person onboarding with new team members.
  • Career growth pathways and development opportunities.
