Senior Software Engineer - Grafana Databases, Managed Services | Ireland | Remote

3 hours, 17 minutes ago
Full-time
Senior
DevOps and Infrastructure
Grafana

Grafana

Grafana is the open observability platform providing analytics, monitoring, and visualization solutions with a focus on user control and cost efficiency.

IT Services
1K-5K
Founded 2014
$535M raised

Description

  • Operate and evolve 100+ multi-cloud streaming clusters and related database infrastructure in production.
  • Diagnose and resolve cross-layer failure modes affecting storage, query performance, control planes, and scaling.
  • Design and execute safe upgrade and rollout strategies across production clusters.
  • Improve observability, automation, and operational ergonomics for shared infrastructure.
  • Partner with database and platform teams on scaling, partitioning, consumer fan-out, and query performance.
  • Work directly with distributed systems, Kubernetes scheduling, storage engines, and compression trade-offs.
  • Serve as a primary escalation point and participate in on-call and incident response.
  • Own vendor relationships for the systems and services used by the team.
  • Review code, contribute to design documents, and improve automation and tooling to reduce operational risk.
  • Share distributed systems knowledge and best practices with partner teams.

Requirements

  • 6+ years of engineering experience, including time in SRE, platform engineering, production engineering, infrastructure engineering, or distributed systems roles.
  • Experience operating distributed systems in production, such as streaming systems, analytical databases, or large-scale storage backends.
  • Strong Kubernetes experience in AWS, GCP, or Azure.
  • Familiarity with infrastructure-as-code tools such as Helm, Terraform, or Jsonnet.
  • Solid understanding of distributed systems design and large-scale system trade-offs.
  • Proficiency in at least one programming language, with Go preferred.
  • Working knowledge of Linux internals, networking, cloud storage, and performance/scaling behavior.
  • Experience participating in blameless incident response and writing high-quality post-incident reviews.
  • Clear communication skills and the ability to collaborate across teams while working autonomously.
  • Applicants must be able to work from Ireland time zones.
  • Experience with systems such as Kafka, Redpanda, WarpStream, Postgres, ClickHouse, Snowflake, or Cassandra is relevant and desirable.

Benefits

  • Base salary range of EUR 104,000 to EUR 124,800 in Ireland.
  • Equity through Restricted Stock Units (RSUs) for all roles.
  • Bonus eligibility, if applicable.
  • Global annual leave policy of 30 days per year.
  • 3 days of annual leave reserved for Grafana Shutdown Days.
  • 100% remote, global work environment.
  • Company-funded access to AI coding assistants and frontier models within security guidelines.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Site Reliability Engineer

66degrees 251-1K IT Services

66degrees is hiring a Site Reliability Engineer to help enterprise cloud clients maintain, optimize, and scale Google Cloud environments through reliability engineering, automation, and incident response.

Agile Datadog GCP JIRA Kanban Kubernetes Linux Prometheus Python Scrum SQL Server Terraform
2 hours, 32 minutes ago

Site Reliability Engineer

Arbor 51-250 IT Services

Arbor is hiring a Remote Site Reliability Engineer to help ensure platform resilience, performance, availability, and scalable service delivery across its school management systems.

Agile Datadog Docker Kanban Nginx Prometheus Terraform
2 hours, 47 minutes ago

Senior Site Reliability Engineer

OfficeSpace Software 251-1K Internet Software & Services

OfficeSpace Software is hiring a Senior Site Reliability Engineer to own the performance, reliability, and cost efficiency of its production platform at scale while helping modernize operations with AI-assisted reliability engineering.

Ansible Apache Argo CD CI/CD Datadog GitOps Grafana Kubernetes Linux MariaDB Microservices MySQL Nginx PostgreSQL Prometheus Puppet Python Redis Ruby Ruby on Rails Sidekiq Terraform
3 hours, 20 minutes ago

Senior Database Reliability Engineer

Sezzle 251-1K Diversified Financial Services

Sezzle is hiring a Senior Database Reliability Engineer to design, build, and scale the database platform that supports its applications and helps teams use databases more reliably, securely, and efficiently.

AWS CI/CD Datadog Elasticsearch Encryption Git Go Grafana GraphQL Helm Kubernetes Microservices MySQL New Relic OpenTelemetry PostgreSQL Prometheus Python React React Native REST API Secrets Management Terraform TypeScript
6 hours, 17 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers