Staff Software Engineer - Grafana Cloud k6 | Germany | Remote

3 hours, 3 minutes ago
Full-time
Lead
Software Development
Grafana

Grafana

Grafana is the open observability platform providing analytics, monitoring, and visualization solutions with a focus on user control and cost efficiency.

IT Services
1K-5K
Founded 2014
$535M raised

Description

  • Build and scale a culture of operational excellence by defining standards and coaching teams to own reliability and availability.
  • Drive DevOps/SRE practices including incident response, PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
  • Establish and apply reliability frameworks such as SLIs/SLOs and error budgets to guide prioritization and engineering trade-offs.
  • Provide visibility into system health through operational metrics and reliability reporting.
  • Guide teams in the design, development, evolution, and operation of large-scale distributed cloud systems.
  • Influence product and system direction through design reviews, architectural discussions, and cross-team collaboration.
  • Share knowledge through clear documentation and technical communication to help teams build and operate systems more effectively.
  • Grow into broader application and product development leadership as the reliability foundation matures.

Requirements

  • Strong experience with DevOps/SRE practices, including operating and evolving production systems at scale.
  • Strong programming background in a modern language; Python and Go are primary, but prior experience with them is not required.
  • Experience designing, building, and operating large-scale distributed systems.
  • Strong understanding of reliability engineering concepts such as incident management, observability, and failure modes.
  • Experience with test automation, including performance and functional testing.
  • Ability to influence engineering practices through clear technical communication, reviews, and collaboration.
  • Strong interpersonal skills and ability to work effectively across teams.
  • Familiarity with modern software engineering processes and delivery practices.
  • Self-driven and comfortable operating with a high degree of autonomy and ambiguity.
  • Experience with containerized and cloud-native systems such as Docker, Kubernetes, and AWS.
  • Familiarity with observability tooling and platforms, such as the Grafana stack.
  • Experience working with Python, Go, JavaScript, and/or Jsonnet.
  • Experience building or operating event-driven or asynchronous systems.
  • Experience defining or applying SLIs/SLOs, error budgets, or reliability metrics.
  • Interest in, or experience with, building testing frameworks or developer tooling.

Benefits

  • Base compensation in Germany of EUR 109,709 to EUR 131,651.
  • Equity eligibility.
  • Bonus eligibility if applicable.
  • 100% remote, global work environment.
  • 30 days of annual leave, including 3 Grafana Shutdown Days.
  • In-person onboarding with the new-hire cohort.
  • Career growth pathways and development opportunities.
  • Access to modern AI coding assistants and a company-funded usage budget.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

DevOps Engineer / SRE

Fundraise Up 51-250 Capital Markets

Fundraise Up is hiring a DevOps Engineer/SRE to own on-premise infrastructure and keep its global fundraising platform stable, fast, and secure.

Ansible Bash CI/CD ClickHouse Elasticsearch Git GitOps HAProxy HashiCorp Vault Jenkins Kafka Koa Kubernetes Linux MongoDB NestJS Nginx Node.js Prometheus Python React Redis Terraform TypeScript Ubuntu Vue.js
55 minutes ago

Site Reliability Engineer

Obsidian Security 51-250 Internet Software & Services

Obsidian Security is hiring a Site Reliability Engineer in the UK to help ensure the reliability, scalability, and operational excellence of its multi-tenant SaaS platform for enterprise and financial customers.

Argo CD AWS Datadog GCP GitHub Actions GitOps Grafana Helm Kubernetes Microservices Prometheus
1 hour, 16 minutes ago

Tech Lead, Web Core Product & Chrome Extension - Bishkek, Kyrgyzstan

Speechify 51-250 Internet Software & Services

Speechify is hiring a web product engineer to help build and ship text-to-speech experiences used by millions across its distributed product team.

Firebase JavaScript React TypeScript
1 hour, 17 minutes ago

Tech Lead, Android Core Product - Tampa, FL, USA

Speechify 51-250 Internet Software & Services

Speechify is hiring a Senior Android Engineer to help scale its high-traffic text-to-speech Android app and shape new product experiences for a global, fully distributed team.

Android iOS Jetpack Compose JUnit Kotlin Node.js
1 hour, 29 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers