Senior Software Engineer - Grafana Databases, Managed Services | UK | Remote

Grafana Labs · Data AI · Canada, Ireland, Spain, UK, United States · Remote · R&D : Databases

Senior Software Engineer role focused on operating and evolving production-critical, high-throughput, multi-cloud streaming clusters and related database infrastructure. The role involves diagnosing and eliminating failure modes, designing safe upgrade strategies, improving observability and automation, and partnering with database and platform teams. It emphasizes distributed systems behavior, Kubernetes, storage engines, and compression trade-offs, with an on-call component.

What you'd actually do

  1. Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure
  2. Diagnosing and eliminating cross-layer failure modes (e.g., object storage latency, noisy neighbors, control-plane bottlenecks, query performance regressions)
  3. Designing safe upgrade and rollout strategies at scale
  4. Improving observability, automation, and operational ergonomics
  5. Partnering closely with database and platform teams to ensure safe scaling, partitioning, consumer fan-out, and query performance

Skills

Required

  • distributed systems
  • Kubernetes
  • high-throughput systems
  • multi-cloud infrastructure
  • observability
  • automation
  • storage engines
  • compression trade-offs
  • incident response
  • communication skills

Nice to have

  • experience with Grafana Mimir, Loki, or Tempo
  • experience with WarpStream