Senior Software Engineer - Grafana Databases, Managed Services | Spain | Remote

Grafana Labs Grafana Labs · Data AI · Canada, Ireland, Spain, UK, United States · Remote · R&D : Databases

Senior Software Engineer role focused on operating and evolving shared, production-critical infrastructure for Grafana Cloud's next-generation database products (Mimir, Loki, Tempo). The role involves managing multi-cloud streaming clusters, diagnosing failure modes, designing upgrade strategies, improving observability and automation, and partnering with database and platform teams. It emphasizes distributed systems, Kubernetes, storage engines, and operational excellence, with an on-call component.

What you'd actually do

  1. Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure
  2. Diagnosing and eliminating cross-layer failure modes (e.g., object storage latency, noisy neighbors, control-plane bottlenecks, query performance regressions, etc.)
  3. Designing safe upgrade and rollout strategies at scale
  4. Improving observability, automation, and operational ergonomics
  5. Partnering closely with database and platform teams to ensure safe scaling, partitioning, consumer fan-out, and query performance

Skills

Required

  • 6+ years of engineering experience
  • distributed systems
  • Kubernetes
  • high-throughput systems
  • multi-cloud infrastructure
  • observability
  • automation
  • incident response
  • communication skills

Nice to have

  • experience with Grafana Mimir, Loki, or Tempo
  • experience with WarpStream
  • storage engines
  • compression trade-offs
  • vendor management