Staff Backend Engineer - Databases Tempo | Canada | Remote

Grafana Labs Grafana Labs · Data AI · Canada, United States · Remote · R&D : Databases

Staff Backend Engineer focused on Grafana Tempo, an open-source distributed tracing backend. The role involves evolving Tempo into a platform for observability products, including AI-driven assistants, improving performance, and preparing for agent-driven workloads. Responsibilities include leading technical initiatives, owning architecture, designing APIs for humans and agents, driving operational excellence, and contributing to open source.

What you'd actually do

  1. Lead multi-quarter technical initiatives from problem framing through rollout, e.g., trace aggregation APIs, Limitless Tempo, autoscaling cells and customer limits, or query engine improvements.
  2. Own the architecture of core Tempo components: ingestion, storage, query, and metrics generation. Drive design reviews, make sharp trade-offs on performance, cost, and complexity, and document the “why” for the team.
  3. Design APIs for humans and agents. Shape the next generation of Tempo’s interfaces (structured, deterministic, discoverable) so that Act 3 products, LLM-driven assistants, and external integrators can build on Tempo reliably.
  4. Drive operational excellence. Own outcomes against concrete SLOs (P99 write latency, incident recurrence, TCO per ingested GB) and push the team toward Zero Ops through automation, parameterized rollouts, and actionable alerts.
  5. Partner with Product and sibling teams. Work closely with PMs and with App Observability, Asserts, Drilldown, and Grafana Assistant teams to understand how Tempo gets consumed and to ship what unblocks them.

Skills

Required

  • backend engineering
  • distributed systems
  • observability
  • tracing
  • API design
  • performance optimization
  • operational excellence
  • open source contribution
  • technical leadership
  • mentorship

Nice to have

  • Golang
  • Grafana
  • Tempo
  • Loki
  • Mimir
  • cloud infrastructure
  • Kubernetes
  • AI/ML concepts
  • LLM integration

What the JD emphasized

  • set technical direction on the hardest problems
  • raise the bar across the team
  • hardest problems in our roadmap
  • set technical direction
  • raise the bar
  • hardest problems
  • hard problems
  • hard problems