Senior Backend Engineer | Mimir Oss | Germany | Remote

Grafana Labs Grafana Labs · Data AI · EMEA, Germany, Ireland, Spain, Sweden, United Kingdom · Remote · R&D : Databases

Senior Backend Engineer for Grafana Mimir OSS, focusing on large-scale, high-performance distributed systems for metrics, logs, and traces. The role involves designing, building, operating, and maintaining critical systems, contributing to open-source projects, and participating in on-call rotations. The company encourages the use of AI coding assistants and provides access to frontier models for developer productivity.

What you'd actually do

  1. Work with your team to deliver new features solving customer problems and enhancing operations on a large database at scale
  2. Drive projects from initial idea all the way to operations once it is in the hands of customers
  3. Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability
  4. Be a part of your team’s follow-the-sun on-call rotations and take ownership of the services you’re running
  5. Support other team members, participate in design discussions and collaborate with the team

Skills

Required

  • Solid experience with at least one programming language (Go preferred, Python, C, C++, Rust acceptable)
  • Some experience with delivering projects from gathering requirements, brainstorming ideas all the way to shipping a product to the customer’s hands in a self-driven way
  • Some experience with developing software that runs in the Cloud or some experience with systems engineering
  • Some experience with being on-call and following the DevOps model
  • Experience writing clean, robust, and performant software that is easily maintained by others
  • Familiarity with observability systems, know when to use metrics, logs, traces, to debug a problem

Nice to have

  • Experience working with Kubernetes
  • Experience working with queue systems, e.g. the Kafka protocol
  • Been a user of Grafana and Prometheus in operational roles
  • Exposure to microservices architecture and distributed systems, or a desire to learn
  • Familiarity with the concept of infrastructure as code

What the JD emphasized

  • large-scale, high-performance distributed systems
  • efficiency, resilience, and reliability are essential
  • operating the system in production
  • owning the reliability, performance, and availability
  • on-call rotation