Senior Engineering Manager - Platform Metal | UK | Remote

Grafana Labs · Data AI · EMEA, Germany, Spain, Sweden, United Kingdom · Remote · R&D: Platform

Senior Engineering Manager for Platform Metal at Grafana Labs, responsible for building and provisioning infrastructure (storage, networking, Kubernetes clusters) on physical hardware outside of public cloud. The role involves managing a team, collaborating with leadership, and ensuring the delivery of high-quality platform services to application engineers. It does not involve building AI models directly, but candidates are expected to be comfortable with AI-assisted tooling and, ideally, to have practical experience folding AI-powered tools into team workflows.

What you'd actually do

  1. Hire, manage, and develop a team of engineers, providing regular feedback and supporting each person’s growth through career conversations.
  2. Collaborate closely with product, design, and engineering leadership to define goals that move the squad forward and align with broader business objectives.
  3. Provide guidance throughout the project lifecycle, from early ideation through planning, execution, and post‑launch, so the team delivers high‑quality results consistently.
  4. Foster a psychologically safe environment where engineers can learn, experiment, and iterate quickly, encouraging innovation and a culture of continuous improvement.
  5. Work across team boundaries by contributing to projects outside your team’s scope and partnering with other functions to solve customer problems.

Skills

Required

  • Experience building and managing on-prem/bare metal environments
  • Strong bias toward action and ownership
  • Ability to navigate ambiguity and drive work forward in complex problem spaces
  • Collaborating across time zones and cultures in a fully remote environment
  • Clear, empathetic communication to build trust, connection, and psychological safety
  • Guiding and supporting other engineers
  • Leading initiatives or small teams
  • Collaborating effectively to drive projects to successful outcomes
  • Managing complexity
  • Aligning stakeholders
  • Elevating team performance
  • Customer focus
  • Grounding decisions and technical choices in user needs and real-world impact
  • Solid understanding of distributed systems principles, including scalability, fault tolerance, consistency, production-readiness, and observability
  • Comfort with AI-assisted tooling
  • Curiosity about and comfort with AI-powered tools, ideally with practical experience folding them into a team’s workflow

Nice to have

  • Running Kubernetes and/or Ceph in production environments
  • Experience dealing with colo facilities and associated vendors
  • Previous work in or on open source or other community-based projects

What the JD emphasized

  • Building and managing on-prem/bare metal environments
  • Comfort with AI-assisted tooling
  • Running Kubernetes and/or Ceph in production environments