Senior Engineering Manager - Platform Metal | Ireland | Remote

Grafana Labs · Data AI · EMEA, Germany, Spain, Sweden, United Kingdom · Remote · R&D: Platform

The Senior Engineering Manager for Platform Metal at Grafana Labs is responsible for building and provisioning infrastructure, including storage, networking, and Kubernetes clusters, on physical hardware. The role involves managing a team, collaborating with leadership, and fostering a productive environment. While the company uses AI tools for developer productivity, the core function of the role is infrastructure management, not direct AI/ML model development.

What you'd actually do

  1. Hire, manage, and develop a team of engineers, providing regular feedback and supporting each person’s growth through career conversations.
  2. Collaborate closely with product, design, and engineering leadership to define goals that move the squad forward and align with broader business objectives.
  3. Guide the team throughout the project lifecycle, from early ideation through planning, execution, and post‑launch, so that it delivers high‑quality results consistently.
  4. Foster a psychologically safe environment where engineers can learn, experiment, and iterate quickly, encouraging innovation and a culture of continuous improvement.
  5. Work across team boundaries by contributing to projects outside your team’s scope and partnering with other functions to solve customer problems.

Skills

Required

  • Experience building and managing on-prem/bare metal environments
  • Strong bias toward action and ownership
  • Ability to navigate ambiguity and drive work forward in complex problem spaces
  • Experience collaborating across time zones and cultures in a fully remote environment
  • Clear, empathetic communication
  • Experience guiding and supporting other engineers, leading initiatives or small teams, and collaborating effectively to drive projects to successful outcomes
  • Demonstrated ability to manage complexity, align stakeholders, and elevate team performance
  • Deeply customer-focused
  • Solid understanding of distributed systems principles: scalability, fault tolerance, consistency, production-readiness, and observability
  • Curious and comfortable using AI-powered tools
  • Practical experience folding AI tools into a team’s workflow

Nice to have

  • Running Kubernetes and/or Ceph in production environments
  • Experience dealing with colo facilities and associated vendors
  • Worked in or on open source, or other community-based projects

What the JD emphasized

  • Building and managing on-prem/bare metal environments
  • Comfort with AI-assisted tooling
  • Running Kubernetes and/or Ceph in production environments