Engineering Manager - Platform Reliability

Databricks Databricks · Data AI · London, United Kingdom · Engineering

Engineering Manager for Platform Reliability at Databricks, focusing on building and scaling the data and AI infrastructure platform. The role involves hiring and developing engineers, ensuring high technical standards, and working with leadership on roadmaps. Responsibilities include leading the development of resource management infrastructure, reliable distributed services, and tools for operating services across clouds, with a focus on supporting big data and machine learning workloads.

What you'd actually do

  1. Hire great engineers to build an outstanding team.
  2. Support engineers in their career development by providing clear feedback and develop engineering leaders.
  3. Ensure high technical standards by instituting processes (architecture reviews, testing) and culture (engineering excellence).
  4. Work with engineering and product leadership to build a long-term roadmap.
  5. Coordinate execution and collaborate across teams to unblock cross-cutting projects.

Skills

Required

  • 5+ years of Engineering experience
  • 2+ years of Engineering Management experience
  • Experience with large-scale distributed services and the processes around testing, monitoring, and SLAs
  • Ability to align multiple stakeholders on competing priorities
  • Able to balance short-term delivery against long-term stability

Nice to have

  • BS (or higher) in Computer Science, or a related field.