Program Leader, Outage Management Lead – Community Operations

Uber Uber · Consumer · Phoenix, AZ +1 · Community Operations

This role leads Uber's global customer support coordination during outages and high-severity incidents. It involves owning and evolving the strategy, processes, and operating model for outage management, including strengthening detection, improving stakeholder engagement, and driving technology adoption. The role also involves leading the Ops Commander function and crisis managers, and translating incident learnings into durable improvements. While the role mentions driving technology strategy and potentially implementing AI-enabled operational solutions, its core focus is on program leadership and operational excellence in incident response, not direct AI/ML development.

What you'd actually do

  1. Lead the development of the vision and support strategy for outage management and severe incident management within the Community Operations org
  2. Design and continuously improve the processes, playbooks, and escalation/routing standards for both standard outages and critical outages
  3. You will lead the 24x7 Ops Commander team through a Team Leader who oversees day-to-day operations.
  4. You will elevate support anomaly detection maturity in partnership with Engineering and Product, improving signal quality, reducing noise, and accelerating escalation precision for outages.
  5. You will develop business cases and partner cross-functionally to implement tools and automation that improve detection speed, impact visibility, stakeholder routing, resolution verification, and executive reporting.

Skills

Required

  • program management
  • incident management
  • customer operations
  • team leadership
  • cross-functional collaboration
  • process design
  • strategic thinking
  • decision-making
  • execution
  • communication

Nice to have

  • AI-enabled operational solutions
  • ITIL or equivalent service management certification

What the JD emphasized

  • high-severity or crisis / incident / outage environments requiring structured decision-making under pressure