Senior Incident Commander

Aurora Innovation · Robotics · DFW1 · Software Technology Foundations

The Senior Incident Commander leads the enterprise-wide technical incident response function for Aurora's autonomous vehicle platform, on-vehicle systems, enterprise IT infrastructure, and supporting NOC/SOC. The role provides operational leadership of the tactical response to complex, high-severity incidents across software, hardware, IT, and cybersecurity domains, which means making decisive real-time decisions and coordinating stakeholders to resolve events. It also includes designing and executing projects to mature response capabilities and observability, leading incident retrospectives, and delivering executive communications.

What you'd actually do

  1. Lead Incident Command: Serve as Incident Commander, establishing command and control over technical cross-functional incidents. You will orchestrate response efforts, make decisive real-time decisions, and establish a common operating picture for all stakeholders.
  2. Manage Escalations & Stakeholders: Manage and prioritize escalations, ensuring targeted stakeholder representation (e.g., Software/Hardware, Cybersecurity, Network, IT, Legal, Ops, Safety, Policy, & Communications) throughout the incident lifecycle.
  3. Drive Continuous Improvement: Lead thorough incident retrospectives to accurately document events and identify root causes. You will define actionable follow-ups and track their implementation to continuously enhance safety, efficiency, and overall incident response capabilities for our distributed team, ensuring lessons learned are integrated back into processes.
  4. Develop Cross-Functional Partnerships: Develop robust cross-functional partnerships to proactively understand stakeholder concerns during events. You will collaborate closely with a wide variety of technical teams, providing critical support and guidance during real-time incident response to maintain seamless 24/7 coverage.
  5. Deliver Executive Communications: Deliver executive-level communications on active incidents and present comprehensive analyses of important outcomes, trends, and strategic insights from past events to leadership and a global audience.

Skills

Required

  • 5+ years of direct experience in a command-oriented role such as incident management, crisis response, Site Reliability Engineering (SRE), NOC/SOC leadership, or military/first-responder operations, having served as an Incident Commander or lead responder for high-stakes technical events.
  • A strong, holistic understanding of the interconnectedness of complex systems, including on-vehicle compute, enterprise IT infrastructure, cloud services, Network Operations (NOC), and Security Operations (SOC).
  • Understanding of the principles of monitoring and observability in complex, distributed systems.
  • Demonstrated capability to remain calm and decisive under pressure, efficiently coordinating response efforts and making sound judgments in rapidly evolving, high-stakes situations.
  • Experience briefing senior executives and leadership in real-time on complex, fast-moving incidents, articulating technical details, impact, and proposed resolutions clearly and concisely.
  • Strong command and control presence.
  • Exceptional attention to detail with a track record of meticulous

What the JD emphasized

  • command and control
  • technical cross-functional incidents
  • high-severity incidents
  • real-time decisions
  • incident response capabilities
  • incident retrospectives
  • senior executives and leadership