Staff Software Engineer, Platform Infrastructure

Aurora Innovation · Robotics · PITHQ · Software Technology Foundations

This Staff Software Engineer, Platform Infrastructure role at Aurora Innovation focuses on stabilizing and modernizing the Offline Testing Infrastructure (OTI). It involves leading a team to improve the velocity of engineering teams and the reliability of release cycles by optimizing the testing ecosystem, owning critical OTI components, and establishing architecture and best practices for large-scale compute use cases in a self-driving technology context.

What you'd actually do

  1. Lead the OTI Team: Serve as the technical lead (TL) for the OTI team within PIE-Compute, driving the strategic vision, execution, and long-term stability of the core infrastructure.
  2. Lead the design of the next-generation offline testing architecture to meet diverse team needs, reducing redundancy and siloing across the organization.
  3. Partner with Test Creation and Test Drive teams to standardize end-to-end test execution and reporting (Creation -> Execution -> Reporting).
  4. Take ownership of the shared OTI components, including maintenance and on-call support.
  5. Define and enforce data management policies for the testing ecosystem (storage, lifecycling, write strategies, data integrity, and lineage).

Skills

Required

  • Senior or Staff-level experience (P7 equivalent) as a Software Engineer, ideally in infrastructure, developer tooling, or critical shared services.
  • Proven experience leading technical projects and mentoring/directing other engineers.
  • Familiarity with distributed compute technologies, cloud services (e.g., AWS), and large-scale workflow management systems.
  • Demonstrated ability to triage, debug, and perform on-call and incident management for complex, cross-cutting infrastructure issues.
  • Strong communication skills to manage stakeholder alignment and drive cross-team standardization efforts.

What the JD emphasized

  • stabilizing and modernizing our Offline Testing Infrastructure (OTI)
  • critical, shared middle-layer infrastructure
  • improving the velocity of our engineering teams and the reliability of our release cycles
  • stable, performant, and scalable platform
  • testing ecosystem
  • end-to-end test execution and reporting
  • full test lifecycle
  • offline test modalities
  • common OTI tooling
  • test evaluations
  • offline testing
  • P7 equivalent