Staff Software Engineer, Production Engineering

Uber Uber · Consumer · San Francisco, CA · Engineering

Staff Software Engineer, Production Engineering role at Uber, focusing on building and maintaining software to enhance the reliability, scalability, and efficiency of production services. The role involves applying SRE concepts, leading initiatives, and improving system performance and developer productivity within a large-scale, complex environment.

What you'd actually do

  1. Design, build, and maintain software to increase the reliability, scalability, and efficiency of thousands of stateless and stateful production services spread across multiple datacenter zones and regions.
  2. Lead initiatives end-to-end within the team, the Production Engineering org, and across engineering at large to increase reliability through automation, setting standards, developer tooling, and reusable frameworks.
  3. Work with other engineers to deeply understand their services and guide them towards practical and reliable architecture and implementation.
  4. Apply SRE concepts such as observability, integration/load/chaos testing, on-call, incident management, failovers, and disaster recovery to improve mean time between failures (MTBF), time to detection (TTD), and time to mitigation (TTM) of incidents.
  5. Participate in on-call rotations, responding to and leading mitigation of production incidents, and driving post-incident improvements.

Skills

Required

  • Go
  • Java
  • Python
  • end-to-end solution delivery
  • technical proposal writing
  • driving engineering alignment

Nice to have

  • SRE/reliability engineering
  • incident management
  • platform/infrastructure engineering
  • compute platform
  • software networking
  • online storage
  • developer platform
  • observability
  • mentoring
  • team leadership