Staff Software Engineer - AI Platform (michelangelo)

Uber Uber · Consumer · Sunnyvale, CA · Engineering

Staff Software Engineer on Uber's ML Serving team within the AI Platform, focusing on infrastructure for real-time ML and generative AI inference at scale. Responsibilities include defining technical direction, leading cross-team initiatives, and designing foundational architectures for thousands of models in production.

What you'd actually do

  1. Define architecture and technical strategy for Uber’s ML serving and inference platforms
  2. Lead cross-team efforts to scale and evolve serving infrastructure for predictive and generative AI workloads
  3. Design systems that balance latency, cost, reliability, and developer productivity
  4. Act as a technical leader and mentor across the ML Platform organization
  5. Drive operational excellence and long-term sustainability of mission-critical ML systems

Skills

Required

  • BS or MS in Computer Science or a related technical discipline, or equivalent experience
  • 8+ years of full-time engineering experience
  • Extensive experience designing and operating large-scale distributed systems in production
  • Deep expertise in backend systems, system architecture, and performance optimization
  • Strong leadership skills with a track record of driving complex technical initiatives

Nice to have

  • Deep experience with ML serving platforms, inference orchestration, or real-time AI systems
  • Experience supporting high-throughput, low-latency workloads at global scale
  • Strong understanding of ML model lifecycle, observability, and reliability at scale
  • Proven ability to influence technical direction across multiple teams and stakeholders

What the JD emphasized

  • large-scale distributed systems
  • ML serving platforms
  • inference orchestration
  • real-time AI systems
  • high-throughput, low-latency workloads

Other signals

  • ML Serving
  • Inference
  • Generative AI
  • Large-scale distributed systems
  • Real-time ML