Staff Software Engineer - AI Platform (… at Uber

What you'd actually do

Define architecture and technical strategy for Uber’s ML serving and inference platforms

Lead cross-team efforts to scale and evolve serving infrastructure for predictive and generative AI workloads

Design systems that balance latency, cost, reliability, and developer productivity

Act as a technical leader and mentor across the ML Platform organization

Drive operational excellence and long-term sustainability of mission-critical ML systems

Skills

Required

BS or MS in Computer Science or a related technical discipline, or equivalent experience
8+ years of full-time engineering experience
Extensive experience designing and operating large-scale distributed systems in production
Deep expertise in backend systems, system architecture, and performance optimization
Strong leadership skills with a track record of driving complex technical initiatives

Nice to have

Deep experience with ML serving platforms, inference orchestration, or real-time AI systems
Experience supporting high-throughput, low-latency workloads at global scale
Strong understanding of ML model lifecycle, observability, and reliability at scale
Proven ability to influence technical direction across multiple teams and stakeholders

About the Role

This role is part of Uber’s ML Serving team within the AI Platform, responsible for defining and evolving the infrastructure that powers real-time ML and generative AI inference at Uber scale. As a Staff Software Engineer, you will set technical direction for ML serving systems, lead cross-team initiatives, and design foundational architectures that support thousands of models in production. Your work will shape Uber’s long-term strategy for scalable, reliable, and efficient ML serving.

For Sunnyvale, CA-based roles: The base salary range for this role is USD$232,000 per year - USD$258,000 per year.

You will be eligible to participate in Uber's bonus program, and may be offered an equity award and other types of compensation. All full-time employees are eligible to participate in a 401(k) plan. You will also be eligible for various benefits. More details can be found at the following link: https://jobs.uber.com/en/benefits.

Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuels progress. What moves us, moves the world - let's move it forward, together.

Uber is proud to be an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you have a disability or special need that requires accommodation, please let us know by completing this form.

Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.