Sr Software Engineer - Core Infrastructure

Uber · Consumer · Seattle, WA +2 · Engineering

This role is for a Senior Software Engineer on Uber's Core Infrastructure team, focusing on building and scaling foundational platform services for Compute, Networking, and Foundations. The engineer will design and maintain highly available, mission-critical systems, drive efficiency and optimization, enable multi-cloud resilience, and automate operational tasks using AIOps and agentic automation tools. The role also involves collaborating with other infrastructure teams and mentoring junior engineers. While the role mentions scaling for AI/ML workloads and AIOps/agentic automation, its core focus is on general infrastructure engineering, not AI/ML model development or research.

What you'd actually do

  1. Design & Scale Core Services: Build and maintain the next generation of infrastructure services (Compute, Networking, or Tooling) capable of supporting XM+ sustained concurrent trips.
  2. Drive Efficiency & Optimization: Contribute to large-scale initiatives to improve resource utilization (targeting 40%+ fleet utilization) and optimize unit costs across multi-cloud and on-prem environments.
  3. Enable Multi-Cloud Resilience: Develop features that support Uber’s "Thrive in Cloud" strategy, ensuring seamless failover and active-active reliability across multiple cloud providers.
  4. Automate the Lifecycle: Build "AIOps" and agentic automation tools to reduce manual operational load, aiming to automate incident response and alert triaging.
  5. Collaborate Across Infra: Work cross-functionally with Storage, Data, and Product teams to ensure infrastructure is a seamless enabler for business growth (e.g., scaling for AI/ML workloads).

Skills

Required

  • 5+ years of software engineering experience
  • backend services, infrastructure, or distributed systems
  • Go, Java, or C++
  • building, deploying, and operating production services that handle high throughput and require 99.99% availability
  • debugging complex issues across a distributed stack
  • technical writing skills for RFCs and design documents

Nice to have

  • Kubernetes, Envoy, or Prometheus
  • performance tuning, capacity planning, or cost-optimization of large-scale clusters
  • modern deployment patterns (e.g., Starship, GitOps)
  • architecting services that run across on-prem data centers and public clouds (AWS/GCP)
  • Zero-Trust architecture, mTLS, and building ransomware-resilient systems
  • leading technical projects from conception to launch

What the JD emphasized

  • 5+ years of software engineering experience
  • Go, Java, or C++
  • 99.99% availability
  • debugging complex issues across a distributed stack
  • Kubernetes, Envoy, or Prometheus
  • architecting services that run across on-prem data centers and public clouds (AWS/GCP)