Sr. Staff Engineer (conversational/voice Ai)

Uber Uber · Consumer · Sunnyvale, CA +1 · Engineering

Senior Staff Engineer to architect, productionize, and scale an autonomous support agent for Uber's Customer Obsession team, focusing on mobile, web, and voice channels. The role involves end-to-end agent architecture, shipping production systems with high reliability, leading voice agent initiatives, advancing retrieval and reasoning pipelines, establishing evaluation frameworks, driving automation, and mentoring engineers.

What you'd actually do

  1. Own the end‑to‑end agent architecture: agentic planning and execution loops, long-term memory, persona/voice, knowledge routing, and policy enforcement for compliant, on‑brand conversations.
  2. Ship production systems that handle millions of conversations with rigorous SLOs, fallbacks, and canaries; design graceful degradation (e.g., human handoff) and safety guardrails (prompt‑injection, jailbreak, PII redaction).
  3. Lead voice agent initiatives: Drive the development of Uber’s voice support agent—covering real-time speech recognition (ASR), text-to-speech, natural turn-taking (barge-in and endpointing), and reliable telephony/WebRTC integration. Ensure low-latency, high-quality interactions that remain robust even in noisy environments.
  4. Advance retrieval & reasoning: Build next-generation retrieval and reasoning pipelines, where the agent can search across different knowledge sources, apply policy-driven tools, and call structured workflows and ensure that responses are consistently grounded.
  5. Establish evals that matter: offline rubrics, simulated scenarios, safety tests, cost/latency tradeoff suites, and LLM‑as‑judge (with calibrated human review) wired into CI/CD and experiment platforms.

Skills

Required

  • Python
  • Go/Java/C++
  • microservices
  • gRPC/HTTP
  • cloud infra
  • containers
  • CI/CD
  • real-time telemetry/observability
  • SLO/SLA design
  • incident response
  • on-call leadership
  • postmortems
  • A/B testing
  • metrics literacy

Nice to have

  • voice agents
  • agentic architectures
  • ASR/TTS streaming
  • barge-in
  • endpointing
  • telephony
  • WebRTC
  • conversational quality/NLP evaluation
  • RAG
  • support automation
  • multilingual NLU/NLG
  • hallucination mitigation
  • safety red-teaming
  • privacy-by-design
  • experiment frameworks
  • feature flags
  • canary/guarded rollouts
  • kill-switches

What the JD emphasized

  • end-to-end agent architecture
  • production systems
  • voice agents
  • agentic architectures
  • LLM orchestration
  • evaluation
  • safety guardrails
  • real-time voice
  • low-latency
  • high-quality
  • SLOs
  • fallbacks
  • canaries
  • graceful degradation
  • human handoff
  • prompt-injection
  • jailbreak
  • PII redaction
  • real-time speech recognition (ASR)
  • text-to-speech
  • natural turn-taking
  • barge-in
  • endpointing
  • reliable telephony/WebRTC integration
  • retrieval & reasoning
  • knowledge sources
  • policy-driven tools
  • structured workflows
  • LLM‑as‑judge
  • CI/CD
  • experiment platforms
  • cost per contact
  • high-availability
  • customer-facing

Other signals

  • autonomous support agent
  • LLM orchestration
  • voice agents
  • production systems
  • SLOs
  • safety guardrails