Senior Staff Uber Technical Lead, Observability Intelligence

Google Google · Big Tech · New York, NY +1

Senior Staff Uber Technical Lead for Observability Intelligence, driving the AI-driven paradigm shift in SRE incident response by evolving Google's monitoring systems into a cohesive observability ecosystem. This role involves leading large-scale ML infrastructure optimization, defining strategy, and collaborating across teams to ensure technical consistency and interoperability.

What you'd actually do

  1. Drive technical project strategy, lead large-scale ML infrastructure optimization, and oversee the design and implementation of solutions across multiple specialized ML areas.
  2. Define and socialize a cohesive "Observability Intelligence" strategy that aligns with the broader Monitoring Northstar, ensuring we build shared technical concerns once and solve them for the entire organization.
  3. Represent the Observability Intelligence organization in high-stakes technical reviews and collaborate across organizational boundaries (AlertManager, AI Operations, Incident Response Management, and Site Reliability Engineering teams across all Product Areas) to drive consensus on critical observability standards.
  4. Act as the primary technical partner to Product Management, translating broad product "Whats" into scalable architectural "Hows."
  5. Lead high-level design reviews that ensure technical consistency across the stack, prioritizing interoperability, reusability, and semantic cohesion.

Skills

Required

  • Bachelor’s degree or equivalent practical experience.
  • 8 years of experience in software development.
  • 5 years of experience with design and architecture; and testing/launching software products.
  • Experience integrating generative AI tools or LLM interfaces into workflows.

Nice to have

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
  • 8 years of experience with data structures and algorithms.
  • 5 years of experience in a technical leadership role leading project teams and setting technical direction.
  • 3 years of experience working in a complex, matrixed organization involving cross-functional, or cross-business projects.
  • Familiarity with and interest in the current AI landscape (Large Language Model (LLMs), generative agents, etc).

What the JD emphasized

  • integrating generative AI tools or LLM interfaces into workflows
  • large-scale ML infrastructure optimization
  • technical leadership role leading project teams and setting technical direction

Other signals

  • driving initiatives to pivot SRE incident response toward an AI-driven paradigm
  • integrating generative AI tools or LLM interfaces into workflows
  • leader with a proven history of managing business-critical domains
  • architectural trade-offs between urgent product requirements and long-term technical durability
  • large-scale ML infrastructure optimization
  • solutions across multiple specialized ML areas
  • Observability Intelligence strategy
  • AI Operations
  • technical partner to Product Management
  • technical consistency across the stack
  • interoperability, reusability, and semantic cohesion