Senior Product Manager, AI and Compute Infrastructure

Google Google · Big Tech · Kirkland, WA +3

Product Manager for AI and Compute Infrastructure at Google Cloud, focusing on delivering a scalable and reliable AI substrate for the agentic era. This role involves defining the roadmap for GKE Node OS, driving customization strategies, and partnering with engineering on runtime primitives for GPU/TPU workloads to support AI/ML development and deployment.

What you'd actually do

  1. Evolve the GKE Node OS roadmap (COS, Ubuntu, Windows), aligning release cadences with Kubernetes and optimizing for security and reduced downstream toil.
  2. Drive the customization strategy, including GKE Image Builder, balancing customer demands for flexibility (BYONI, non-mainstream OSs, custom kernels) with platform supportability.
  3. Oversee predictable upgrade-in-place strategies to eliminate customer-experienced downtime during capacity constraints.
  4. Guide the transition and technical enablement for new CPU NPIs across GKE Standard and Autopilot, including Day 0 support.
  5. Partner with engineering to deliver runtime primitives for GPU/TPU workloads, including containerd strategic alignment, Pod Snapshotting, and image streaming.

Skills

Required

  • product management
  • Infrastructure as a Service
  • Kubernetes
  • infrastructure platforms
  • microservices
  • agentic architectures
  • Artificial Intelligence or Machine Learning (AI/ML)

Nice to have

  • Master's degree
  • technical presentations
  • software development
  • engineering
  • AI/ML in security

What the JD emphasized

  • AI substrate
  • Agentic era
  • AI/ML in security

Other signals

  • AI substrate
  • AI infrastructure
  • Agentic era
  • TPUs
  • Vertex AI