Product Manager, Accelerated Kubernetes Infrastructure

NVIDIA · Semiconductors · Santa Clara, CA +2 · Remote

This Product Manager role covers NVIDIA's Accelerated Kubernetes Infrastructure (NKE) and focuses on defining and delivering Kubernetes as a service. It requires deep expertise in Kubernetes internals, infrastructure, and platform engineering, with a track record of shipping enterprise-grade platform products. The PM will own the NKE product surface, define the Kubernetes distribution strategy, drive upstream alignment, and develop tooling for cluster management and operations.

What you'd actually do

  1. Own the NKE product surface: control plane lifecycle management, API server availability, component upgrades, and cluster provisioning and teardown
  2. Define our Kubernetes distribution strategy: packaging, conformance, version policy, and release cadence for NVIDIA-managed and on-premises environments
  3. Drive upstream Kubernetes alignment: feature adoption, contribution strategy, and release tracking that keeps us current without introducing instability
  4. Own developer and operator tooling for cluster management, diagnostics, and day-2 operations across environments
  5. Define and publish tooling that enables on-premises customers and partners to deploy, run, and upgrade NVIDIA Kubernetes clusters independently
  6. Drive service reliability, upgrade safety, and multi-tenant isolation at the provider layer

Skills

Required

  • 8+ years of product management experience in Kubernetes infrastructure, Kubernetes services, or platform engineering
  • Deep understanding of Kubernetes internals: control plane architecture, etcd, scheduling, networking, storage integration, and upgrade mechanics
  • Experience shipping a Kubernetes distribution, K8s service, or enterprise platform product
  • Track record of leading upstream open source alignment alongside production delivery constraints
  • Experience with on-premises and hybrid deployment models, not just public cloud

Nice to have

  • Building or operating EKS, AKS, GKE, OpenShift, Rancher, or similar K8s platforms
  • Hands-on experience with NVIDIA GPU infrastructure, DGX systems, or GPU-aware K8s scheduling
  • Shipping Kubernetes tooling used by operators in production (cluster management, diagnostics, lifecycle automation)
  • K8s conformance certification, CIS benchmarks, or security hardening for enterprise or government environments
  • Contributions to upstream Kubernetes or CNCF ecosystem projects

What the JD emphasized

  • Deep Kubernetes infrastructure experience
  • Deep understanding of Kubernetes internals
  • Experience shipping a Kubernetes distribution, K8s service, or enterprise platform product
  • Track record of leading upstream open source alignment alongside production delivery constraints
  • Experience with on-premises and hybrid deployment models, not just public cloud