Technical Program Manager Iii, ML Strategy and Allocations, Cloud AI

Google Google · Big Tech · Sunnyvale, CA +3

Technical Program Manager III for Google Cloud's AI Strategy and Allocations PMO. This role focuses on resource governance, capacity allocation, and strategic alignment for Google's large-scale ML fleet, ensuring critical AI products have the necessary compute resources. Responsibilities include architecting data reconciliation frameworks, auditing discrepancies, developing forecasting models, engineering automated dashboards for scenario modeling, and translating fleet capacity into actionable allocation plans.

What you'd actually do

  1. Architect data reconciliation frameworks to unify disparate datasets from multiple systems (e.g., resource symphony, allocation data, and fleet guidance), ensuring a single, high-fidelity source of truth for the ML fleet.
  2. Audit and resolve data discrepancies across cross-functional team inputs to maintain the integrity of fleet capacity and usage metrics.
  3. Develop and maintain complex investigative models that enable real-time, dynamic forecasting of ML fleet supply and demand, accounting for hardware transitions and shifting deployment timelines.
  4. Engineer automated dashboards that provide leadership with what-if scenario modeling to visualize the impact of divestments or increased GPU demand on the total fleet.
  5. Translate fleet capacity into actionable allocation plans for Product Areas (PAs), ensuring resource distribution aligns with Google’s ML priorities and commitments.

Skills

Required

  • program management
  • SQL data pipelines
  • predictive dashboards
  • Machine Learning Financial Planning and Analysis (ML FP&A)

Nice to have

  • managing cross-functional or cross-team projects
  • business and economic climate of AI/ML technologies
  • hardware transitions (TPU/GPU)
  • architect automated data reconciliation frameworks

What the JD emphasized

  • ML fleet capacity
  • AI priorities
  • ML compute