Senior Technical Program Manager - Core AI Platform

Microsoft · Big Tech · Redmond, WA +2 · Technical Program Management

The Senior Technical Program Manager will define product strategy, roadmap, and success metrics for platform capabilities that increase overall fleet efficiency, while delivering cost-effective, high-performance solutions for customers running AI workloads. This role is in the CoreAI Infrastructure team, which powers the Microsoft AI Foundry Services, managing the GPU fleet to run Microsoft’s AI services at planet scale. The team focuses on fleet efficiency, reliability, and the agility to bring the latest AI innovations from research into production.

What you'd actually do

  1. Define product strategy, roadmap, and success metrics for platform capabilities that increase overall fleet efficiency, while delivering cost-effective, high-performance solutions for customers running AI workloads.
  2. Define and track efficiency metrics, manage dependencies, and lead experimentation to validate hypotheses and drive optimizations.
  3. Partner with engineering, data science, finance, and partner teams; manage dependencies and unblock execution in ambiguous environments.
  4. Build crisp narratives and dashboards that support decision-making and keep stakeholders aligned on progress, tradeoffs, and outcomes.
  5. Translate customer needs and platform constraints into clear requirements and iterative delivery plans.

Skills

Required

  • Bachelor's Degree AND 4+ years of experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience.
  • 2+ years of experience managing cross-functional and/or cross-team projects.

Nice to have

  • 5+ years of experience designing and shipping complex products for developers, ML professionals, or similar audiences
  • 3+ years of experience with distributed platform ecosystems (e.g., multi‑tenant systems, real‑time processing, batch computing)
  • Proven experience in technical program management, preferably supporting AI infrastructure or cloud services
  • Strong understanding of GPUs, virtual machines, operating systems, and cloud infrastructure fundamentals
  • Experience navigating ambiguity and driving clarity across complex, cross‑functional initiatives
  • Experience building solutions using Azure, AWS, or Google Cloud
  • Experience writing Python, including for machine learning workloads
  • Experience with machine learning platforms
  • Experience driving complex, multi‑stakeholder processes and cross‑team programs
  • Ability to build effective relationships, influence, and collaborate at all organizational levels
  • Strong verbal and written communication skills for a global audience
  • Strong business acumen with an analytical, detail‑oriented approach
  • Experience leading cross‑functional teams and managing complex dependencies
  • Ability to thrive in fast‑paced environments with a bias for action

What the JD emphasized

  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role.
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.