Hardware Operations Technical Program Manager

OpenAI OpenAI · AI Frontier · San Francisco, CA · Scaling

OpenAI's Stargate team is seeking a Hardware Operations Technical Program Manager to manage the lifecycle of AI infrastructure hardware programs. This role involves driving cross-functional execution across hardware readiness, supplier coordination, deployment, integration, manufacturing, logistics, and operational handoff. The ideal candidate will understand hardware systems, identify operational blockers, drive accountability, and create scalable processes for high-volume infrastructure deployment, supporting large-scale AI systems.

What you'd actually do

  1. Drive end-to-end Hardware Operations readiness programs across AI infrastructure systems, including servers, racks, networking hardware, power and cooling interfaces, and related data center infrastructure.
  2. Develop and operationalize scalable hardware operations processes, workflows, and support models spanning deployment, repair operations, diagnostics, break/fix, escalation management, and sustaining operations.
  3. Lead cross-functional execution of Hardware Operations readiness initiatives, ensuring operational capabilities, tooling, documentation, staffing models, and workflows are established prior to production deployment and operational handoff.
  4. Partner across Hardware Engineering, Manufacturing, Supply Chain, Data Center Operations, Network Operations, Deployment, Reliability Engineering, and external suppliers to ensure alignment on operational requirements, supportability, and readiness milestones.
  5. Develop operational scorecards, reporting frameworks, and metric algorithms to measure hardware operational health, repair performance, deployment quality, readiness status, and execution efficiency.

Skills

Required

  • 7+ years of experience in technical program management, hardware operations, manufacturing operations, infrastructure deployment, or related technical execution roles.
  • Experience supporting hardware systems at scale, ideally including servers, racks, networking hardware, data center infrastructure, or high-performance compute environments.
  • Strong understanding of hardware development and deployment lifecycle, including NPI, qualification, manufacturing ramp, logistics, installation, validation, and operational support.
  • Demonstrated ability to manage complex cross-functional schedules, dependencies, risks, and executive communications.
  • Strong technical fluency across hardware systems, rack integration, manufacturing readiness, and infrastructure deployment.
  • Proven ability to operate in ambiguous environments and create scalable execution mechanisms.
  • Excellent written and verbal communication skills, with the ability to influence technical and non-technical stakeholders.

Nice to have

  • Experience with AI infrastructure, hyperscale data centers, cloud infrastructure, or high-density compute systems is a plus.
  • Experience with GPU, accelerator, server, rack, or cluster-scale infrastructure programs.
  • Background in hardware operations, manufacturing program management, supply chain operations, data center deployment, or technical infrastructure TPM.
  • Familiarity with rack integration, power/cooling constraints, cabling, networking, serviceability, and deployment readiness.
  • Experience building operating rhythms, readiness reviews, risk registers, launch dashboards, or executive presentations.

What the JD emphasized

  • AI infrastructure
  • hardware operations
  • technical program management
  • cross-functional execution
  • scalable processes
  • high-volume infrastructure deployment