Technical Program Manager - AI Infrastructure (Ads Ranking)

Meta · Big Tech · Sunnyvale, CA +2

Technical Program Manager (TPM) to lead complex, large-scale programs advancing AI infrastructure and platforms for ad ranking. Focus on optimizing the ML development lifecycle, system reliability, hardware efficiency, and performance at scale. Drive innovation by establishing frameworks for next-generation AI hardware and ML platforms.

What you'd actually do

  1. Orchestrate and align mission-critical cross-functional teams to deliver against ambitious ML/AI infrastructure objectives, ensuring absolute clarity and executive accountability
  2. Forge strategic partnerships with senior engineering, hardware, and research leadership to define long-term roadmaps for platform scaling and infrastructure prioritization
  3. Drive the technical execution for the integration of next-generation AI accelerators (GPUs, ASICs) and distributed machine learning systems
  4. Design and implement high-level communication frameworks to synthesize program health, risk profiles, and strategic shifts for executive leadership and global stakeholders
  5. Mitigate complex cross-functional dependencies and systemic risks by dynamically re-engineering scope and resources to maintain project momentum and on-time delivery

Skills

Required

  • 12+ years of leadership experience in software, hardware, or systems engineering and technical program management
  • Deep expertise in ML infrastructure, monetization platforms, or large-scale recommendation systems
  • Proven track record of delivering complex ML/AI technology programs from architectural inception through production deployment at scale
  • Exceptional ability to transform ambiguous technical challenges into actionable, high-impact strategies
  • Demonstrated analytical and problem-solving skills for large-scale ML/AI systems
  • Strong executive presence with a proven ability to synthesize deep technical complexity into clear strategic narratives for leadership
  • Effective communication skills, with experience influencing executive leadership and technical management teams
  • Hands-on understanding of large language models, machine learning, and scaling distributed systems, specifically within the context of high-stakes advertising auction dynamics and revenue-critical infrastructure
  • Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)

Nice to have

  • Prompt/context engineering
  • Agent orchestration
  • Staying current with emerging AI technologies
  • Integrating AI tools to optimize/redesign workflows

What the JD emphasized

  • Proven track record of delivering complex ML/AI technology programs from architectural inception through production deployment at scale
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)

Other signals

  • large-scale production deployment
  • ML infrastructure
  • AI hardware
  • ML platforms
  • scaling distributed systems