Senior Technical Program Manager, Deep Learning Frameworks

NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

NVIDIA is seeking a Senior Technical Program Manager to lead software programs for Deep Learning Training and Inference Frameworks. The role involves managing programs in a matrixed environment, coordinating with cross-functional teams (hardware, software, product, QA, compliance), and developing planning, reporting, and execution practices. The TPM will work with senior management to ensure alignment and exceptional execution, and will foster collaboration between software, hardware, and infrastructure teams. Key responsibilities include guiding all aspects of program management, defining and implementing metrics, and standardizing processes for open-source contribution and release management within the NVIDIA AI/ML ecosystem.

What you'd actually do

  1. Collaborate with hardware, software, and model program managers, product managers and engineering teams to deliver Deep Learning Frameworks programs on existing and new hardware.
  2. Engage with cross-company teams (hardware/software engineering, product, QA, compliance) and drive alignment on release scope, milestones, risk management, and dependencies.
  3. Guide software programs in all aspects of program management – planning, forecasting, documenting, scheduling, effective meetings, multi-faceted prioritization, management of dependencies, reporting, and effective handling of critical and blocking issues.
  4. Develop and implement metrics for measuring program effectiveness and identifying areas for improvement; collect and analyze data to support planning and data-driven decisions.
  5. Define and implement standard processes for open-source contribution and release management within the NVIDIA AI/ML ecosystem.

Skills

Required

  • Postgraduate degree in Computer Science, Artificial Intelligence, Mathematics or equivalent experience
  • 10+ years of software program management experience
  • Proven track record leading global projects, adaptable to multiple time zones in fast-paced software development environments
  • Proven experience with delivering large software programs, spanning multiple layers of the software stack
  • Ability to think strategically and tactically, and to build consensus that makes programs successful by leading and moderating engagements with engineering and product teams
  • Excellent communication and technical presentation skills, with strong attention to detail
  • Excellent organizational skills
  • Ability to multitask in a dynamic environment with shifting priorities and changing requirements
  • Experience with project management tools (e.g., Jira, Aha!, Confluence)
  • Experience with distributed version control systems (e.g., Git)

Nice to have

  • Experience with Deep Learning Frameworks (PyTorch, JAX, etc.)
  • ML compilers (XLA, Triton, etc.)
  • GPU technology
  • Open-source development
  • Production software development
  • Release management
  • DevOps
  • Driving process improvements
  • Measuring efficiency
  • NVIDIA GPU programming and software stack (CUDA Toolkit, cuDNN, TensorRT, NCCL, etc.)
  • Engineering background

What the JD emphasized

  • 10+ years of software program management experience
  • Proven track record leading global projects
  • Proven experience with delivering large software programs