Senior Product Manager, AI Frameworks

NVIDIA · Semiconductors · Santa Clara, CA

Product Manager for AI Frameworks at NVIDIA, focusing on Recommender Systems and Generative Recommendation Models. The role involves building products for frontier RecSys and Generative Recommendation Models on NVIDIA systems, enabling researchers and operators, and pushing the boundaries of what is possible in research-to-production. Responsibilities include creating and optimizing pre-training/inference and post-training frameworks, developing product strategy, roadmaps, and go-to-market plans, and collaborating with internal and external customers. Requires experience with training/inference, post-training, and optimization software; GenAI/ML concepts; large-scale distributed systems; and technical product management.

What you'd actually do

  1. Create and optimize pre-training/inference and post-training frameworks for RecSys and Generative Recommender researchers and production model builders
  2. Develop product strategy, roadmaps, and go-to-market plans
  3. Collaborate with internal and external customers to build product-based roadmaps for the E2E ML lifecycle
  4. Work with leadership to align with and drive company strategy

Skills

Required

  • Experience with design and scaling of training/inference, post-training, and optimization software (Torchtitan, FSDP)
  • Demonstrable knowledge of GenAI or machine learning concepts, particularly around model training, performance optimization, inference, and software development and delivery
  • Experience with large-scale distributed systems
  • BS or MS degree in Computer Science, Computer Engineering, or a similar field (or equivalent experience)
  • 10+ years of technical product management or similar experience at a technology company
  • Strong communication and interpersonal skills

Nice to have

  • Experience leading Generative Recommender (GR) systems (e.g., GEM, TIGER)
  • Experience working on open-source, GitHub-first developer products with deep customer interactions
  • Knowledge of GPU architecture, HW/SW co-design, and performance profiling

What the JD emphasized

  • Experience with design and scaling of training/inference, post-training, and optimization software (Torchtitan, FSDP)
  • Demonstrable knowledge of GenAI or machine learning concepts, particularly around model training, performance optimization, inference, and software development and delivery
  • Experience with large-scale distributed systems
  • 10+ years of technical product management or similar experience at a technology company

Other signals

  • enabling researchers and operators
  • push the boundaries of what is possible
  • Generative Recommender models are gaining momentum
  • frontier model builders
  • push the bounds of scale
  • training/post training landscape
  • deep learning across all GPU use cases