AI Models, Product Manager

Cerebras · Semiconductors · Headquarters +1 · Product Management

Product Manager for AI Models at Cerebras, focusing on defining and launching the strategic model portfolio for their wafer-scale AI inference platform. Responsibilities include roadmap ownership, partnerships with AI labs and open-source communities, defining quality standards, leading go-to-market strategies, and making technical decisions on performance optimizations. The role requires strong product management experience, technical knowledge of AI models and inference, and cross-functional leadership.

What you'd actually do

  1. Own the models roadmap: decide which frontier and open-source models we support based on market demand, research trends, and strategic fit
  2. Define and enforce quality standards across our model catalog through systematic evaluation frameworks
  3. Lead high-impact model launches that generate buzz and adoption
  4. Select and prioritize performance optimizations (quantization, speculative decoding, etc.) based on customer needs and hardware capabilities
  5. Orchestrate launches across model enablement, optimization engineering, deployment, sales, and marketing

Skills

Required

  • Product management
  • Technical work experience
  • Fast-paced environment adaptability
  • Open-source models knowledge
  • Generative AI research knowledge
  • Community model ecosystem knowledge (PyTorch, Hugging Face, vLLM, SGLang)
  • Python
  • Chat completions API
  • Model testing

Nice to have

  • Product manager experience at a model training lab
  • Experience implementing open-source models
  • Solution engineering experience
  • Technical marketing asset creation
  • Social media content creation
  • Cross-functional leadership experience
  • Model quality evaluation experience
  • System prompt harness experience
  • Application code writing (code generation, deep research search)
  • Agentic flows expertise
  • LLM model family architecture expertise
  • Model compiler understanding
  • Model optimization understanding
  • Community contributor (vLLM, SGLang, PyTorch, Hugging Face transformers)
  • Model optimization methods (quantization, compression)

What the JD emphasized

  • 5+ years of experience as a product manager, currently at or above the level of Senior PM
  • 5+ years of total technical work experience
  • Knowledge and passion for the worlds of open-source models and generative AI research
  • Knowledge of the community model ecosystem, including: PyTorch, Hugging Face, vLLM, and SGLang
  • Experience writing model quality evaluations and system prompt harnesses
  • Expertise on agentic flows and current LLM model family architectures
  • Understanding of model compilers and optimization
  • Experience with model optimization or compression methods like quantization

Other signals

  • leading AI labs
  • shape the industry
  • exceptional quality at unprecedented speed
  • fastest Generative AI inference solution
  • real-time iteration
  • agentic computation