Research Product Manager, Model Behaviors

Anthropic Anthropic · AI Frontier · San Francisco, CA · Product Management, Support, & Operations

Product Manager for Model Behaviors at Anthropic, focusing on defining and shaping Claude's character and behaviors by partnering with the Alignment Finetuning team. The role involves identifying behavioral improvements, coordinating across teams, and translating research breakthroughs into product enhancements to ship well-aligned models.

What you'd actually do

  1. Define behavioral defaults and steerability constraints
  2. Develop and maintain taxonomies of model behaviors across capabilities
  3. Identify, triage, and prioritize behavior issues and opportunities, coordinating input from Users, Research, Product, and Safeguards teams
  4. Amplify alignment research breakthroughs, translating them into product, process, and model improvements
  5. Deeply understand user interaction patterns to identify behavior improvements that make Claude more helpful and safe

Skills

Required

  • Product management
  • Conversational AI products
  • First-principles thinking
  • User empathy
  • Judgment
  • ML concepts
  • Intellectual curiosity
  • Creative problem-solving

Nice to have

  • AI and LLMs passion
  • Hacker spirit

What the JD emphasized

  • 5+ years in product management leading scaled conversational AI products
  • track record of delivering products and features to end-users
  • strong user empathy
  • strong judgment and model taste
  • strong grasp of ML concepts

Other signals

  • Define behavioral defaults and steerability constraints
  • Develop and maintain taxonomies of model behaviors across capabilities
  • Identify, triage, and prioritize behavior issues and opportunities
  • Amplify alignment research breakthroughs, translating them into product, process, and model improvements
  • Contribute to evals that measure alignment progress
  • Identify and scale initiatives and tools that help researchers ship alignment improvements faster