Product Manager, Safeguards Rare Harms

Anthropic Anthropic · AI Frontier · San Francisco, CA · Product Management, Support, & Operations

Product Manager for Anthropic's Safeguards team, focusing on building and deploying systems to ensure AI safety and prevent misuse. This role involves ideation, design, development, and UX for safeguards, working closely with research and product teams to mitigate risks associated with frontier models across various platforms.

What you'd actually do

  1. Determine how to build in safety by design upstream and leverage downstream defenses for Anthropic’s frontier models, AI products, customers on different surfaces - Claude.ai, 1P API, external Cloud providers.
  2. Ability to write safety evals and communicate externally about safety.
  3. Drive impact via ruthless prioritization by clearly defining problems, solution options forward, clarity on both business & technical tradeoffs and accordingly clear requirements toward MVP vs. ideal state.
  4. Align & collaborate with policy, enforcement, research, engineering and cross functional stakeholders.
  5. Understand the AI landscape and ecosystem to plan for mitigation of deployment risks of increasingly powerful models and determined adversaries.

Skills

Required

  • 5+ years in product management
  • fast problem understanding
  • building roadmaps with tractable progress
  • ability to get into the details on data, detection & interventions, infrastructure & tools, and/or evals
  • Ability to make technical tradeoff decisions
  • experience working across policy experts, AI/ML research engineers and software engineering teams to design and build state of the art safety systems
  • Strong user understanding of how our products are used, their Safeguards concerns and how we provide the best solutions
  • Demonstrated ability to build product and engineering strategy across multiple cross-functional teams for a rapidly changing space
  • Demonstrated experience in designing and building metrics to evaluate risks, system performance, user impact and making crisp tradeoffs
  • Very strong ability to navigate, and prioritize amidst rapidly changing product specs, and to flex into different domains to bring clarity and execute
  • Evidence of exercising judgment and decision making in ambiguous situations
  • Planning, building, launching and measuring new products / systems in a zero to one environment
  • Ability to clearly articulate complex technical concepts to non-technical audiences in written and verbal communication
  • Think creatively about the risks and benefits of new technologies, and think beyond past checklists and playbooks

Nice to have

  • writing safety evals
  • communicate externally about safety

What the JD emphasized

  • deep technical expertise in development, deployment and measurement of Safeguards systems
  • Ability to make technical tradeoff decisions
  • designing and building metrics to evaluate risks, system performance, user impact and making crisp tradeoffs
  • Planning, building, launching and measuring new products / systems in a zero to one environment.

Other signals

  • Safeguards team builds protections for new AI features
  • protects new products and surfaces
  • develop detections, evals, interventions, and tools to measure and mitigate deployment and user risks
  • ensure we are advancing frontier models safely to users