Engineering Manager, Safeguards Review Tooling

Anthropic · AI Frontier · San Francisco, CA · Safeguards (Trust & Safety)

Engineering Manager for Anthropic's Safeguards Review Tooling team, focusing on building and scaling systems for AI safety investigation and enforcement. This role involves leading a team to develop tooling that supports human reviewers and integrates AI (Claude) for automation, with a strong emphasis on privacy, analytics, and a sandbox environment for rapid iteration.

What you'd actually do

  1. Lead, grow, and develop a team of engineers building investigation, review, and enforcement tooling for both first-party and third-party platform surfaces
  2. Define the vision and roadmap for our review tooling platform, including analytics, privacy-compatible data access primitives, and a sandbox for rapidly developing new review interfaces
  3. Drive the team's strategy for scaling review through automation, including enabling reviewers to use Claude effectively and building toward Claude-assisted and Claude-driven review workflows
  4. Partner with policy, operations, legal, privacy, and data science stakeholders to translate enforcement and investigation needs into reliable, well-designed systems
  5. Ensure review tooling evolves alongside new privacy primitives and data retention commitments, so reviewers can do their work without compromising user trust

Skills

Required

  • Experience managing software engineering teams, including hiring, coaching, and developing engineers
  • A technical background in full-stack or platform engineering, with the ability to engage deeply in architecture and design discussions
  • Experience shipping internal tools or platforms with demanding operational users, and a track record of improving their workflows measurably
  • Experience working cross-functionally with non-engineering partners such as operations, policy, or legal teams
  • Excellent communication skills, including the ability to explain technical tradeoffs to non-technical stakeholders
  • Genuine care about the societal impacts of AI and a desire for your work to make powerful systems safer

Nice to have

  • 4+ years of management experience, 10+ years of industry software engineering experience
  • Experience building trust and safety, integrity, fraud, or abuse-prevention tooling, or other systems supporting human review at scale
  • Experience designing systems under strict privacy, compliance, or data governance constraints, such as zero data retention environments
  • Experience integrating LLMs or agentic systems into operational workflows, or building human-in-the-loop automation
  • Experience building developer platforms or extensible tooling frameworks that other teams build on top of
  • Experience supporting enforcement or moderation systems across multiple product surfaces, including enterprise or cloud platform contexts

What the JD emphasized

  • building systems where Claude meaningfully extends what human reviewers can do
  • scaling review through automation
  • building toward Claude-assisted and Claude-driven review workflows
  • privacy-preserving primitives
  • privacy, compliance, or data governance constraints
  • zero data retention environments
