Research Manager, Interpretability

Anthropic Anthropic · AI Frontier · AI Research & Engineering

Manager for the Interpretability team focused on mechanistic interpretability of large language models, aiming to understand how they work internally for AI safety.

What you'd actually do

  1. Partner with a research lead on direction, project planning and execution, hiring, and people development
  2. Set and maintain a high bar for execution speed and quality, including identifying improvements to processes that help the team operate effectively
  3. Coach and support team members to have more impact and develop in their careers
  4. Drive the team's recruiting efforts, including hiring planning, process improvements, and sourcing and closing
  5. Help identify and support opportunities for collaboration with other teams across Anthropic

Skills

Required

  • management experience
  • research direction
  • project planning
  • people development
  • recruiting
  • collaboration

Nice to have

  • experience with large language models
  • understanding of neural networks

What the JD emphasized

  • mechanistic interpretability
  • AI safety

Other signals

  • mechanistic interpretability
  • AI safety
  • understanding neural networks