[Expression of Interest] Research Manager, Interpretability

Anthropic · AI Frontier · San Francisco, CA · AI Research & Engineering

Research Manager for the Interpretability team, focusing on mechanistic interpretability to understand how large language models work internally and to ensure AI safety. The role involves partnering with a research lead on direction, project planning, execution, hiring, and people development; translating research ideas into tangible goals; and overseeing their execution. This is a management role, distinct from individual contributor research scientist or engineer roles.

What you'd actually do

  1. Partner with a research lead on direction, project planning and execution, hiring, and people development
  2. Set and maint

Skills

Required

  • people management
  • project planning
  • hiring
  • career development
  • performance management
  • cross-functional collaboration

Nice to have

  • background in interpretability research
  • understanding of large language models

What the JD emphasized

  • mechanistic interpretability
  • AI safety
  • understanding neural networks
