Research Manager, Production Model Training

Anthropic Anthropic · AI Frontier · AI Research & Engineering

Research Manager for Anthropic's Applied Finetuning team, leading a team to train flagship production models (like Claude.AI) using techniques such as Constitutional AI and RLHF. Responsibilities include managing day-to-day execution, prioritizing work, coaching reports, and contributing technically to the team's efforts in post-training techniques, algorithm implementation, data mix experiments, evaluation design, and pipeline improvement.

What you'd actually do

  1. Lead research and engineering efforts to train production models through post-training techniques
  2. Become familiar with the team’s technical stack enough to make targeted contributions as an individual contributor
  3. Manage day-to-day execution of the team's work
  4. Prioritize the team’s work and manage projects to support fast iteration on research projects and training runs
  5. Coach and support your reports in understanding, and pursuing, their professional growth

Skills

Required

  • management experience
  • machine learning
  • AI
  • technical leadership
  • project management
  • stakeholder management
  • complex technical systems understanding

Nice to have

  • individual contributor technical contributions
  • fast iteration on research projects
  • AI safety

What the JD emphasized

  • lead a team of researchers and research engineers
  • train the flagship models we launch to the public
  • designing and iterating on state-of-the-art finetuning techniques
  • implement new algorithms
  • run experiments on data mixes
  • design evaluations
  • improve our production model finetuning pipeline
  • 3-5 years of management experience
  • background in machine learning, AI, or a related technical field
  • deeply interested in the potential transformative effects of advanced AI systems
  • committed to ensuring their safe development
  • building strong relationships with stakeholders
  • quick learner, capable of understanding and contributing to discussions on complex technical topics
  • experience managing teams through periods of rapid growth and change
  • comfortable working in a fast-paced, research-driven environment where priorities may shift quickly
  • quick study: this team sits at the intersection of a large number of different complex technical systems that you’ll need to understand (at a high level of abstraction) to be effective

Other signals

  • leading a team of researchers and research engineers
  • designing and iterating on state-of-the-art finetuning techniques
  • implementing new algorithms
  • running experiments on data mixes
  • designing evaluations
  • improving production model finetuning pipeline