Research Lead, Training Insights

Anthropic Anthropic · AI Frontier · United States · Remote · AI Research & Engineering

Research Lead focused on developing and executing strategies for measuring and characterizing model capabilities across training and deployment. This role involves driving original research into new evaluation methodologies, leading a team, and spanning the full lifecycle of model development, from pretraining to deployment. The work includes creating long-horizon evaluations, measuring emerging capabilities, and understanding their development during RL training and post-training. The role also involves cross-organizational collaboration to map evaluation landscapes and identify gaps, shaping the evaluation narrative for model releases, and contributing to the broader research community.

What you'd actually do

  1. Build new novel and long-horizon evaluations
  2. Develop novel measurement approaches for understanding how model capabilities emerge and evolve during RL training
  3. Lead strategic evaluation coverage across the company
  4. Shape the evaluation narrative for model releases
  5. Lead and mentor a small team of researchers and research engineers, setting research direction and fostering a culture of rigorous, creative research

Skills

Required

  • Designing and running evaluations for large language models or similar complex ML systems
  • Leading technical projects or teams
  • Synthesizing information across multiple teams and workstreams
  • Communicating complex technical findings clearly
  • AI safety

Nice to have

  • Experience building evaluations for long-horizon or agentic tasks
  • Deep familiarity with Reinforcement Learning training dynamics
  • Published research in machine learning evaluation, benchmarking, or related areas
  • Experience with safety evaluation frameworks and red teaming methodologies
  • Background in psychometrics, experimental psychology, or other measurement-focused disciplines
  • Track record of communicating evaluation results to inform high-stakes decisions
  • Experience managing or mentoring researchers and engineers

What the JD emphasized

  • significant experience designing and running evaluations for large language models or similar complex ML systems
  • led technical projects or teams
  • AI safety

Other signals

  • Develop novel measurement approaches for understanding how model capabilities emerge and evolve during RL training
  • Lead strategic evaluation coverage across the company
  • Shape the evaluation narrative for model releases
  • Build new novel and long-horizon evaluations