Research Scientist, Frontier Red Team (emerging Risks)

Anthropic Anthropic · AI Frontier · San Francisco, CA · AI Research & Engineering

Research Scientist focused on understanding and defending against societal risks from advanced AI models, particularly self-improving and autonomous systems. The role involves designing research experiments, building evals, and producing artifacts to communicate model capabilities and inform product/safeguards decisions. Emphasis on emerging risks from AI integration into the economy and society.

What you'd actually do

  1. Design and run research experiments to understand the emerging risks models may create
  2. Produce internal & external artifacts (research, products, demos, dashboards, tools) that communicate the state of model capabilities
  3. Shape product, safeguards, and training decisions based on what you find
  4. Work closely with Societal Impacts (SI) and Safeguards teams

Skills

Required

  • design and run research experiments
  • build evals
  • communicate research findings
  • scope ambiguous research questions

Nice to have

  • foundational infrastructure
  • simple interfaces for non-technical collaborators
  • prioritizing requests from stakeholders

What the JD emphasized

  • societal risks
  • advanced models
  • self-improving, highly autonomous AI systems
  • cyberphysical capabilities
  • emerging risks
  • autonomous AI-powered business
  • red team unsafeguarded models’ abilities to be used for control
  • indicators of models being used to scale movements that rely on social control

Other signals

  • research program
  • evals
  • societal risks
  • advanced models
  • autonomous AI systems