Anthropic Fellows Program — AI Security

Anthropic Anthropic · AI Frontier · BC +3 · Remote · AI Research & Engineering

This is a research fellowship program focused on AI safety and security, aiming to produce public outputs like paper submissions. Fellows will use external infrastructure and open-source models, working on empirical projects with mentorship from Anthropic researchers.

What you'd actually do

  1. 4 months of full-time research
  2. Direct mentorship from Anthropic researchers
  3. Access to a shared workspace (in either Berkeley, California or London, UK)
  4. Connection to the broader AI safety and security research community
  5. Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (these vary by country)

Skills

Required

  • Fluent in Python programming
  • Available to work full-time on the Fellows program

Nice to have

  • Strong technical background in computer science, mathematics, or physics
  • Strong background in a discipline relevant to a specific Fellows workstream (e.g. economics, social sciences, or cybersecurity)
  • Experience in areas of research or engineering related to their workstream
  • contributed to open-source projects in LLM- or security-adjacent repositories
  • demonstrated success in bringing clarity and ownership to ambiguous technical problems
  • experience with pentesting, vulnerabilit

What the JD emphasized

  • public output
  • paper submission
  • Fluent in Python programming
  • Available to work full-time on the Fellows program
  • AI Security Fellows
  • reducing catastrophic risks from advanced AI systems
  • bringing clarity and ownership to ambiguous technical problems
  • pentesting
  • vulnerabilit

Other signals

  • AI safety
  • AI security
  • research project
  • paper submission