Offensive Security Research Engineer, Safeguards

Anthropic Anthropic · AI Frontier · San Francisco, CA · Safeguards (Trust & Safety)

This role focuses on offensive security research to identify and mitigate risks associated with AI systems, specifically how adversaries might misuse LLMs to cause harm. The engineer will research potential vulnerabilities, develop defensive strategies, and work with a senior team to implement a security plan.

What you'd actually do

  1. Triage any vulnerabilities discovered, coordinate and assist the external and open-source community in remediation
  2. Write scaffolds designed to automate typical traditional attack techniques to help clarify our defensive problem selection
  3. Research how adversaries might mise-use LLMs to identify and exploit vulnerabilities at scale in the future
  4. Develop promising defensive strategies that could mitigate the ability of adversaries to mis-use models in harmful ways
  5. Work with a small, senior team of engineers and researchers to enact a forward-looking security plan

Skills

Required

  • pentesting
  • vulnerability research
  • offensive security experience
  • reverse engineering
  • network security
  • exploitation
  • physical security
  • software engineering
  • ambiguous technical problems
  • cross-functional security initiatives

Nice to have

  • Published research papers on computer security, language modeling, or related topics
  • given talks at Defcon, Blackhat, CCC, or related venues
  • Familiarity with large language models
  • written agent scaffolds
  • Reported CVEs
  • awarded for bug bounty vulnerabilities
  • Contributed to open-source projects in LLM- or security-adjacent repositories

What the JD emphasized

  • vulnerability research
  • offensive security experience
  • exploitation
  • remediation
  • adversaries might mise-use LLMs
  • defensive strategies

Other signals

  • vulnerability research
  • offensive security
  • LLM misuse
  • defensive strategies