Threat Modeler, Preparedness

OpenAI OpenAI · AI Frontier · San Francisco, CA · Safety Systems

This role focuses on identifying, modeling, and forecasting risks from frontier AI systems, ensuring evaluation frameworks and safeguards are robust. It involves developing threat models for misuse and alignment risks, forecasting future risks, and connecting technical, governance, and policy perspectives on AI safety and preparedness.

What you'd actually do

  1. Develop and maintain comprehensive threat models across all misuse areas (bio, cyber, attack planning, etc.).
  2. Develop plausible and convincing threat models across loss of control, self-improvement, and other possible alignment risks from frontier AI systems
  3. Forecast risks by combining technical foresight, adversarial simulation, and emerging trends.
  4. Pair closely with technical partners on capability evaluations to ensure these map to and cover the gambit of severe risks differentially enabled by frontier AI systems.
  5. Pair closely with Bio and Cyber Leads to size the remaining risk of the designed safeguards and translate threat models into actionable mitigation designs.

Skills

Required

  • Understanding of risks from frontier AI systems
  • Strong grasp of AI alignment literature
  • Deep experience in threat modeling, risk analysis, or adversarial thinking (e.g., security, national security, or safety)
  • Knowledge of how AI evaluations work and ability to connect eval results to capability testing and safeguard sufficiency
  • Ability to work across technical and policy domains to drive rigorous, multidisciplinary risk assessments
  • Clear and compelling communication of complex risks to both technical and non-technical audiences
  • Systems thinking and anticipation of second-order and cascading risks

What the JD emphasized

  • frontier AI models
  • frontier risks
  • frontier AI systems
  • AI alignment literature
  • AI evaluations

Other signals

  • risk assessment
  • evaluations
  • threat modeling
  • AI safety
  • alignment