Technical Program Manager – Adversarial Model Research

OpenAI OpenAI · AI Frontier · San Francisco, CA · Research

This role focuses on testing the safety and robustness of AI models through evaluations, red-teaming, and identifying failure modes. It involves leading programs to understand model behaviors, translating risks into research plans, and collaborating with research and engineering teams to integrate findings into model development and deployment cycles. The goal is to strengthen model reliability and public trust.

What you'd actually do

  1. Lead programs that explore unexpected model behaviors and identify failure modes.
  2. Translate vague or emergent risk signals into clear priorities and actionable research plans.
  3. Design and run creative evaluations, experiments, and red-teaming campaigns.
  4. Collaborate with research, product, and deployment teams to integrate findings into model training and deployment cycles.
  5. Develop repeatable systems for tracking model performance and understanding emerging behavior patterns.

Skills

Required

  • Technical program management
  • Organizational skills
  • Communication skills
  • Familiarity with large language models
  • Prompt engineering
  • Model evaluation techniques
  • Managing fast-paced, high-uncertainty projects
  • Creative and resourceful testing methods
  • Coordinating technical and non-technical stakeholders

What the JD emphasized

  • technical program management
  • large language models
  • prompt engineering
  • model evaluation techniques
  • high-uncertainty projects
  • testing model behavior and performance

Other signals

  • evaluations
  • vulnerabilities
  • model reliability
  • public trust
  • safety
  • robustness
  • research programs
  • model training
  • deployment cycles
  • model performance
  • emerging behavior patterns