Agent Post-training, Frontier Evals and Environments Research

OpenAI OpenAI · AI Frontier · San Francisco, CA · Research

OpenAI is seeking a researcher for their Agent Post-Training team, focusing on creating frontier agents. This role involves building ambitious RL environments, developing methodologies for exploring model behavior, and steering training for large runs. The goal is to guide research programs and ensure safe AGI/ASI development, with a focus on product impact and model behavior.

What you'd actually do

  1. Create ambitious RL environments to push our models to their limits, and measure frontier model capabilities, skills, and behaviors
  2. Develop new methodologies for automatically exploring the behavior of these models
  3. Dive deep into the science of measurement, including understanding scalability, reliability, and variance of our evaluation methodology
  4. Help steer training for our largest training runs, and see the future first
  5. Design scalable systems and processes to support continuous evaluation

Skills

Required

  • machine learning
  • software engineering
  • systems
  • statistics
  • LLMs
  • RL
  • RLHF/RLAIF
  • post-training
  • evals
  • graders
  • synthetic data
  • model training
  • coding agents
  • tool-using agents
  • production ML systems

Nice to have

  • open-ended problems
  • product impact
  • concrete experiment design
  • cross-functional communication
  • building load-bearing systems and processes

What the JD emphasized

  • safe AGI/ASI
  • frontier models
  • model behavior

Other signals

  • frontier agents
  • train models
  • build data, environments, graders, training methods, and feedback loops
  • measure frontier model capabilities
  • guide research programs of ambitious training runs