Agent Post-training, Personality

OpenAI OpenAI · AI Frontier · San Francisco, CA · Research

This role focuses on post-training AI agents to improve their collaborative abilities, including thoughtfulness, clarity, perceptiveness, and proactivity. It involves translating qualitative judgments into training signals, reward models, and evals, with a strong emphasis on user experience and behavioral research. The role bridges research, product thinking, and engineering to shape the personality and effectiveness of frontier agents.

What you'd actually do

  1. Develop a rigorous understanding of what makes an agent a great collaborator across professional, creative, technical, and everyday work.
  2. Turn qualitative judgments about model behavior into concrete hypotheses, evals, graders, and training interventions.
  3. Study explicit and implicit user signals to understand which behaviors create trust, satisfaction, continued use, and successful outcomes.
  4. Work with human experts and trainers to produce high-quality, tasteful rollouts and preference data that capture excellent collaborative behavior.
  5. Improve reward models and RL objectives for model behaviors.

Skills

Required

  • machine learning
  • software engineering
  • statistics
  • behavioral science
  • HCI
  • LLMs
  • post-training
  • RL/RLHF
  • reward modeling
  • evals
  • synthetic data
  • pretraining data
  • production ML systems

Nice to have

  • strong taste for model behavior
  • preserving individuality, adaptability, and behavioral diversity
  • building load-bearing systems and processes

What the JD emphasized

  • rigorous understanding
  • concrete hypotheses
  • user signals
  • human experts
  • preference data
  • reward models
  • RL objectives
  • consumer insight
  • consumer insight
  • model improvements
  • user feedback
  • model behavior
  • behavioral science
  • HCI
  • load-bearing systems
  • production ML systems

Other signals

  • agent post-training
  • personality
  • collaborator
  • reward models
  • RL objectives
  • user signals
  • human experts
  • preference data
  • product teams