Researcher, Agent Post-training, Personality

OpenAI OpenAI · AI Frontier · San Francisco, CA · Research

Researcher focused on post-training of AI agents to improve their collaborative personality, involving behavioral research, data creation, reward modeling, and collaboration with product teams to ship improved agent models.

What you'd actually do

  1. Develop a rigorous understanding of what makes an agent a great collaborator across professional, creative, technical, and everyday work.
  2. Turn qualitative judgments about model behavior into concrete hypotheses, evals, graders, and training interventions.
  3. Study explicit and implicit user signals to understand which behaviors create trust, satisfaction, continued use, and successful outcomes.
  4. Work with human experts and trainers to produce high-quality, tasteful rollouts and preference data that capture excellent collaborative behavior.
  5. Improve reward models and RL objectives for model behaviors.

Skills

Required

  • Machine learning
  • Software engineering
  • Statistics
  • Behavioral science
  • HCI
  • LLMs
  • post-training
  • RL/RLHF
  • reward modeling
  • evals
  • synthetic data
  • pretraining data
  • production ML systems

Nice to have

  • strong taste for model behavior
  • individuality, adaptability, and behavioral diversity
  • building load-bearing systems and processes

What the JD emphasized

  • frontier agents
  • post-training
  • personality
  • collaborative behavior
  • reward models
  • RL objectives
  • training data
  • model improvements

Other signals

  • agent post-training
  • personality
  • collaborative agents
  • reward models
  • preference data