Research Engineer, Frontier Evals & Environments

OpenAI · AI Frontier · San Francisco, CA · Research

A Research Engineer role focused on building environments and methodologies for measuring frontier AI models and steering them toward safe AGI/ASI. The work influences training and launch decisions.

What you'd actually do

  1. Create ambitious RL environments to push our models to their limits
  2. Work on measuring frontier model capabilities, skills, and behaviors
  3. Develop new methodologies for automatically exploring the behavior of these models
  4. Help steer training for our largest training runs, and see the future first
  5. Design scalable systems and processes to support continuous evaluation

Skills

Required

  • ML research engineering
  • stochastic systems
  • observability and monitoring
  • LLM-enabled applications
  • AI evaluations
  • engineering skills
  • statistical analysis skills

Nice to have

  • red-teaming systems
  • cross-functional work
  • communication skills

What the JD emphasized

  • AGI/ASI measurement
  • frontier model capabilities
  • self-improvement loops
  • RL environments
  • continuous evaluation
  • model understanding
