Researcher, Trustworthy AI

OpenAI OpenAI · AI Frontier · San Francisco, CA · Safety Systems

Researcher focused on AI safety and societal impacts, translating policy problems into technical research, building methods for public input into model values, and increasing rigor of external assurances for AI model deployments.

What you'd actually do

  1. Set research and strategies to study societal impacts of our models in an action-relevant manner and figure out how to tie this back into model design
  2. Build creative methods and run experiments that enable public input into model values
  3. Increasing rigor of external assurances by turning external findings into robust evaluations
  4. Facilitating and growing our ability to effectively de-risk flagship model deployments in a timely manner

Skills

Required

  • Python
  • AI safety
  • RLHF
  • adversarial training
  • robustness
  • LLM evaluations
  • interdisciplinary research
  • socio-technical topics

Nice to have

  • multimodal datasets

What the JD emphasized

  • 3+ years of research experience
  • AI safety
  • LLM evaluations

Other signals

  • AI safety research
  • societal impacts of AI
  • AGI deployment readiness
  • external assurances for AI