Senior Staff Research Scientist, Gemini Safety Post-training, Deepmind

Google Google · Big Tech · Mountain View, CA +1

Senior Staff Research Scientist focused on rethinking and developing safety post-training methods for agentic AI systems, particularly for Gemini models. The role involves designing and shipping post-training recipes (RL, SFT), building evaluation metrics, and translating research into production.

What you'd actually do

  1. Rethink how safety is trained into models, especially for agentic, long-horizon behavior.
  2. Design and ship post-training recipes (Reinforcement Learning (RL), Supervised Fine-Tuning (SFT), and beyond) that install safety and alignment properties into Gemini models. You own the path from research to production.
  3. Build the metrics and evaluations that tell us whether training is actually making models safer in deployment, not just on benchmarks.
  4. Work directly with the post-training pipeline and infrastructure. Partner with the AGI Safety team to bring alignment research into practical training. Translate between research and production.
  5. Shape the road map for where safety post-training goes next. Build and grow the team to execute on it.

Skills

Required

  • PhD in Computer Science, a related field, or equivalent practical experience.
  • 6 years of experience in Machine Learning Algorithms and Language Modeling.
  • One or more scientific publications in the ML/AI conferences or journals (e.g., NeurIPS, ICML, ICLR, CVPR).

Nice to have

  • 6 years of experience in ML research, with 3 years of experience shipping Reinforcement Learning-based (or equivalent) post-training pipelines.
  • 5 years of experience leading the cross-functional teams in complex, matrixed environments and ability to influence stakeholders, resolve incentives, and provide strategic technical judgment.
  • Ability to deploy the performance improvements in production foundation models.

What the JD emphasized

  • shipping Reinforcement Learning-based (or equivalent) post-training pipelines
  • deploy the performance improvements in production foundation models

Other signals

  • developing novel training methods for AI safety
  • shipping post-training recipes for safety and alignment
  • building metrics and evaluations for AI safety in deployment
  • working with agentic AI systems