Research Scientist, Frontier Safety Loss of Control, Deepmind

Google Google · Big Tech · San Francisco, CA +1

Research Scientist role focused on identifying and preventing harms from misaligned AI agents, developing and implementing technical controls, monitoring agent behavior, and conducting adversarial testing. The role involves working with internal product teams to ensure adoption of control systems on high-risk AI surfaces, with a background in frontier AI research and agentic assistance.

What you'd actually do

  1. Identify potential harms from misaligned agents and develop strategies for detection and prevention.
  2. Implement technical controls to monitor agent thoughts, behaviour, and respond to mitigate potential harms.
  3. Integrate various agent behaviour signals from across the organisation to inform response policies.
  4. Conduct adversarial testing of controls.
  5. Work with internal product teams to ensure that control systems are adopted over all high-risk AI surfaces.

Skills

Required

  • Python
  • engineering and agentic assistance
  • frontier AI research and development environment
  • professional software engineering or research team environment
  • technical stakeholders
  • frontier model risk

Nice to have

  • engineering or product design for AI tools or assistants
  • ML Research and Development (R&D)
  • cybersecurity detection and response
  • collaborating or leading an applied ML project
  • Large Language Model (LLM) training and inference
  • AI control
  • chain-of-thought
  • monitoring
  • faithfulness
  • monitorability

What the JD emphasized

  • frontier AI research
  • agentic assistance
  • misaligned agents
  • technical controls
  • adversarial testing
  • high-risk AI surfaces

Other signals

  • frontier AI research
  • agentic assistance
  • misaligned agents
  • technical controls
  • adversarial testing