What you'd actually do

Propose and pursue novel research directions by formulating and testing hypotheses.

Implement algorithmic ideas and conduct end-to-end experiments, analyzing results and iterating on findings.

Design evaluations and ablations to answer key questions and drive research decisions.

Build and improve infrastructure to enable research at scale.

Communicate research findings through write-ups, presentations, and publications, while fostering a culture of high standards and constructive feedback.

Skills

Required

PhD in Machine Learning or equivalent
Experience implementing algorithms in research codebases
Experience in reinforcement learning research
Contributions to peer-reviewed publications
Experience designing and executing end-to-end experiments

Nice to have

Advanced RL topics (RL for sequence models, post-training, preference-based learning, agentic systems)
Modern research stacks (JAX/Flax or PyTorch)
Scaling experiments
Experimental judgment
Scaling methodologies
Evaluation techniques
Diagnosing complex failure modes
High agency and drive
Prioritization skills
Initiative

What the JD emphasized

PhD in Machine Learning, or equivalent practical experience.

2 years of experience implementing algorithms within research codebases.

Experience conducting research in reinforcement learning, including contributions to peer-reviewed publications.

Experience designing and executing end-to-end experiments, including setup, analysis, and interpretation.

Experience with advanced reinforcement learning topics, such as RL for sequence models, post-training, preference-based learning, or agentic systems.

Other signals

DeepMind's RL team

building DQN, AlphaGo, Rainbow, AlphaZero, MuZero, AlphaStar, AlphaProof and Gemini

advancing AI development

pushing the boundaries across multiple domains

novel research directions

end-to-end experiments

design evaluations and ablations

build and improve infrastructure

peer-reviewed publications

advanced reinforcement learning topics

RL for sequence models

post-training

preference-based learning

agentic systems

As an organization, Google maintains a portfolio of research projects driven by fundamental research, new product innovation, product contribution and infrastructure goals, while providing individuals and teams the freedom to emphasize specific types of work. As a Research Scientist, you'll setup large-scale tests and deploy promising ideas quickly and broadly, managing deadlines and deliverables while applying the latest theories to develop new and improved products, processes, or technologies. From creating experiments and prototyping implementations to designing new architectures, our research scientists work on real-world problems that span the breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more.

As a Research Scientist, you'll also actively contribute to the wider research community by sharing and publishing your findings, with ideas inspired by internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.

DeepMind’s Reinforcement Learning (RL) team is a long-standing and tight-knit team of collaborative scientists and engineers, led by Tom Schaul. We address large-scale research issues in reinforcement learning. We design, refine, and scale RL algorithms and deliver meaningful scientific or product impact. Over the past decade, members of the RL team have been instrumental in building DQN, AlphaGo, Rainbow, AlphaZero, MuZero, AlphaStar, AlphaProof and Gemini.

Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.

We are pushing the boundaries across multiple domains. Our global teams offer diverse learning opportunities and varied career pathways for those driven to achieve exceptional results through collective effort.

Responsibilities

Propose and pursue novel research directions by formulating and testing hypotheses.
Implement algorithmic ideas and conduct end-to-end experiments, analyzing results and iterating on findings.
Design evaluations and ablations to answer key questions and drive research decisions.
Build and improve infrastructure to enable research at scale.
Communicate research findings through write-ups, presentations, and publications, while fostering a culture of high standards and constructive feedback.

Qualifications

Minimum qualifications:

PhD in Machine Learning, or equivalent practical experience.
2 years of experience implementing algorithms within research codebases.
Experience conducting research in reinforcement learning, including contributions to peer-reviewed publications.
Experience designing and executing end-to-end experiments, including setup, analysis, and interpretation.

Preferred qualifications:

Experience with advanced reinforcement learning topics, such as RL for sequence models, post-training, preference-based learning, or agentic systems.
Familiarity with modern research stacks (e.g., JAX/Flax or PyTorch) and experience scaling experiments.
Strong experimental judgment, including selecting appropriate baselines and designing insightful ablations.
Comfort with scaling methodologies, evaluation techniques, and diagnosing complex failure modes.
High agency and drive to push projects forward, prioritize effectively, and take initiative.
Excellent communication skills, with a focus on clear and honest presentation of research results.

Responsibilities

Propose and pursue novel research directions by formulating and testing hypotheses.
Implement algorithmic ideas and conduct end-to-end experiments, analyzing results and iterating on findings.
Design evaluations and ablations to answer key questions and drive research decisions.
Build and improve infrastructure to enable research at scale.
Communicate research findings through write-ups, presentations, and publications, while fostering a culture of high standards and constructive feedback.

Qualifications

Minimum qualifications:

PhD in Machine Learning, or equivalent practical experience.
2 years of experience implementing algorithms within research codebases.
Experience conducting research in reinforcement learning, including contributions to peer-reviewed publications.
Experience designing and executing end-to-end experiments, including setup, analysis, and interpretation.

Preferred qualifications:

Experience with advanced reinforcement learning topics, such as RL for sequence models, post-training, preference-based learning, or agentic systems.
Familiarity with modern research stacks (e.g., JAX/Flax or PyTorch) and experience scaling experiments.
Strong experimental judgment, including selecting appropriate baselines and designing insightful ablations.
Comfort with scaling methodologies, evaluation techniques, and diagnosing complex failure modes.
High agency and drive to push projects forward, prioritize effectively, and take initiative.
Excellent communication skills, with a focus on clear and honest presentation of research results.