Currently tracking 498 active AI roles, down 12% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $98k–$1030k (avg $233k).
Google has 584 active AI-related job listings. The majority of these roles are focused on agents, representing 40% of the total, and serving infrastructure, at 26%. The most frequent technical tags include model_serving, agent_orchestration, and evals. Over the last 30 days, Google has added 413 new AI roles, a 105% increase compared to the preceding 30-day period.
Google currently has 586 active AI-related roles in our index. The most common open titles are: Software Engineer (5); AI Adoption Customer Engineer, Google Cloud (3); Conversational AI Consultant (2); Engineering Manager, Egregious Abuse Protection (2); and Forward Deployed Engineer III, Generative AI, Google Cloud (2). Most positions are in Engineering and Product.
Google's active AI hiring is concentrated in: agents (43%), serving infrastructure (25%), application (19%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Google is hiring AI talent in: United States (376 roles), India (53 roles), Singapore (40 roles), Switzerland (20 roles).
Job postings at Google most frequently mention: Software Engineering, Algorithms & Data Structures, System Design, Computer Architecture, Machine Learning.
In the past 30 days, Google has posted 571 new AI-related roles. That is a +22% change versus the prior 30 days (469 → 571).
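
The +22% figure above follows from the two counts given (469 → 571). A minimal sketch of that arithmetic, assuming the index uses a plain relative change rounded to the nearest whole percent; the function name `percent_change` is hypothetical and not part of any published tooling:

```python
def percent_change(previous: int, current: int) -> float:
    """Relative change from `previous` to `current`, in percent."""
    if previous == 0:
        # Relative change is undefined when the baseline is zero.
        raise ValueError("previous must be non-zero")
    return (current - previous) / previous * 100


# Counts taken from the sentence above: 469 roles in the prior 30 days, 571 now.
change = percent_change(469, 571)
print(f"{change:+.0f}%")  # prints "+22%"
```

The same helper applies to any of the period-over-period figures in this summary, provided both the prior and current counts are known.
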
| Title and summary | Stage | AI score |
|---|---|---|
| Threat Modeler Lead, CBRNE, DeepMind: Lead threat modeler for AI safety in CBRNE domains, focusing on evaluating and mitigating dual-use risks of advanced AI models. This role involves refining threat modeling frameworks, designing evaluations for AI risks, collaborating with mitigation teams, and engaging with external stakeholders. Requires a PhD and experience in national labs or defense organizations, with a preference for experience in red-teaming LLMs and understanding CBRNE risks. | Eval Gate | 9 |
| Senior Technical Program Manager Lead, Gemini Audio, DeepMind: Senior Technical Program Manager Lead for Gemini Audio at Google DeepMind, focusing on end-to-end model quality across the AI lifecycle. The role involves collaborating with researchers, data scientists, and serving/deployment teams to manage model training priorities, design and execute evaluations, and oversee the entire release cycle for foundational audio models. This includes checkpoint uploads, documentation, deployment coordination, capacity planning, and cross-functional testing, with a strong emphasis on applying deep AI evaluation methodologies and driving strategic outlook with high agency. | Eval Gate, Post-train | 9 |
| Senior Staff Software Engineer, Agentic Data Tooling, DeepMind: Senior Staff Software Engineer focused on building agentic data tooling for Gemini, including evaluation frameworks (SmithBench, RE-Bench), data collection pipelines for agent interactions, and human-in-the-loop annotation systems to accelerate AI capabilities and agent development. | Eval Gate, Agent | 9 |
| Senior Staff Software Engineer, Agent Data Quality, DeepMind: Senior Staff Software Engineer focused on Agent Data Quality within DeepMind, responsible for building data processing pipelines, experiment frameworks, and evaluation benchmarks for AI agents. The role involves analyzing agent behavior, identifying failure modes, and providing feedback for GenAI model improvement and product development, with a focus on reasoning, planning, and tool use. | Eval Gate, Agent | 9 |
| Staff Software Engineer, Gemini Evals, GenAI, DeepMind: Staff Software Engineer focused on designing and optimizing distributed evaluation execution engines for AI agents. This role involves building systems for agent testing, developing test problems, creating visualizations, building leaderboards, and testing algorithms on robots. The engineer will also build abstractions for LLM agent loops, tool use, and automated rating systems, and design error classification, retry policies, and observability dashboards to meet SLOs. Collaboration with research scientists and data science teams is key, as is mentoring other engineers and advocating for code quality and system design. | Eval Gate, Agent | 9 |
| Research Engineer, Security and Privacy, DeepMind: Research Engineer at Google DeepMind focused on evaluating agentic capabilities of AI models. The role involves building pipelines and tools for automated red-teaming to identify vulnerabilities and failure modes, collaborating with post-training teams to improve models, and generalizing solutions into reusable libraries. Emphasis on measuring and shaping model behavior through rigorous evaluation, with a goal of improving safety and robustness. | Eval Gate, Post-train | 9 |
| Research Scientist, Evaluations, Security and Privacy, DeepMind: Research Scientist focused on security and privacy for AI models and agentic products, specifically Gemini. The role involves designing and evaluating novel defense mechanisms against adversarial attacks and prompt injections, translating research into practical solutions for training and inference pipelines, and collaborating with core modeling and engineering teams. The position requires a PhD and experience in ML research, benchmarking, and security, with a focus on next-generation security techniques for autonomous AI systems. | Eval Gate, Agent | 9 |
| Senior Staff Research Engineer, DeepMind: Senior Staff Research Engineer at Google DeepMind focused on Agent Evals and Quality for GenAI model improvement and product development. The role involves developing, evaluating, and optimizing LLM-based agents for complex, multi-step tasks. Responsibilities include constructing quantitative benchmarks and automated evaluation frameworks (e.g., LLM-as-a-judge) to measure agent capabilities in reasoning, planning, and tool use, as well as creating and optimizing data mixes from user feedback for training and fine-tuning agents. The role also requires analyzing agent behavior to identify failure modes and performance bottlenecks. | Eval Gate, Agent | 9 |
| Staff Software Engineer, Model Quality: Staff Software Engineer for Google Pics, an AI-powered visual editor, focusing on building and improving automated evaluation systems for generative AI models. The role involves establishing metrics, running evaluations, providing insights for model quality improvement, and creating tools to enhance the evaluation process, with a roadmap towards a 2026 launch. | Eval Gate | 8 |
| Technical Program Manager, Frontier Safety, Alignment and Collaboration, DeepMind: Technical Program Manager for Frontier Safety, Alignment, and Collaboration at Google DeepMind. This role focuses on operational strategy and execution for safe and responsible AI development, bridging AI research with product deployment. Responsibilities include managing safety frameworks, implementing unified safety gates, coordinating evaluations for critical capability levels, and managing mitigation plans for model breaches. The role requires strong program management skills and an understanding of ML/AI safety and alignment principles. | Eval Gate, Post-train | 8 |
| Research Engineer, Benchmarking, Robotics, DeepMind: Research Engineer focused on benchmarking foundation models for robotics. The role involves designing evaluation protocols, tooling, and frameworks to assess robot policies in both simulated and real-world environments. Key responsibilities include building infrastructure for large-scale evaluation, root-causing policy failures, establishing evaluation criteria for model releases, and innovating on hardware evaluation processes. The goal is to provide data-driven insights into technological readiness for robotics development. | Eval Gate, Agent | 8 |
| Staff Software Engineer, NotebookLM, Generative AI, Labs: Staff Software Engineer focused on designing, developing, and maintaining robust evaluations for NotebookLM Chat and Content Studio features. This role involves improving evaluation infrastructure, defining quality metrics, and staying updated on LLM evaluation techniques within Google's Labs group, which incubates early-stage AI efforts. | Eval Gate | 8 |
| Senior Engineering Analyst, Workspace AI, Trust and Safety: This role focuses on ensuring the safety and integrity of Workspace AI products by developing and implementing anti-abuse policies, strategies, and evaluation frameworks. It involves analyzing data, identifying safety issues, and collaborating with engineering and product teams to mitigate risks. | Eval Gate, Post-train | 7 |
| Technical Program Manager, Gemini Evals, DeepMind: Technical Program Manager at Google DeepMind focused on Gemini Evals. The role involves collaborating with engineering and data science teams to design, integrate, and execute model evaluations, conduct loss analysis, and drive strategic goals for AI programs. Requires experience in leading engineering projects and understanding LLM evals, model training, or data science. | Eval Gate, Post-train | 7 |
| Senior Data Scientist, Research, Search Intelligence Quality: This role focuses on evaluating and improving Google Search's Generative AI products, such as AI Overview and AI Mode. The Senior Data Scientist will develop SOTA AI Raters and advanced measurement frameworks to ensure the quality of AI-generated responses, working with large datasets and analytical methods to inform model development and product strategy. | Eval Gate | 7 |
| Clinical Specialist, Mental Health: This role focuses on evaluating AI model performance in mental health safety and quality applications, providing clinical leadership and guidance for AI projects within Google for Health. The specialist will leverage clinical expertise to influence product development and ensure AI tools improve health journeys. | Eval Gate | 7 |
| Privacy and Security Technical Assurance, Risk, Compliance and Integrity: This role focuses on providing technical assurance and risk management for AI/ML systems within Google's Risk, Compliance and Integrity organization. The individual will be responsible for designing and executing testing frameworks for AI/ML and traditional security controls, leading cross-functional security testing initiatives, and advocating for AI security assurance. The role requires a deep understanding of AI/ML architectures, offensive security testing, threat modeling, and program management capabilities, operating as a critical second line of defense. | Eval Gate | 7 |
| AI Software Developer, Android XR, Application Compatibility: AI Software Developer for Android XR, focusing on application compatibility and evaluation. The role involves building scalable execution frameworks and designing an automated evaluation framework using LLMs and computer vision to detect XR-specific issues. | Eval Gate, Agent | 7 |
| Research Strategist, Emerging Impacts Team, DeepMind: This role focuses on assessing the ethical and safety implications of DeepMind's AI research and applications, working with technical teams and stakeholders to ensure responsible development and deployment of AGI. The strategist will lead ethics and safety reviews, develop best practices, and inform model policy. | Eval Gate | 7 |
| Senior Data Scientist, Core Ranking and AI Context: Senior Data Scientist role focused on Core Ranking and AI Context Engineering for Google's key products like Search, AI Overview, and AI Mode. The role involves identifying quality and metric headroom, conducting analyses, applying AI methods, developing and automating evals and measurements to guide improvements, and partnering with engineering and product teams to drive system changes. | Eval Gate | 7 |
| Lead Technical Analyst, Workspace AI, Trust and Safety: Lead Technical Analyst for Workspace AI Trust and Safety, defining strategy and technical roadmap for AI safety, prompt injection evaluations, and misuse prevention. Designs and implements scalable anti-abuse detection and action systems, including AI agent frameworks. Investigates novel GenAI failure modes and establishes benchmarking/evaluation protocols. Advises stakeholders and mentors analysts. | Eval Gate, Agent | 7 |
| Manager, Content Adversarial Red Team: Manager for the Content Adversarial Red Team (CART) responsible for leading a team that conducts adversarial red teaming on Google's generative AI products to uncover loss patterns and ensure product safety. The role involves interfacing with stakeholders, reviewing analytic products, and leveraging AI augmentation for process improvement. | Eval Gate | 7 |
| Associate Principal Analyst, Content Adversarial Red Team: This role focuses on identifying and mitigating emerging content safety risks within Google's Generative AI products. The analyst will develop strategies to uncover novel threats and vulnerabilities, partner with product and engineering teams to implement solutions, and shape internal programs for AI safety. The role involves adversarial testing and advocating for AI safety initiatives. | Eval Gate | 7 |
| Senior Software Engineer, Head Tracking, Beam, AI/ML: Senior Software Engineer for Google Beam, focusing on AI/ML for head tracking. The role involves defining and owning the end-to-end strategy and roadmap for evaluating head tracking performance and robustness. Responsibilities include leading the development of evaluation infrastructure, collaborating with algorithm teams for improvements, designing testing scenarios, and working with cross-functional partners. Requires experience in C++, Python, and building evaluation systems for real-time systems like 3D tracking, robotics, or AR/VR, with a preference for ML frameworks and model evaluation experience. | Eval Gate, Agent | 7 |
| Senior Quality Engineer, Gemini Enterprise Quality: Senior Quality Engineer for Gemini Enterprise Quality at Google Cloud AI Research. This role involves designing and implementing ML solutions, leveraging ML infrastructure, and focusing on quality assurance for AI products, particularly in specialized ML areas like speech/audio or reinforcement learning. The role requires experience in ML infrastructure, including model deployment and evaluation, and contributes to bringing AI innovations to real-world impact. | Eval Gate, Serve | 7 |
| Senior Staff Uber Technical Lead, Observability Intelligence: Senior Staff Uber Technical Lead for Observability Intelligence, driving the strategic shift of SRE incident response to an AI-driven paradigm within Google Cloud's monitoring systems. This role involves leading large-scale ML infrastructure optimization, defining the Observability Intelligence strategy, representing the organization in technical reviews, and partnering with Product Management to translate product needs into scalable architectural solutions. The focus is on building a cohesive, AI-powered observability ecosystem. | Eval Gate, Serve | 7 |
| Senior Clinical Specialist, AI Evaluations: This role focuses on evaluating AI model performance for health applications, leveraging clinical expertise to guide product development and ensure safety, quality, and efficacy. It involves applying evidence-based practices and contributing to the real-world implementation of AI health products. | Eval Gate, Agent | 7 |
| Software Engineer III, Skills Evaluation, Chrome: Software Engineer III role focused on building and maintaining evaluation pipelines, safety classifiers, and automated testing systems for AI skills within the Chrome product. This involves designing and implementing metrics, visualization tools, and auto-raters to ensure the quality, safety, and performance of AI workflows, with a focus on integrating with various AI models and browser surfaces. | Eval Gate, Post-train | 7 |
| Principal Analyst, Trust and Safety Trusted Experiences, GenAI: This role focuses on ensuring the safe launch of Generative AI models, acting as a key advisor and strategist for cross-functional teams. It involves anticipating risks, designing testing strategies, analyzing results, and driving mitigation and post-launch monitoring, with a specific emphasis on Text Models, Model Personalization, Model Governance, and Health/Mental Health. | Eval Gate | 7 |
| Staff Software Engineer, Agentic Data and Evals: Staff Software Engineer focused on building and launching tools and solutions for GenAI data generation and evaluations. The role involves developing a self-service data generation platform, performing LLM/GenAI model evaluations, and fine-tuning models using techniques like RLHF. The engineer will work cross-functionally to deliver high-quality data sets and evaluation infrastructure for various GenAI use cases. | Eval Gate, Post-train | 7 |
| Senior Data Scientist, Core Ranking and AI Context: Senior Data Scientist role focused on Core Ranking and AI Context Engineering (CRAFT) for Google Search, AI Overview, and AI Mode products. The role involves identifying quality and metric headroom, conducting analyses, applying statistical/AI methods, developing and automating evals and measurements for iterative improvements, and partnering with engineering and product teams to drive system changes and launches. The position requires a Master's degree in a quantitative field and 5 years of experience in analytics and coding, with preferred experience in consumer-facing products and evaluation methodologies. | Eval Gate, Ship | 7 |
| Senior Strategist, Kids and Learning Trust and Safety: This role focuses on ensuring the safety and trustworthiness of Generative AI experiences for young users, specifically in educational contexts. The Senior Strategist will develop and implement product safety strategies, analyze risks, and work with engineering and product teams to build responsible AI capabilities, including those for image, video, and agentic AI. Key responsibilities include analyzing data to identify and combat abuse, enhancing operational workflows, improving model safety, debugging escalations, and managing technical projects. | Eval Gate, Agent | 7 |
| Staff Data Scientist, Research, Search Health: Research Data Scientist focused on evaluation and metrics for AI answers in Search Health, developing advanced ML/LLM methodologies to identify product opportunities and influence product/engineering directions. | Eval Gate | 7 |