Currently tracking 498 active AI roles, down 12% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $98k–$1030k (avg $233k).
Google has 584 active AI-related job listings. The majority of these roles are focused on agents, representing 40% of the total, and serving infrastructure, at 26%. The most frequent technical tags include model_serving, agent_orchestration, and evals. Over the last 30 days, Google has added 413 new AI roles, a 105% increase compared to the preceding 30-day period.
Google currently has 586 active AI-related roles in our index. The most common open titles are: Software Engineer (5); AI Adoption Customer Engineer, Google Cloud (3); Conversational AI Consultant (2); Engineering Manager, Egregious Abuse Protection (2); Forward Deployed Engineer III, Generative AI, Google Cloud (2). Most positions are in Engineering and Product.
Google's active AI hiring is concentrated in: agents (43%), serving infrastructure (25%), application (19%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Google is hiring AI talent in: United States (376 roles), India (53 roles), Singapore (40 roles), Switzerland (20 roles).
Job postings at Google most frequently mention: Software Engineering, Algorithms & Data Structures, System Design, Computer Architecture, Machine Learning.
In the past 30 days, Google has posted 571 new AI-related roles. That is a +22% change versus the prior 30 days (469 → 571).
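
The 30-day change quoted above can be reproduced from the two counts. The following is a minimal sketch, assuming the figure is a plain relative change rounded to the nearest whole percent; the function name `percent_change` is illustrative and not part of the index.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the relative change from `previous` to `current`, in percent."""
    if previous == 0:
        raise ValueError("previous count must be non-zero")
    return (current - previous) / previous * 100.0


# Counts quoted above: 469 roles in the prior 30 days, 571 in the latest 30.
change = percent_change(469, 571)
print(f"{change:+.0f}%")  # prints "+22%"
```

The unrounded value is about 21.7%, which is why the summary reports +22%.
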
| Title | Stage | AI score |
|---|---|---|
| Threat Modeler Lead, CBRNE, DeepMind: Lead threat modeler for AI safety in CBRNE domains, focusing on evaluating and mitigating dual-use risks of advanced AI models. This role involves refining threat modeling frameworks, designing evaluations for AI risks, collaborating with mitigation teams, and engaging with external stakeholders. Requires a PhD and experience in national labs or defense organizations, with a preference for experience in red-teaming LLMs and understanding CBRNE risks. | Eval Gate | 9 |
| Senior Technical Program Manager Lead, Gemini Audio, DeepMind: Senior Technical Program Manager Lead for Gemini Audio at Google DeepMind, focusing on end-to-end model quality across the AI lifecycle. The role involves collaborating with researchers, data scientists, and serving/deployment teams to manage model training priorities, design and execute evaluations, and oversee the entire release cycle for foundational audio models. This includes checkpoint uploads, documentation, deployment coordination, capacity planning, and cross-functional testing, with a strong emphasis on applying deep AI evaluation methodologies and driving strategic outlook with high agency. | Eval Gate, Post-train | 9 |
| Senior Staff Software Engineer, Agentic Data Tooling, DeepMind: Senior Staff Software Engineer focused on building agentic data tooling for Gemini, including evaluation frameworks (SmithBench, RE-Bench), data collection pipelines for agent interactions, and human-in-the-loop annotation systems to accelerate AI capabilities and agent development. | Eval Gate, Agent | 9 |
| Senior Staff Software Engineer, Agent Data Quality, DeepMind: Senior Staff Software Engineer focused on Agent Data Quality within DeepMind, responsible for building data processing pipelines, experiment frameworks, and evaluation benchmarks for AI agents. The role involves analyzing agent behavior, identifying failure modes, and providing feedback for GenAI model improvement and product development, with a focus on reasoning, planning, and tool use. | Eval Gate, Agent | 9 |
| Staff Software Engineer, Gemini Evals, GenAI, DeepMind: Staff Software Engineer focused on designing and optimizing distributed evaluation execution engines for AI agents. This role involves building systems for agent testing, developing test problems, creating visualizations, building leaderboards, and testing algorithms on robots. The engineer will also build abstractions for LLM agent loops, tool use, and automated rating systems, and design error classification, retry policies, and observability dashboards to meet SLOs. Collaboration with research scientists and data science teams is key, as is mentoring other engineers and advocating for code quality and system design. | Eval Gate, Agent | 9 |
| Research Engineer, Security and Privacy, DeepMind: Research Engineer at Google DeepMind focused on evaluating agentic capabilities of AI models. The role involves building pipelines and tools for automated red-teaming to identify vulnerabilities and failure modes, collaborating with post-training teams to improve models, and generalizing solutions into reusable libraries. Emphasis on measuring and shaping model behavior through rigorous evaluation, with a goal of improving safety and robustness. | Eval Gate, Post-train | 9 |
| Research Scientist, Evaluations, Security and Privacy, DeepMind: Research Scientist focused on security and privacy for AI models and agentic products, specifically Gemini. The role involves designing and evaluating novel defense mechanisms against adversarial attacks and prompt injections, translating research into practical solutions for training and inference pipelines, and collaborating with core modeling and engineering teams. The position requires a PhD and experience in ML research, benchmarking, and security, with a focus on next-generation security techniques for autonomous AI systems. | Eval Gate, Agent | 9 |
| Senior Staff Research Engineer, DeepMind: Senior Staff Research Engineer at Google DeepMind focused on Agent Evals and Quality for GenAI model improvement and product development. The role involves developing, evaluating, and optimizing LLM-based agents for complex, multi-step tasks. Responsibilities include constructing quantitative benchmarks and automated evaluation frameworks (e.g., LLM-as-a-judge) to measure agent capabilities in reasoning, planning, and tool use, as well as creating and optimizing data mixes from user feedback for training and fine-tuning agents. The role also requires analyzing agent behavior to identify failure modes and performance bottlenecks. | Eval Gate, Agent | 9 |
| Staff Software Engineer, Model Quality: Staff Software Engineer for Google Pics, an AI-powered visual editor, focusing on building and improving automated evaluation systems for generative AI models. The role involves establishing metrics, running evaluations, providing insights for model quality improvement, and creating tools to enhance the evaluation process, with a roadmap towards a 2026 launch. | Eval Gate | 8 |
| Technical Program Manager, Frontier Safety, Alignment and Collaboration, DeepMind: Technical Program Manager for Frontier Safety, Alignment, and Collaboration at Google DeepMind. This role focuses on operational strategy and execution for safe and responsible AI development, bridging AI research with product deployment. Responsibilities include managing safety frameworks, implementing unified safety gates, coordinating evaluations for critical capability levels, and managing mitigation plans for model breaches. The role requires strong program management skills and an understanding of ML/AI safety and alignment principles. | Eval Gate, Post-train | 8 |
| Privacy and Security Technical Assurance Lead, RCI: This role leads AI security assurance testing programs, focusing on independent technical validation and oversight of AI/ML controls. It involves offensive security testing, threat modeling, and collaborating with engineering and legal teams to ensure compliance with AI regulations. The primary output is the assurance testing framework and identified vulnerabilities, with a secondary focus on the security of tuned AI models. | Eval Gate, Post-train | 8 |
| Engineering Analyst, Trust and Safety, Gemini and Labs: This role focuses on architecting the approach to complex risks associated with AI, defining the strategic roadmap for model safety, anticipating future threats, and developing novel evaluation paradigms to influence product and research direction. The primary focus is on ensuring safety as a foundational component of AI systems, with a secondary involvement in fine-tuning techniques and classifier-based guardrails. | Eval Gate, Post-train | 8 |
| Tech Lead Manager, Freshness and Factuality, GeminiApp, DeepMind: Tech Lead Manager for GeminiApp, DeepMind, focusing on measuring the intelligence of AI agents through testing systems, developing test problems, and evaluating agent performance. The role involves leading a team of software engineers to ensure freshness and factuality in Gemini applications, driving backend improvements, and productionizing solutions. It requires a strong software engineering foundation, technical leadership, and experience with deep learning/machine learning, with a focus on offline and online evaluation pipelines and agent testing. | Eval Gate, Agent | 8 |
| Research Engineer, Benchmarking, Robotics, DeepMind: Research Engineer focused on benchmarking foundation models for robotics. The role involves designing evaluation protocols, tooling, and frameworks to assess robot policies in both simulated and real-world environments. Key responsibilities include building infrastructure for large-scale evaluation, root-causing policy failures, establishing evaluation criteria for model releases, and innovating on hardware evaluation processes. The goal is to provide data-driven insights into technological readiness for robotics development. | Eval Gate, Agent | 8 |
| Emerging Impacts Manager, DeepMind: This role focuses on leading ethics and safety reviews for AI projects at DeepMind, assessing downstream societal implications and governance considerations. The manager will collaborate with safety communities to inform model policy, prepare for emerging AI capabilities, and refine assessment frameworks by monitoring real-world impact. They will also design engagement models, develop best practices, and support the responsibility and safety council by presenting project assessments to executive stakeholders. The role requires experience in AI ethics and safety within a governance, policy, legal, or research capacity, with a focus on leading end-to-end assessments of ethical and societal questions related to technology development. | Eval Gate | 8 |
| Software Engineer, AI i18n and Evaluations: Software Engineer focused on AI internationalization and evaluations for Pixel and Android. Responsibilities include leading R&D for AI feature expansion, quality evaluations, and rater quality using on-device and server-based models. Tasks involve creating auto-raters, ensuring metric consistency, establishing benchmarks, and collaborating with AI feature teams. The role also involves identifying opportunities and leading roadmaps to scale language capabilities and improve model evaluation processes. | Eval Gate, Post-train | 8 |
| Staff Software Engineer, NotebookLM, Generative AI, Labs: Staff Software Engineer focused on designing, developing, and maintaining robust evaluations for NotebookLM Chat and Content Studio features. This role involves improving evaluation infrastructure, defining quality metrics, and staying updated on LLM evaluation techniques within Google's Labs group, which incubates early-stage AI efforts. | Eval Gate | 8 |
| Software Engineer, AI i18n and Evaluations: Software Engineer focused on AI internationalization and evaluations for Pixel and Android. Responsibilities include leading R&D for AI feature expansion, quality evaluations, and rater quality using on-device and server-based models. Tasks involve creating auto-raters, ensuring metric consistency, establishing benchmarks, and collaborating with AI feature teams. The role also involves identifying opportunities and leading roadmaps to scale language capabilities and improve model evaluation processes. | Eval Gate, Post-train | 8 |
| Senior Engineering Analyst, Workspace AI, Trust and Safety: This role focuses on ensuring the safety and integrity of Workspace AI products by developing and implementing anti-abuse policies, strategies, and evaluation frameworks. It involves analyzing data, identifying safety issues, and collaborating with engineering and product teams to mitigate risks. | Eval Gate, Post-train | 7 |
| Technical Program Manager, Gemini Evals, DeepMind: Technical Program Manager at Google DeepMind focused on Gemini Evals. The role involves collaborating with engineering and data science teams to design, integrate, and execute model evaluations, conduct loss analysis, and drive strategic goals for AI programs. Requires experience in leading engineering projects and understanding LLM evals, model training, or data science. | Eval Gate, Post-train | 7 |
| Senior Data Scientist, Research, Search Intelligence Quality: This role focuses on evaluating and improving Google Search's Generative AI products, such as AI overview and AI mode. The Senior Data Scientist will develop SOTA AI Raters and advanced measurement frameworks to ensure the quality of AI-generated responses, working with large datasets and analytical methods to inform model development and product strategy. | Eval Gate | 7 |
| Clinical Specialist, Mental Health: This role focuses on evaluating AI model performance in mental health safety and quality applications, providing clinical leadership and guidance for AI projects within Google for Health. The specialist will leverage clinical expertise to influence product development and ensure AI tools improve health journeys. | Eval Gate | 7 |
| Privacy and Security Technical Assurance, Risk, Compliance and Integrity: This role focuses on providing technical assurance and risk management for AI/ML systems within Google's Risk, Compliance and Integrity organization. The individual will be responsible for designing and executing testing frameworks for AI/ML and traditional security controls, leading cross-functional security testing initiatives, and advocating for AI security assurance. The role requires a deep understanding of AI/ML architectures, offensive security testing, threat modeling, and program management capabilities, operating as a critical second line of defense. | Eval Gate | 7 |
| AI Software Developer, Android XR, Application Compatibility: AI Software Developer for Android XR, focusing on application compatibility and evaluation. The role involves building scalable execution frameworks and designing an automated evaluation framework using LLMs and computer vision to detect XR-specific issues. | Eval Gate, Agent | 7 |
| Research Strategist, Emerging Impacts Team, DeepMind: This role focuses on assessing the ethical and safety implications of DeepMind's AI research and applications, working with technical teams and stakeholders to ensure responsible development and deployment of AGI. The strategist will lead ethics and safety reviews, develop best practices, and inform model policy. | Eval Gate | 7 |
| Senior Data Scientist, Core Ranking and AI Context: Senior Data Scientist role focused on Core Ranking and AI Context Engineering for Google's key products like Search, AI Overview, and AI Mode. The role involves identifying quality and metric headroom, conducting analyses, applying AI methods, developing and automating evals and measurements to guide improvements, and partnering with engineering and product teams to drive system changes. | Eval Gate | 7 |
| Lead Technical Analyst, Workspace AI, Trust and Safety: Lead Technical Analyst for Workspace AI Trust and Safety, defining strategy and technical roadmap for AI safety, prompt injection evaluations, and misuse prevention. Designs and implements scalable anti-abuse detection and action systems, including AI agent frameworks. Investigates novel GenAI failure modes and establishes benchmarking/evaluation protocols. Advises stakeholders and mentors analysts. | Eval Gate, Agent | 7 |
| Technical Program Manager, Generative AI Safety: Technical Program Manager for Generative AI Safety at Google, focusing on leading initiatives to expand content safety infrastructure, integrate safety classifiers, and build rapid response capabilities for AI abuse. The role involves partnering with cross-functional leaders to convert threat intelligence into scalable models and technical protections within the serving stack, orchestrating safety engineering teams, and managing global workflows for timely integration and evaluation of safety models for Gemini releases. This role also coordinates with infrastructure teams, generative AI product groups, and foundational model researchers to integrate safety signals into primary models. | Eval Gate, Post-train | 7 |
| Senior Product Manager, GenAI Content Safety: Senior Product Manager for GenAI Content Safety, focusing on scaling safety systems, improving recall of safety protections, building feedback loops for signal performance, partnering with GDM and T&S teams to mitigate emerging risks, and expanding the usability of signals through tooling improvements. | Eval Gate, Agent | 7 |
| Manager, Content Adversarial Red Team: Manager for the Content Adversarial Red Team (CART) responsible for leading a team that conducts adversarial red teaming on Google's generative AI products to uncover loss patterns and ensure product safety. The role involves interfacing with stakeholders, reviewing analytic products, and leveraging AI augmentation for process improvement. | Eval Gate | 7 |
| Associate Principal Analyst, Content Adversarial Red Team: This role focuses on identifying and mitigating emerging content safety risks within Google's Generative AI products. The analyst will develop strategies to uncover novel threats and vulnerabilities, partner with product and engineering teams to implement solutions, and shape internal programs for AI safety. The role involves adversarial testing and advocating for AI safety initiatives. | Eval Gate | 7 |
| Senior Software Engineer, Head Tracking, Beam, AI/ML: Senior Software Engineer for Google Beam, focusing on AI/ML for head tracking. The role involves defining and owning the end-to-end strategy and roadmap for evaluating head tracking performance and robustness. Responsibilities include leading the development of evaluation infrastructure, collaborating with algorithm teams for improvements, designing testing scenarios, and working with cross-functional partners. Requires experience in C++, Python, and building evaluation systems for real-time systems like 3D tracking, robotics, or AR/VR, with a preference for ML frameworks and model evaluation experience. | Eval Gate, Agent | 7 |
| Senior Quality Engineer, Gemini Enterprise Quality: Senior Quality Engineer for Gemini Enterprise Quality at Google Cloud AI Research. This role involves designing and implementing ML solutions, leveraging ML infrastructure, and focusing on quality assurance for AI products, particularly in specialized ML areas like speech/audio or reinforcement learning. The role requires experience in ML infrastructure, including model deployment and evaluation, and contributes to bringing AI innovations to real-world impact. | Eval Gate, Serve | 7 |
| Senior Staff Uber Technical Lead, Observability Intelligence: Senior Staff Uber Technical Lead for Observability Intelligence, driving the strategic shift of SRE incident response to an AI-driven paradigm within Google Cloud's monitoring systems. This role involves leading large-scale ML infrastructure optimization, defining the Observability Intelligence strategy, representing the organization in technical reviews, and partnering with Product Management to translate product needs into scalable architectural solutions. The focus is on building a cohesive, AI-powered observability ecosystem. | Eval Gate, Serve | 7 |
| Senior Clinical Specialist, AI Evaluations: This role focuses on evaluating AI model performance for health applications, leveraging clinical expertise to guide product development and ensure safety, quality, and efficacy. It involves applying evidence-based practices and contributing to the real-world implementation of AI health products. | Eval Gate, Agent | 7 |
| Engineering Analyst II, Gemini and Labs: This role focuses on defining and implementing safety strategies for generative AI systems, including developing evaluation paradigms, guiding engineering and research teams on safety mitigations like fine-tuning and guardrails, and analyzing the AI threat landscape to create a proactive mitigation agenda. The role is critical for ensuring AI safety is a foundational component of Google's AI systems. | Eval Gate, Post-train | 7 |
| Software Engineer III, Skills Evaluation, Chrome: Software Engineer III role focused on building and maintaining evaluation pipelines, safety classifiers, and automated testing systems for AI skills within the Chrome product. This involves designing and implementing metrics, visualization tools, and auto-raters to ensure the quality, safety, and performance of AI workflows, with a focus on integrating with various AI models and browser surfaces. | Eval Gate, Post-train | 7 |
| Principal Analyst, Trust and Safety Trusted Experiences, GenAI: This role focuses on ensuring the safe launch of Generative AI models, acting as a key advisor and strategist for cross-functional teams. It involves anticipating risks, designing testing strategies, analyzing results, and driving mitigation and post-launch monitoring, with a specific emphasis on Text Models, Model Personalization, Model Governance, and Health/Mental Health. | Eval Gate | 7 |
| Staff Software Engineer, Agentic Data and Evals: Staff Software Engineer focused on building and launching tools and solutions for GenAI data generation and evaluations. The role involves developing a self-service data generation platform, performing LLM/GenAI model evaluations, and fine-tuning models using techniques like RLHF. The engineer will work cross-functionally to deliver high-quality data sets and evaluation infrastructure for various GenAI use cases. | Eval Gate, Post-train | 7 |
| Senior Data Scientist, Core Ranking and AI Context: Senior Data Scientist role focused on Core Ranking and AI Context Engineering (CRAFT) for Google Search, AI Overview, and AI Mode products. The role involves identifying quality and metric headroom, conducting analyses, applying statistical/AI methods, developing and automating evals and measurements for iterative improvements, and partnering with engineering and product teams to drive system changes and launches. The position requires a Master's degree in a quantitative field and 5 years of experience in analytics and coding, with preferred experience in consumer-facing products and evaluation methodologies. | Eval Gate, Ship | 7 |
| Senior Strategist, Kids and Learning Trust and Safety: This role focuses on ensuring the safety and trustworthiness of Generative AI experiences for young users, specifically in educational contexts. The Senior Strategist will develop and implement product safety strategies, analyze risks, and work with engineering and product teams to build responsible AI capabilities, including those for image, video, and agentic AI. Key responsibilities include analyzing data to identify and combat abuse, enhancing operational workflows, improving model safety, debugging escalations, and managing technical projects. | Eval Gate, Agent | 7 |
| Staff Data Scientist, Research, Search Health: Research Data Scientist focused on evaluation and metrics for AI answers in Search Health, developing advanced ML/LLM methodologies to identify product opportunities and influence product/engineering directions. | Eval Gate | 7 |
| Senior Engineering Analyst, Photos Responsible AI: This role focuses on ensuring the safety and trustworthiness of AI features within Google Photos, specifically generative AI. The Senior Engineering Analyst will work with various teams to develop and execute comprehensive evaluations, identify emerging risks and abuse vectors, and build resilience against malicious inputs. The role involves defining testing approaches, tools, and solutions, establishing testing to discover risks, and defining program metrics and feedback loops. | Eval Gate, Post-train | 7 |
| Technical Program Manager, Generative AI Safety: Technical Program Manager for Generative AI Safety at Google, focusing on leading initiatives to expand content safety infrastructure, integrate safety classifiers, and build rapid response capabilities for AI abuse. The role involves partnering with cross-functional leaders to convert threat intelligence into scalable models and technical protections within the serving stack, orchestrating safety engineering teams, and managing global workflows for timely integration and evaluation of safety models for Gemini releases. This role also coordinates with infrastructure teams, generative AI product groups, and foundational model researchers to integrate safety signals into primary models. | Eval Gate, Post-train | 7 |