Google has 584 active AI-related job listings. The majority of these roles are focused on agents, representing 40% of the total, and serving infrastructure, at 26%. The most frequent technical tags include model_serving, agent_orchestration, and evals. Over the last 30 days, Google has added 413 new AI roles, a 105% increase compared to the preceding 30-day period.
We are currently tracking 498 active AI roles, down 12% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range: $98k–$1,030k (avg $233k).
Google currently has 586 active AI-related roles in our index. The most common open titles are: Software Engineer (5); AI Adoption Customer Engineer, Google Cloud (3); Conversational AI Consultant (2); Engineering Manager, Egregious Abuse Protection (2); Forward Deployed Engineer III, Generative AI, Google Cloud (2). Most positions are in Engineering and Product.
Google's active AI hiring is concentrated in: agents (43%), serving infrastructure (25%), application (19%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Google is hiring AI talent in: United States (376 roles), India (53 roles), Singapore (40 roles), Switzerland (20 roles).
Job postings at Google most frequently mention: Software Engineering, Algorithms & Data Structures, System Design, Computer Architecture, Machine Learning.
In the past 30 days, Google has posted 571 new AI-related roles. That is a +22% change versus the prior 30 days (469 → 571).
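The percentage above can be reproduced from the two counts it quotes. The following is a minimal sketch, assuming the figure is a plain relative change rounded to the nearest whole percent; the helper name `pct_change` is hypothetical and is not part of the index.

```python
def pct_change(previous: float, current: float) -> float:
    """Return the percentage change from `previous` to `current`."""
    if previous == 0:
        raise ValueError("percentage change is undefined when the previous value is 0")
    return (current - previous) / previous * 100.0


# Counts quoted above: 469 roles in the prior 30 days, 571 in the latest 30 days.
change = pct_change(469, 571)
print(f"{change:+.0f}%")  # prints "+22%"
```

The unrounded value is about 21.7%, so the reported +22% is consistent with rounding to the nearest whole percent.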
| Title | Stage | AI score |
|---|---|---|
| Senior Staff Software Engineer, Agentic Data Tooling, DeepMind: Senior Staff Software Engineer focused on building agentic data tooling for Gemini, including evaluation frameworks (SmithBench, RE-Bench), data collection pipelines for agent interactions, and human-in-the-loop annotation systems to accelerate AI capabilities and agent development. | Eval Gate, Agent | 9 |
| Senior Staff Software Engineer, Agent Data Quality, DeepMind: Senior Staff Software Engineer focused on Agent Data Quality within DeepMind, responsible for building data processing pipelines, experiment frameworks, and evaluation benchmarks for AI agents. The role involves analyzing agent behavior, identifying failure modes, and providing feedback for GenAI model improvement and product development, with a focus on reasoning, planning, and tool use. | Eval Gate, Agent | 9 |
| Staff Software Engineer, Gemini Evals, GenAI, DeepMind: Staff Software Engineer focused on designing and optimizing distributed evaluation execution engines for AI agents. This role involves building systems for agent testing, developing test problems, creating visualizations, building leaderboards, and testing algorithms on robots. The engineer will also build abstractions for LLM agent loops, tool use, and automated rating systems, and design error classification, retry policies, and observability dashboards to meet SLOs. Collaboration with research scientists and data science teams is key, as is mentoring other engineers and advocating for code quality and system design. | Eval Gate, Agent | 9 |
| Senior Staff Research Engineer, DeepMind: Senior Staff Research Engineer at Google DeepMind focused on Agent Evals and Quality for GenAI model improvement and product development. The role involves developing, evaluating, and optimizing LLM-based agents for complex, multi-step tasks. Responsibilities include constructing quantitative benchmarks and automated evaluation frameworks (e.g., LLM-as-a-judge) to measure agent capabilities in reasoning, planning, and tool use, as well as creating and optimizing data mixes from user feedback for training and fine-tuning agents. The role also requires analyzing agent behavior to identify failure modes and performance bottlenecks. | Eval Gate, Agent | 9 |
| Staff Software Engineer, Model Quality: Staff Software Engineer for Google Pics, an AI-powered visual editor, focusing on building and improving automated evaluation systems for generative AI models. The role involves establishing metrics, running evaluations, providing insights for model quality improvement, and creating tools to enhance the evaluation process, with a roadmap towards a 2026 launch. | Eval Gate | 8 |
| Technical Program Manager, Frontier Safety, Alignment and Collaboration, DeepMind: Technical Program Manager for Frontier Safety, Alignment, and Collaboration at Google DeepMind. This role focuses on operational strategy and execution for safe and responsible AI development, bridging AI research with product deployment. Responsibilities include managing safety frameworks, implementing unified safety gates, coordinating evaluations for critical capability levels, and managing mitigation plans for model breaches. The role requires strong program management skills and an understanding of ML/AI safety and alignment principles. | Eval Gate, Post-train | 8 |
| Privacy and Security Technical Assurance Lead, RCI: This role leads AI security assurance testing programs, focusing on independent technical validation and oversight of AI/ML controls. It involves offensive security testing, threat modeling, and collaborating with engineering and legal teams to ensure compliance with AI regulations. The primary output is the assurance testing framework and identified vulnerabilities, with a secondary focus on the security of tuned AI models. | Eval Gate, Post-train | 8 |
| Engineering Analyst, Trust and Safety, Gemini and Labs: This role focuses on architecting the approach to complex risks associated with AI, defining the strategic roadmap for model safety, anticipating future threats, and developing novel evaluation paradigms to influence product and research direction. The primary focus is on ensuring safety as a foundational component of AI systems, with a secondary involvement in fine-tuning techniques and classifier-based guardrails. | Eval Gate, Post-train | 8 |
| Tech Lead Manager, Freshness and Factuality, GeminiApp, DeepMind: Tech Lead Manager for GeminiApp, DeepMind, focusing on measuring the intelligence of AI agents through testing systems, developing test problems, and evaluating agent performance. The role involves leading a team of software engineers to ensure freshness and factuality in Gemini applications, driving backend improvements, and productionizing solutions. It requires a strong software engineering foundation, technical leadership, and experience with deep learning/machine learning, with a focus on offline and online evaluation pipelines and agent testing. | Eval Gate, Agent | 8 |
| Research Engineer, Benchmarking, Robotics, DeepMind: Research Engineer focused on benchmarking foundation models for robotics. The role involves designing evaluation protocols, tooling, and frameworks to assess robot policies in both simulated and real-world environments. Key responsibilities include building infrastructure for large-scale evaluation, root-causing policy failures, establishing evaluation criteria for model releases, and innovating on hardware evaluation processes. The goal is to provide data-driven insights into technological readiness for robotics development. | Eval Gate, Agent | 8 |
| Software Engineer, AI i18n and Evaluations: Software Engineer focused on AI internationalization and evaluations for Pixel and Android. Responsibilities include leading R&D for AI feature expansion, quality evaluations, and rater quality using on-device and server-based models. Tasks involve creating auto-raters, ensuring metric consistency, establishing benchmarks, and collaborating with AI feature teams. The role also involves identifying opportunities and leading roadmaps to scale language capabilities and improve model evaluation processes. | Eval Gate, Post-train | 8 |
| Staff Software Engineer, NotebookLM, Generative AI, Labs: Staff Software Engineer focused on designing, developing, and maintaining robust evaluations for NotebookLM Chat and Content Studio features. This role involves improving evaluation infrastructure, defining quality metrics, and staying updated on LLM evaluation techniques within Google's Labs group, which incubates early-stage AI efforts. | Eval Gate | 8 |
| Software Engineer, AI i18n and Evaluations: Software Engineer focused on AI internationalization and evaluations for Pixel and Android. Responsibilities include leading R&D for AI feature expansion, quality evaluations, and rater quality using on-device and server-based models. Tasks involve creating auto-raters, ensuring metric consistency, establishing benchmarks, and collaborating with AI feature teams. The role also involves identifying opportunities and leading roadmaps to scale language capabilities and improve model evaluation processes. | Eval Gate, Post-train | 8 |
| Senior Engineering Analyst, Workspace AI, Trust and Safety: This role focuses on ensuring the safety and integrity of Workspace AI products by developing and implementing anti-abuse policies, strategies, and evaluation frameworks. It involves analyzing data, identifying safety issues, and collaborating with engineering and product teams to mitigate risks. | Eval Gate, Post-train | 7 |
| Technical Program Manager, Gemini Evals, DeepMind: Technical Program Manager at Google DeepMind focused on Gemini Evals. The role involves collaborating with engineering and data science teams to design, integrate, and execute model evaluations, conduct loss analysis, and drive strategic goals for AI programs. Requires experience in leading engineering projects and understanding LLM evals, model training, or data science. | Eval Gate, Post-train | 7 |
| Privacy and Security Technical Assurance, Risk, Compliance and Integrity: This role focuses on providing technical assurance and risk management for AI/ML systems within Google's Risk, Compliance and Integrity organization. The individual will be responsible for designing and executing testing frameworks for AI/ML and traditional security controls, leading cross-functional security testing initiatives, and advocating for AI security assurance. The role requires a deep understanding of AI/ML architectures, offensive security testing, threat modeling, and program management capabilities, operating as a critical second line of defense. | Eval Gate | 7 |
| AI Software Developer, Android XR, Application Compatibility: AI Software Developer for Android XR, focusing on application compatibility and evaluation. The role involves building scalable execution frameworks and designing an automated evaluation framework using LLMs and computer vision to detect XR-specific issues. | Eval Gate, Agent | 7 |
| Senior Data Scientist, Core Ranking and AI Context: Senior Data Scientist role focused on Core Ranking and AI Context Engineering for Google's key products like Search, AI Overview, and AI Mode. The role involves identifying quality and metric headroom, conducting analyses, applying AI methods, developing and automating evals and measurements to guide improvements, and partnering with engineering and product teams to drive system changes. | Eval Gate | 7 |
| Lead Technical Analyst, Workspace AI, Trust and Safety: Lead Technical Analyst for Workspace AI Trust and Safety, defining strategy and technical roadmap for AI safety, prompt injection evaluations, and misuse prevention. Designs and implements scalable anti-abuse detection and action systems, including AI agent frameworks. Investigates novel GenAI failure modes and establishes benchmarking/evaluation protocols. Advises stakeholders and mentors analysts. | Eval Gate, Agent | 7 |
| Technical Program Manager, Generative AI Safety: Technical Program Manager for Generative AI Safety at Google, focusing on leading initiatives to expand content safety infrastructure, integrate safety classifiers, and build rapid response capabilities for AI abuse. The role involves partnering with cross-functional leaders to convert threat intelligence into scalable models and technical protections within the serving stack, orchestrating safety engineering teams, and managing global workflows for timely integration and evaluation of safety models for Gemini releases. This role also coordinates with infrastructure teams, generative AI product groups, and foundational model researchers to integrate safety signals into primary models. | Eval Gate, Post-train | 7 |
| Manager, Content Adversarial Red Team: Manager for the Content Adversarial Red Team (CART) responsible for leading a team that conducts adversarial red teaming on Google's generative AI products to uncover loss patterns and ensure product safety. The role involves interfacing with stakeholders, reviewing analytic products, and leveraging AI augmentation for process improvement. | Eval Gate | 7 |
| Associate Principal Analyst, Content Adversarial Red Team: This role focuses on identifying and mitigating emerging content safety risks within Google's Generative AI products. The analyst will develop strategies to uncover novel threats and vulnerabilities, partner with product and engineering teams to implement solutions, and shape internal programs for AI safety. The role involves adversarial testing and advocating for AI safety initiatives. | Eval Gate | 7 |
| Senior Software Engineer, Head Tracking, Beam, AI/ML: Senior Software Engineer for Google Beam, focusing on AI/ML for head tracking. The role involves defining and owning the end-to-end strategy and roadmap for evaluating head tracking performance and robustness. Responsibilities include leading the development of evaluation infrastructure, collaborating with algorithm teams for improvements, designing testing scenarios, and working with cross-functional partners. Requires experience in C++, Python, and building evaluation systems for real-time systems like 3D tracking, robotics, or AR/VR, with a preference for ML frameworks and model evaluation experience. | Eval Gate, Agent | 7 |
| Senior Quality Engineer, Gemini Enterprise Quality: Senior Quality Engineer for Gemini Enterprise Quality at Google Cloud AI Research. This role involves designing and implementing ML solutions, leveraging ML infrastructure, and focusing on quality assurance for AI products, particularly in specialized ML areas like speech/audio or reinforcement learning. The role requires experience in ML infrastructure, including model deployment and evaluation, and contributes to bringing AI innovations to real-world impact. | Eval Gate, Serve | 7 |
| Senior Staff Uber Technical Lead, Observability Intelligence: Senior Staff Uber Technical Lead for Observability Intelligence, driving the strategic shift of SRE incident response to an AI-driven paradigm within Google Cloud's monitoring systems. This role involves leading large-scale ML infrastructure optimization, defining the Observability Intelligence strategy, representing the organization in technical reviews, and partnering with Product Management to translate product needs into scalable architectural solutions. The focus is on building a cohesive, AI-powered observability ecosystem. | Eval Gate, Serve | 7 |
| Engineering Analyst II, Gemini and Labs: This role focuses on defining and implementing safety strategies for generative AI systems, including developing evaluation paradigms, guiding engineering and research teams on safety mitigations like fine-tuning and guardrails, and analyzing the AI threat landscape to create a proactive mitigation agenda. The role is critical for ensuring AI safety is a foundational component of Google's AI systems. | Eval Gate, Post-train | 7 |
| Software Engineer III, Skills Evaluation, Chrome: Software Engineer III role focused on building and maintaining evaluation pipelines, safety classifiers, and automated testing systems for AI skills within the Chrome product. This involves designing and implementing metrics, visualization tools, and auto-raters to ensure the quality, safety, and performance of AI workflows, with a focus on integrating with various AI models and browser surfaces. | Eval Gate, Post-train | 7 |
| Staff Software Engineer, Agentic Data and Evals: Staff Software Engineer focused on building and launching tools and solutions for GenAI data generation and evaluations. The role involves developing a self-service data generation platform, performing LLM/GenAI model evaluations, and fine-tuning models using techniques like RLHF. The engineer will work cross-functionally to deliver high-quality data sets and evaluation infrastructure for various GenAI use cases. | Eval Gate, Post-train | 7 |
| Senior Data Scientist, Core Ranking and AI Context: Senior Data Scientist role focused on Core Ranking and AI Context Engineering (CRAFT) for Google Search, AI Overview, and AI Mode products. The role involves identifying quality and metric headroom, conducting analyses, applying statistical/AI methods, developing and automating evals and measurements for iterative improvements, and partnering with engineering and product teams to drive system changes and launches. The position requires a Master's degree in a quantitative field and 5 years of experience in analytics and coding, with preferred experience in consumer-facing products and evaluation methodologies. | Eval Gate, Ship | 7 |
| Senior Engineering Analyst, Photos Responsible AI: This role focuses on ensuring the safety and trustworthiness of AI features within Google Photos, specifically generative AI. The Senior Engineering Analyst will work with various teams to develop and execute comprehensive evaluations, identify emerging risks and abuse vectors, and build resilience against malicious inputs. The role involves defining testing approaches, tools, and solutions, establishing testing to discover risks, and defining program metrics and feedback loops. | Eval Gate, Post-train | 7 |
| Technical Program Manager, Generative AI Safety: Technical Program Manager for Generative AI Safety at Google, focusing on leading initiatives to expand content safety infrastructure, integrate safety classifiers, and build rapid response capabilities for AI abuse. The role involves partnering with cross-functional leaders to convert threat intelligence into scalable models and technical protections within the serving stack, orchestrating safety engineering teams, and managing global workflows for timely integration and evaluation of safety models for Gemini releases. This role also coordinates with infrastructure teams, generative AI product groups, and foundational model researchers to integrate safety signals into primary models. | Eval Gate, Post-train | 7 |