AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

© 2026 AI Hire Signal · Not affiliated with companies shown

Google has 584 active AI-related job listings. The majority of these roles are focused on agents, representing 40% of the total, and serving infrastructure, at 26%. The most frequent technical tags include model_serving, agent_orchestration, and evals. Over the last 30 days, Google has added 413 new AI roles, a 105% increase compared to the preceding 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 498 active AI roles, down 12% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $98k–$1030k (avg $233k).

Hiring: 498 / 1188
Momentum (4w): ↓ -151 (-12%) · 1101 opens last 4w · 1252 prior 4w
Salary range: $98k–$1030k (avg $233k) · USD · disclosed roles only
Tracked since: Jan '25 · last role today
[Chart: Hiring velocity, new roles per week]

Frequently asked questions

  • What AI roles is Google hiring for?

    Google currently has 586 active AI-related roles in our index. The most common open titles are: Software Engineer (5), AI Adoption Customer Engineer, Google Cloud (3), Conversational AI Consultant (2), Engineering Manager, Egregious Abuse Protection (2), Forward Deployed Engineer III, Generative AI, Google Cloud (2). Most positions are in Engineering and Product.

  • What stage of AI development does Google focus on?

    Google's active AI hiring is concentrated in: agents (43%), serving infrastructure (25%), application (19%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is Google hiring AI talent?

    Google is hiring AI talent in: United States (376 roles), India (53 roles), Singapore (40 roles), Switzerland (20 roles).

  • What skills does Google look for in AI roles?

    Job postings at Google most frequently mention: Software Engineering, Algorithms & Data Structures, System Design, Computer Architecture, Machine Learning.

  • How many AI roles has Google posted recently?

    In the past 30 days, Google has posted 571 new AI-related roles. That is a +22% change versus the prior 30 days (469 → 571).
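
The percentage figures quoted on this page can be checked with a simple relative-change calculation. The sketch below is only an illustration: the function name `pct_change` is hypothetical, and it assumes the site computes change as (current - prior) / prior, which matches the two pairs of figures shown here but is not confirmed by the source.

```python
def pct_change(prior: int, current: int) -> float:
    """Relative change from the prior period to the current one, in percent."""
    if prior == 0:
        # With no roles in the prior period the ratio is undefined.
        raise ValueError("prior period is zero; percent change is undefined")
    return (current - prior) / prior * 100


# 30-day postings quoted in the FAQ: 469 -> 571
print(round(pct_change(469, 571)))    # 22

# 4-week opens quoted in the momentum figure: 1252 prior, 1101 last
print(1101 - 1252)                    # -151
print(round(pct_change(1252, 1101)))  # -12
```

Under that assumption, both the +22% and the -12% figures follow directly from the raw counts.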

Jobs (48)

498 AI · 1491 total active
Filtered by stage: Eval Gate
Stage: Data (50), Pretrain (31), Post-train (73), Serve (322), Agent (612), Eval Gate (48), Ship (254)
Function: Engineering (2727), Product (630), Research (137)
Country: United States (2154), India (372), Singapore (176), Poland (151), Taiwan (128), United Kingdom (110), Switzerland (89), Israel (76), Canada (64), Ireland (55), Brazil (46), Mexico (45), Australia (43), Japan (36), Germany (18), South Korea (15), Spain (13), Romania (8), China (7), Sweden (7), Argentina (6), France (6), Chile (5), Hong Kong (5), Belgium (4), Denmark (4), Thailand (4), Vietnam (4), Colombia (3), Netherlands (3), Norway (3), Indonesia (2), Italy (2), Malaysia (2), Philippines (2), Czech Republic (1), Greece (1), Saudi Arabia (1), South Africa (1)

Title · Stage · Function · Location · First seen · AI score
Threat Modeler Lead, CBRNE, DeepMind
Lead threat modeler for AI safety in CBRNE domains, focusing on evaluating and mitigating dual-use risks of advanced AI models. This role involves refining threat modeling frameworks, designing evaluations for AI risks, collaborating with mitigation teams, and engaging with external stakeholders. Requires a PhD and experience in national labs or defense organizations, with a preference for experience in red-teaming LLMs and understanding CBRNE risks.
Eval Gate · Research · New York, NY +2 · 6d ago · AI score 9
Senior Technical Program Manager Lead, Gemini Audio, DeepMind
Senior Technical Program Manager Lead for Gemini Audio at Google DeepMind, focusing on end-to-end model quality across the AI lifecycle. The role involves collaborating with researchers, data scientists, and serving/deployment teams to manage model training priorities, design and execute evaluations, and oversee the entire release cycle for foundational audio models. This includes checkpoint uploads, documentation, deployment coordination, capacity planning, and cross-functional testing, with a strong emphasis on applying deep AI evaluation methodologies and driving strategic outlook with high agency.
Eval Gate, Post-train · Product · Mountain View, CA +2 · 3w ago · AI score 9
Senior Staff Software Engineer, Agentic Data Tooling, DeepMind
Senior Staff Software Engineer focused on building agentic data tooling for Gemini, including evaluation frameworks (SmithBench, RE-Bench), data collection pipelines for agent interactions, and human-in-the-loop annotation systems to accelerate AI capabilities and agent development.
Eval Gate, Agent · Engineering · New York, NY +3 · 4w ago · AI score 9
Senior Staff Software Engineer, Agent Data Quality, DeepMind
Senior Staff Software Engineer focused on Agent Data Quality within DeepMind, responsible for building data processing pipelines, experiment frameworks, and evaluation benchmarks for AI agents. The role involves analyzing agent behavior, identifying failure modes, and providing feedback for GenAI model improvement and product development, with a focus on reasoning, planning, and tool use.
Eval Gate, Agent · Engineering · Mountain View, CA +1 · 6w ago · AI score 9
Staff Software Engineer, Gemini Evals, GenAI, DeepMind
Staff Software Engineer focused on designing and optimizing distributed evaluation execution engines for AI agents. This role involves building systems for agent testing, developing test problems, creating visualizations, building leaderboards, and testing algorithms on robots. The engineer will also build abstractions for LLM agent loops, tool use, and automated rating systems, and design error classification, retry policies, and observability dashboards to meet SLOs. Collaboration with research scientists and data science teams is key, as is mentoring other engineers and advocating for code quality and system design.
Eval Gate, Agent · Engineering · Mountain View, CA +2 · 6w ago · AI score 9
Research Engineer, Security and Privacy, DeepMind
Research Engineer at Google DeepMind focused on evaluating agentic capabilities of AI models. The role involves building pipelines and tools for automated red-teaming to identify vulnerabilities and failure modes, collaborating with post-training teams to improve models, and generalizing solutions into reusable libraries. Emphasis on measuring and shaping model behavior through rigorous evaluation, with a goal of improving safety and robustness.
Eval Gate, Post-train · Research · New York, NY +2 · 6w ago · AI score 9
Research Scientist, Evaluations, Security and Privacy, DeepMind
Research Scientist focused on security and privacy for AI models and agentic products, specifically Gemini. The role involves designing and evaluating novel defense mechanisms against adversarial attacks and prompt injections, translating research into practical solutions for training and inference pipelines, and collaborating with core modeling and engineering teams. The position requires a PhD and experience in ML research, benchmarking, and security, with a focus on next-generation security techniques for autonomous AI systems.
Eval Gate, Agent · Research · Mountain View, CA +2 · 7w ago · AI score 9
Senior Staff Research Engineer, DeepMind
Senior Staff Research Engineer at Google DeepMind focused on Agent Evals and Quality for GenAI model improvement and product development. The role involves developing, evaluating, and optimizing LLM-based agents for complex, multi-step tasks. Responsibilities include constructing quantitative benchmarks and automated evaluation frameworks (e.g., LLM-as-a-judge) to measure agent capabilities in reasoning, planning, and tool use, as well as creating and optimizing data mixes from user feedback for training and fine-tuning agents. The role also requires analyzing agent behavior to identify failure modes and performance bottlenecks.
Eval Gate, Agent · Engineering · Mountain View, CA +1 · Apr 14 · AI score 9
Staff Software Engineer, Model Quality
Staff Software Engineer for Google Pics, an AI-powered visual editor, focusing on building and improving automated evaluation systems for generative AI models. The role involves establishing metrics, running evaluations, providing insights for model quality improvement, and creating tools to enhance the evaluation process, with a roadmap towards a 2026 launch.
Eval Gate · Engineering · New York, NY +1 · yesterday · AI score 8
Technical Program Manager, Frontier Safety, Alignment and Collaboration, DeepMind
Technical Program Manager for Frontier Safety, Alignment, and Collaboration at Google DeepMind. This role focuses on operational strategy and execution for safe and responsible AI development, bridging AI research with product deployment. Responsibilities include managing safety frameworks, implementing unified safety gates, coordinating evaluations for critical capability levels, and managing mitigation plans for model breaches. The role requires strong program management skills and an understanding of ML/AI safety and alignment principles.
Eval Gate, Post-train · Engineering · Mountain View, CA +1 · yesterday · AI score 8
Privacy and Security Technical Assurance Lead, RCI
This role leads AI security assurance testing programs, focusing on independent technical validation and oversight of AI/ML controls. It involves offensive security testing, threat modeling, and collaborating with engineering and legal teams to ensure compliance with AI regulations. The primary output is the assurance testing framework and identified vulnerabilities, with a secondary focus on the security of tuned AI models.
Eval Gate, Post-train · Engineering · Dublin, Ireland · 1w ago · AI score 8
Engineering Analyst, Trust and Safety, Gemini and Labs
This role focuses on architecting the approach to complex risks associated with AI, defining the strategic roadmap for model safety, anticipating future threats, and developing novel evaluation paradigms to influence product and research direction. The primary focus is on ensuring safety as a foundational component of AI systems, with a secondary involvement in fine-tuning techniques and classifier-based guardrails.
Eval Gate, Post-train · Engineering · Bengaluru, Karnataka, India · 1w ago · AI score 8
Tech Lead Manager, Freshness and Factuality, GeminiApp, DeepMind
Tech Lead Manager for GeminiApp, DeepMind, focusing on measuring the intelligence of AI agents through testing systems, developing test problems, and evaluating agent performance. The role involves leading a team of software engineers to ensure freshness and factuality in Gemini applications, driving backend improvements, and productionizing solutions. It requires a strong software engineering foundation, technical leadership, and experience with deep learning/machine learning, with a focus on offline and online evaluation pipelines and agent testing.
Eval Gate, Agent · Engineering · Zürich, Switzerland · 2w ago · AI score 8
Research Engineer, Benchmarking, Robotics, DeepMind
Research Engineer focused on benchmarking foundation models for robotics. The role involves designing evaluation protocols, tooling, and frameworks to assess robot policies in both simulated and real-world environments. Key responsibilities include building infrastructure for large-scale evaluation, root-causing policy failures, establishing evaluation criteria for model releases, and innovating on hardware evaluation processes. The goal is to provide data-driven insights into technological readiness for robotics development.
Eval Gate, Agent · Engineering · Mountain View, CA +1 · 3w ago · AI score 8
Emerging Impacts Manager, DeepMind
This role focuses on leading ethics and safety reviews for AI projects at DeepMind, assessing downstream societal implications and governance considerations. The manager will collaborate with safety communities to inform model policy, prepare for emerging AI capabilities, and refine assessment frameworks by monitoring real-world impact. They will also design engagement models, develop best practices, and support the responsibility and safety council by presenting project assessments to executive stakeholders. The role requires experience in AI ethics and safety within a governance, policy, legal, or research capacity, with a focus on leading end-to-end assessments of ethical and societal questions related to technology development.
Eval Gate · Product · London, United Kingdom · 4w ago · AI score 8
Software Engineer, AI i18n and Evaluations
Software Engineer focused on AI internationalization and evaluations for Pixel and Android. Responsibilities include leading R&D for AI feature expansion, quality evaluations, and rater quality using on-device and server-based models. Tasks involve creating auto-raters, ensuring metric consistency, establishing benchmarks, and collaborating with AI feature teams. The role also involves identifying opportunities and leading roadmaps to scale language capabilities and improve model evaluation processes.
Eval Gate, Post-train · Engineering · Singapore · 4w ago · AI score 8
Staff Software Engineer, NotebookLM, Generative AI, Labs
Staff Software Engineer focused on designing, developing, and maintaining robust evaluations for NotebookLM Chat and Content Studio features. This role involves improving evaluation infrastructure, defining quality metrics, and staying updated on LLM evaluation techniques within Google's Labs group, which incubates early-stage AI efforts.
Eval Gate · Engineering · Mountain View, CA +1 · 6w ago · AI score 8
Software Engineer, AI i18n and Evaluations
Software Engineer focused on AI internationalization and evaluations for Pixel and Android. Responsibilities include leading R&D for AI feature expansion, quality evaluations, and rater quality using on-device and server-based models. Tasks involve creating auto-raters, ensuring metric consistency, establishing benchmarks, and collaborating with AI feature teams. The role also involves identifying opportunities and leading roadmaps to scale language capabilities and improve model evaluation processes.
Eval Gate, Post-train · Engineering · Singapore · Apr 17 · AI score 8
Senior Engineering Analyst, Workspace AI, Trust and Safety
This role focuses on ensuring the safety and integrity of Workspace AI products by developing and implementing anti-abuse policies, strategies, and evaluation frameworks. It involves analyzing data, identifying safety issues, and collaborating with engineering and product teams to mitigate risks.
Eval Gate, Post-train · Engineering · Seattle, WA +1 · 2d ago · AI score 7
Technical Program Manager, Gemini Evals, DeepMind
Technical Program Manager at Google DeepMind focused on Gemini Evals. The role involves collaborating with engineering and data science teams to design, integrate, and execute model evaluations, conduct loss analysis, and drive strategic goals for AI programs. Requires experience in leading engineering projects and understanding LLM evals, model training, or data science.
Eval Gate, Post-train · Engineering · Mountain View, CA +1 · 2w ago · AI score 7
Senior Data Scientist, Research, Search Intelligence Quality
This role focuses on evaluating and improving Google Search's Generative AI products, such as AI overview and AI mode. The Senior Data Scientist will develop SOTA AI Raters and advanced measurement frameworks to ensure the quality of AI-generated responses, working with large datasets and analytical methods to inform model development and product strategy.
Eval Gate · Research · Mountain View, CA +1 · 3w ago · AI score 7
Clinical Specialist, Mental Health
This role focuses on evaluating AI model performance in mental health safety and quality applications, providing clinical leadership and guidance for AI projects within Google for Health. The specialist will leverage clinical expertise to influence product development and ensure AI tools improve health journeys.
Eval Gate · Product · New York, NY +3 · 3w ago · AI score 7
Privacy and Security Technical Assurance, Risk, Compliance and Integrity
This role focuses on providing technical assurance and risk management for AI/ML systems within Google's Risk, Compliance and Integrity organization. The individual will be responsible for designing and executing testing frameworks for AI/ML and traditional security controls, leading cross-functional security testing initiatives, and advocating for AI security assurance. The role requires a deep understanding of AI/ML architectures, offensive security testing, threat modeling, and program management capabilities, operating as a critical second line of defense.
Eval Gate · Engineering · Austin, TX +3 · 3w ago · AI score 7
AI Software Developer, Android XR, Application Compatibility
AI Software Developer for Android XR, focusing on application compatibility and evaluation. The role involves building scalable execution frameworks and designing an automated evaluation framework using LLMs and computer vision to detect XR-specific issues.
Eval Gate, Agent · Engineering · Waterloo, ON +3 · 3w ago · AI score 7
Research Strategist, Emerging Impacts Team, DeepMind
This role focuses on assessing the ethical and safety implications of DeepMind's AI research and applications, working with technical teams and stakeholders to ensure responsible development and deployment of AGI. The strategist will lead ethics and safety reviews, develop best practices, and inform model policy.
Eval Gate · Research · Mountain View, CA +1 · 3w ago · AI score 7
Senior Data Scientist, Core Ranking and AI Context
Senior Data Scientist role focused on Core Ranking and AI Context Engineering for Google's key products like Search, AI Overview, and AI Mode. The role involves identifying quality and metric headroom, conducting analyses, applying AI methods, developing and automating evals and measurements to guide improvements, and partnering with engineering and product teams to drive system changes.
Eval Gate · Engineering · Mountain View, CA +1 · 3w ago · AI score 7
Lead Technical Analyst, Workspace AI, Trust and Safety
Lead Technical Analyst for Workspace AI Trust and Safety, defining strategy and technical roadmap for AI safety, prompt injection evaluations, and misuse prevention. Designs and implements scalable anti-abuse detection and action systems, including AI agent frameworks. Investigates novel GenAI failure modes and establishes benchmarking/evaluation protocols. Advises stakeholders and mentors analysts.
Eval Gate, Agent · Engineering · Seattle, WA +1 · 3w ago · AI score 7
Technical Program Manager, Generative AI Safety
Technical Program Manager for Generative AI Safety at Google, focusing on leading initiatives to expand content safety infrastructure, integrate safety classifiers, and build rapid response capabilities for AI abuse. The role involves partnering with cross-functional leaders to convert threat intelligence into scalable models and technical protections within the serving stack, orchestrating safety engineering teams, and managing global workflows for timely integration and evaluation of safety models for Gemini releases. This role also coordinates with infrastructure teams, generative AI product groups, and foundational model researchers to integrate safety signals into primary models.
Eval Gate, Post-train · Engineering · Singapore · 4w ago · AI score 7
Senior Product Manager, GenAI Content Safety
Senior Product Manager for GenAI Content Safety, focusing on scaling safety systems, improving recall of safety protections, building feedback loops for signal performance, partnering with GDM and T&S teams to mitigate emerging risks, and expanding the usability of signals through tooling improvements.
Eval Gate, Agent · Product · Dublin, Ireland · 4w ago · AI score 7
Manager, Content Adversarial Red Team
Manager for the Content Adversarial Red Team (CART) responsible for leading a team that conducts adversarial red teaming on Google's generative AI products to uncover loss patterns and ensure product safety. The role involves interfacing with stakeholders, reviewing analytic products, and leveraging AI augmentation for process improvement.
Eval Gate · Engineering · San Francisco, CA +1 · 5w ago · AI score 7
Associate Principal Analyst, Content Adversarial Red Team
This role focuses on identifying and mitigating emerging content safety risks within Google's Generative AI products. The analyst will develop strategies to uncover novel threats and vulnerabilities, partner with product and engineering teams to implement solutions, and shape internal programs for AI safety. The role involves adversarial testing and advocating for AI safety initiatives.
Eval Gate · Engineering · Seattle, WA +2 · 5w ago · AI score 7
Senior Software Engineer, Head Tracking, Beam, AI/ML
Senior Software Engineer for Google Beam, focusing on AI/ML for head tracking. The role involves defining and owning the end-to-end strategy and roadmap for evaluating head tracking performance and robustness. Responsibilities include leading the development of evaluation infrastructure, collaborating with algorithm teams for improvements, designing testing scenarios, and working with cross-functional partners. Requires experience in C++, Python, and building evaluation systems for real-time systems like 3D tracking, robotics, or AR/VR, with a preference for ML frameworks and model evaluation experience.
Eval Gate, Agent · Engineering · Seattle, WA +3 · 5w ago · AI score 7
Senior Quality Engineer, Gemini Enterprise Quality
Senior Quality Engineer for Gemini Enterprise Quality at Google Cloud AI Research. This role involves designing and implementing ML solutions, leveraging ML infrastructure, and focusing on quality assurance for AI products, particularly in specialized ML areas like speech/audio or reinforcement learning. The role requires experience in ML infrastructure, including model deployment and evaluation, and contributes to bringing AI innovations to real-world impact.
Eval Gate, Serve · Engineering · Sunnyvale, CA +1 · 7w ago · AI score 7
Senior Staff Uber Technical Lead, Observability Intelligence
Senior Staff Uber Technical Lead for Observability Intelligence, driving the strategic shift of SRE incident response to an AI-driven paradigm within Google Cloud's monitoring systems. This role involves leading large-scale ML infrastructure optimization, defining the Observability Intelligence strategy, representing the organization in technical reviews, and partnering with Product Management to translate product needs into scalable architectural solutions. The focus is on building a cohesive, AI-powered observability ecosystem.
Eval Gate, Serve · Engineering · New York, NY +1 · 7w ago · AI score 7
Senior Clinical Specialist, AI Evaluations
This role focuses on evaluating AI model performance for health applications, leveraging clinical expertise to guide product development and ensure safety, quality, and efficacy. It involves applying evidence-based practices and contributing to the real-world implementation of AI health products.
Eval Gate, Agent · Product · Mountain View, CA +3 · 7w ago · AI score 7
Engineering Analyst II, Gemini and Labs
This role focuses on defining and implementing safety strategies for generative AI systems, including developing evaluation paradigms, guiding engineering and research teams on safety mitigations like fine-tuning and guardrails, and analyzing the AI threat landscape to create a proactive mitigation agenda. The role is critical for ensuring AI safety is a foundational component of Google's AI systems.
Eval Gate, Post-train · Engineering · Bengaluru, Karnataka, India · 7w ago · AI score 7
Software Engineer III, Skills Evaluation, Chrome
Software Engineer III role focused on building and maintaining evaluation pipelines, safety classifiers, and automated testing systems for AI skills within the Chrome product. This involves designing and implementing metrics, visualization tools, and auto-raters to ensure the quality, safety, and performance of AI workflows, with a focus on integrating with various AI models and browser surfaces.
Eval Gate, Post-train · Engineering · Kirkland, WA +1 · Apr 30 · AI score 7
Principal Analyst, Trust and Safety Trusted Experiences, GenAI
This role focuses on ensuring the safe launch of Generative AI models, acting as a key advisor and strategist for cross-functional teams. It involves anticipating risks, designing testing strategies, analyzing results, and driving mitigation and post-launch monitoring, with a specific emphasis on Text Models, Model Personalization, Model Governance, and Health/Mental Health.
Eval Gate · Product · Mountain View, CA +1 · Apr 29 · AI score 7
Staff Software Engineer, Agentic Data and Evals
Staff Software Engineer focused on building and launching tools and solutions for GenAI data generation and evaluations. The role involves developing a self-service data generation platform, performing LLM/GenAI model evaluations, and fine-tuning models using techniques like RLHF. The engineer will work cross-functionally to deliver high-quality data sets and evaluation infrastructure for various GenAI use cases.
Eval Gate, Post-train · Engineering · Sunnyvale, CA +1 · Apr 21 · AI score 7
Senior Data Scientist, Core Ranking and AI Context
Senior Data Scientist role focused on Core Ranking and AI Context Engineering (CRAFT) for Google Search, AI Overview, and AI Mode products. The role involves identifying quality and metric headroom, conducting analyses, applying statistical/AI methods, developing and automating evals and measurements for iterative improvements, and partnering with engineering and product teams to drive system changes and launches. The position requires a Master's degree in a quantitative field and 5 years of experience in analytics and coding, with preferred experience in consumer-facing products and evaluation methodologies.
Eval Gate, Ship · Engineering · Mountain View, CA +1 · Apr 16 · AI score 7
Senior Strategist, Kids and Learning Trust and Safety
This role focuses on ensuring the safety and trustworthiness of Generative AI experiences for young users, specifically in educational contexts. The Senior Strategist will develop and implement product safety strategies, analyze risks, and work with engineering and product teams to build responsible AI capabilities, including those for image, video, and agentic AI. Key responsibilities include analyzing data to identify and combat abuse, enhancing operational workflows, improving model safety, debugging escalations, and managing technical projects.
Eval Gate, Agent · Product · Seattle, WA +2 · Apr 14 · AI score 7
Staff Data Scientist, Research, Search Health
Research Data Scientist focused on evaluation and metrics for AI answers in Search Health, developing advanced ML/LLM methodologies to identify product opportunities and influence product/engineering directions.
Eval Gate · Research · Mountain View, CA +3 · Apr 10 · AI score 7
Senior Engineering Analyst, Photos Responsible AI
This role focuses on ensuring the safety and trustworthiness of AI features within Google Photos, specifically generative AI. The Senior Engineering Analyst will work with various teams to develop and execute comprehensive evaluations, identify emerging risks and abuse vectors, and build resilience against malicious inputs. The role involves defining testing approaches, tools, and solutions, establishing testing to discover risks, and defining program metrics and feedback loops.
Eval Gate, Post-train · Engineering · Bengaluru, Karnataka, India · Apr 7 · AI score 7
Technical Program Manager, Generative AI Safety
Technical Program Manager for Generative AI Safety at Google, focusing on leading initiatives to expand content safety infrastructure, integrate safety classifiers, and build rapid response capabilities for AI abuse. The role involves partnering with cross-functional leaders to convert threat intelligence into scalable models and technical protections within the serving stack, orchestrating safety engineering teams, and managing global workflows for timely integration and evaluation of safety models for Gemini releases. This role also coordinates with infrastructure teams, generative AI product groups, and foundational model researchers to integrate safety signals into primary models.
Eval Gate, Post-train · Engineering · Singapore · Apr 7 · AI score 7
Senior Virologist, DeepMind
This role focuses on developing and executing biology evaluations to test the safety of AI models, specifically LLMs, and driving the development of harm frameworks and mitigation strategies. The individual will collaborate with experts in science, AI ethics, policy, and safety, and communicate results to decision-makers. The role requires a PhD in virology or equivalent, with experience in biosecurity principles at the intersection of microbiology and AI safety, including LLM risk assessments.
Eval Gate · Research · Mountain View, CA +2 · yesterday · AI score 5
Principal Engineering Analyst, RAI Testing
This role focuses on building and operationalizing scalable, automated AI testing frameworks and evaluation systems within Google's Trust & Safety Responsible AI Testing team. The goal is to empower product teams with self-service infrastructure for standard safety evaluations, while the role itself handles high-risk, bespoke evaluations for novel AI paradigms. It involves leveraging SQL and Python to embed these systems into developer pipelines and partnering with cross-functional stakeholders.
Eval Gate · Engineering · Washington, DC +1 · 5w ago · AI score 5
Engineering Analyst, Kids and Learning Trust and Safety
This role supports the launch of Generative AI search experiences and education efforts, focusing on responsible AI capabilities. The analyst will perform data analysis to identify and combat abuse, develop datasets and run evaluations for Gen AI products, establish metrics for AI issues, and improve model safety through data analysis. The role requires experience in data analysis, project management, and familiarity with ML model performance or LLMs.
Eval Gate · Engineering · Dublin, Ireland · 7w ago · AI score 5
Learning Impact Specialist, LearnX
This role focuses on developing and implementing evaluation frameworks to assess the quality of Generative AI tools within an educational context. The specialist will leverage learning science principles to consult with product development teams and lead discussions on how GenAI can shape better learning outcomes. While not directly building AI models, the role is critical in evaluating their impact and quality in educational products.
Eval Gate · Product · Mountain View, CA +2 · 8w ago · AI score 5