AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

© 2026 AI Hire Signal·Not affiliated with companies shown

Google has 584 active AI-related job listings. The majority of these roles are focused on agents, representing 40% of the total, and serving infrastructure, at 26%. The most frequent technical tags include model_serving, agent_orchestration, and evals. Over the last 30 days, Google has added 413 new AI roles, a 105% increase compared to the preceding 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 498 active AI roles, down 12% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $98k–$1030k (avg $233k).

Hiring: 498 / 1188
Momentum (4w): ↓ 151 (-12%) · 1101 opens last 4w · 1252 prior 4w
Salary range: $98k–$1030k (avg $233k) · USD · disclosed roles only
Tracked since: Jan '25 · last role today
Hiring velocity (chart: new roles per week; weekly counts range from 1 to 385 new roles)

Frequently asked questions

  • What AI roles is Google hiring for?

    Google currently has 586 active AI-related roles in our index. The most common open titles are: Software Engineer (5), AI Adoption Customer Engineer, Google Cloud (3), Conversational AI Consultant (2), Engineering Manager, Egregious Abuse Protection (2), Forward Deployed Engineer III, Generative AI, Google Cloud (2). Most positions are in Engineering and Product.

  • What stage of AI development does Google focus on?

    Google's active AI hiring is concentrated in: agents (43%), serving infrastructure (25%), application (19%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is Google hiring AI talent?

    Google is hiring AI talent in: United States (376 roles), India (53 roles), Singapore (40 roles), Switzerland (20 roles).

  • What skills does Google look for in AI roles?

    Job postings at Google most frequently mention: Software Engineering, Algorithms & Data Structures, System Design, Computer Architecture, Machine Learning.

  • How many AI roles has Google posted recently?

    In the past 30 days, Google has posted 571 new AI-related roles. That is a +22% change versus the prior 30 days (469 → 571).
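The period-over-period figures quoted on this page (for example 469 → 571, or 1101 opens versus 1252 in the prior 4 weeks) are consistent with a plain percent-change calculation. The sketch below is only an illustration under that assumption; the function names `pct_change` and `format_change` are hypothetical and are not taken from the site.

```python
def pct_change(prior: int, current: int) -> float:
    """Percent change from the prior period to the current one."""
    if prior == 0:
        # Assumption: a zero baseline is treated as an error rather than infinity.
        raise ValueError("prior period count must be non-zero")
    return (current - prior) / prior * 100.0


def format_change(prior: int, current: int) -> str:
    """Render the change as a signed whole percent plus the raw counts."""
    return f"{pct_change(prior, current):+.0f}% ({prior} → {current})"


if __name__ == "__main__":
    print(format_change(469, 571))    # 30-day posting counts from the FAQ
    print(format_change(1252, 1101))  # 4-week opens from the momentum card
```

Rounding to a whole percent reproduces both the +22% and the -12% shown on the page.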

Jobs (173)

498 AI · 1491 total active
Filtered by: Stage: Serve · Country: United States
Show: Active only · AI only (≥ 7)

Stage: Data · 28, Pretrain · 31, Post-train · 73, Serve · 271, Agent · 539, Eval Gate · 44, Ship · 202
Function: Engineering · 970, Product · 111, Research · 107
Country: United States · 751, India · 92, Singapore · 78, United Kingdom · 44, Switzerland · 41, Canada · 31, Poland · 31, Taiwan · 26, Brazil · 20, Israel · 14, Australia · 13, Ireland · 11, Mexico · 10, Japan · 9, Germany · 6, Spain · 5, France · 4, South Korea · 4, Sweden · 4, China · 3, Chile · 2, Hong Kong · 2, Argentina · 1, Colombia · 1, Italy · 1, Netherlands · 1, Romania · 1, South Africa · 1

Each listing below gives: Title, description, then Stage · Function · Location · First seen · AI score.
Senior Software Engineer, DeepMind
Senior Software Engineer at Google DeepMind focused on building and enhancing serving solutions for Gemini models, developing new infrastructure for advanced capabilities like streaming and audio logic, and ensuring model quality in production. The role involves driving technical vision and roadmap for the team.
Serve, Agent · Engineering · Mountain View, CA +1 · 2d ago · AI score 9
Engineering Manager, ML Performance
Engineering Manager for Google's TPU Performance team, focusing on optimizing the speed and efficiency of AI/ML model training and inference on custom TPU hardware. The role involves leading a team to develop and maintain ML benchmarks, identify performance opportunities, drive optimizations (near-term and out-of-the-box), and participate in algorithmic innovations and co-designing TPU-friendly models. This includes work on inference serving, quantization, and compiler optimizations, serving both internal Google teams and external AI companies.
Serve, Post-train · Engineering · Sunnyvale, CA +2 · 6d ago · AI score 9
Senior Research Engineer, On-Device Inference, Robotics, DeepMind
Senior Research Engineer focused on optimizing Gemini Robotics models for low-latency on-device inference, driving alignment between model and hardware architectures, and influencing future model designs for resource-constrained environments.
Serve, Agent · Engineering · Mountain View, CA +1 · 3w ago · AI score 9
Senior Software Engineer
Senior Software Engineer at Google DeepMind focused on building and enhancing serving solutions for Gemini models, developing new infrastructure for advanced capabilities like multimodal understanding, and ensuring model quality in production. The role involves collaboration, driving technical vision, and working with large-scale production systems and machine learning specialization.
Serve, Agent · Engineering · New York, NY +1 · 3w ago · AI score 9
Senior Software Engineer
Senior Software Engineer role focused on building and enhancing serving solutions for Gemini models, developing new infrastructure for advanced capabilities like large-scale streaming and audio logic, and ensuring the quality of models in production. The role involves collaborating with peers, driving technical vision, and requires experience in C++, algorithm design, debugging ML systems, and productionizing LLMs/multimodal models.
Serve, Agent · Engineering · New York, NY +1 · 3w ago · AI score 9
Staff Engineer, TPU Co-Design
Staff Engineer focused on co-designing TPU hardware for AI/ML applications, bridging model architecture innovation with next-generation hardware design. Responsibilities include optimizing the hardware/software stack for ML model training and serving, developing simulators, and conducting system-level performance analysis.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 4w ago · AI score 9
Senior Staff Engineer, TPU Co-Design
Senior Staff Engineer focused on co-designing TPU hardware for AI/ML training and serving. The role involves defining the hardware/software roadmap, bridging AI research with hardware design, and optimizing performance for large ML models. This position operates at the intersection of AI research and infrastructure engineering, aiming to deliver high-performance, power-efficient accelerators.
Serve, Pretrain · Engineering · Sunnyvale, CA +1 · 4w ago · AI score 9
Staff Software Engineer, AI/ML Performance
Staff Software Engineer focused on optimizing AI/ML training and serving workloads on TPUs. The role involves identifying performance opportunities, driving optimizations through custom kernels, compiler/runtime improvements, and algorithmic innovation. It also includes co-designing TPU-friendly models and working with frontier lab hyperscalers and foundation model builders.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 4w ago · AI score 9
Research Scientist, Efficient AI
Research Scientist focused on developing resource-efficient AI architectures, training/inference recipes, and model compression techniques to make ML models faster and smaller, enabling efficient deployment on Google's infrastructure. The role involves independent research, translating ideas to experiments, and contributing to the research community through publications.
Serve, Post-train · Research · Mountain View, CA +1 · 5w ago · AI score 9
Staff Software Engineer, AI/ML GenAI, Google Cloud Applications AI
Staff Software Engineer role at Google Cloud AI Research focusing on designing, developing, and deploying large-scale GenAI solutions. The role involves technical leadership, optimizing ML infrastructure, guiding data preparation and model optimization, and working with LLMs, Multi-Modal, and Large Vision Models. Experience with ML design, infrastructure, and GenAI techniques is required.
Serve, Post-train · Engineering · Sunnyvale, CA +2 · 6w ago · AI score 9
Senior Research Engineer, On-Device Inference, Robotics, DeepMind
Senior Research Engineer focused on optimizing Gemini Robotics models for low-latency on-device inference, driving alignment between model architectures and edge device constraints, and influencing research and engineering teams for robust solutions. Requires deep knowledge of inference techniques across GPU, TPU, and CPU architectures.
Serve, Agent · Engineering · Mountain View, CA +1 · 8w ago · AI score 9
Power and Performance Architect, TPU
This role focuses on defining and driving the power architecture roadmap for Google's next-generation TPUs, which are AI/ML hardware accelerators. The architect will bridge the gap between high-level concepts and silicon execution, optimizing for performance-per-watt for ML workloads and ensuring successful implementation of power management features. This involves collaboration with various teams, including SOC implementation, hardware/software validation, and data center operations, to align silicon capabilities with system-level power constraints. The role requires deep expertise in computer chip design, performance analysis, and power analysis, with a strong emphasis on machine learning accelerator architecture and workload characterization for power optimization.
Serve · Engineering · Sunnyvale, CA +1 · Apr 24 · AI score 9
Staff Software Engineer, AI/ML GenAI, Google Cloud AI
Staff Software Engineer at Google Cloud AI Research focused on designing, developing, and deploying GenAI solutions. This role involves leading the design of GenAI solutions, optimizing ML infrastructure, and guiding the development of data preparation and model optimization strategies. Requires significant experience in software development, ML infrastructure optimization, and state-of-the-art GenAI techniques.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · Apr 13 · AI score 9
Staff Software Engineer, AI/ML Performance
Staff Software Engineer focused on optimizing AI/ML training and serving workloads on TPUs. This role involves identifying performance bottlenecks, driving optimizations through custom kernels, compiler/runtime improvements, and collaborating with partner teams to achieve state-of-the-art performance for foundation model builders and hyperscalers. The position also involves algorithmic innovation and co-designing TPU-friendly models.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · Mar 24 · AI score 9
Staff Software Engineer, GPU Performance
Staff Software Engineer focused on optimizing GPU performance for LLM training and serving within Google Cloud's AI infrastructure. This role involves identifying performance bottlenecks, running benchmarks, and implementing solutions at scale, with a strong emphasis on low-level GPU programming and compiler optimizations.
Serve · Engineering · Kirkland, WA +2 · yesterday · AI score 8
Senior Software Engineer, AI Core Capabilities
Senior Software Engineer focused on end-to-end delivery and optimization of on-device GenAI capabilities for Android, building developer-facing APIs and optimizing inference for Gemini Nano models.
Serve, Agent · Engineering · Mountain View, CA +1 · 2d ago · AI score 8
Senior Software Engineer, Sensor AI/ML, Watch Software
Senior Software Engineer focused on AI/ML for sensor fusion and gesture recognition on Google's Pixel Watch and Fitbit devices. The role involves designing, training, and optimizing AI models for resource-constrained, on-body devices, with a strong emphasis on real-time inference, low-power formats (TFLite Micro), and C/C++ development for embedded systems. This position bridges research and engineering, requiring expertise in model optimization and deployment on edge devices.
Serve, Post-train · Engineering · Mountain View, CA +1 · 2d ago · AI score 8
Staff Software Engineer, AI/ML GenAI, Google Cloud
Staff Software Engineer at Google Cloud focused on designing, developing, and deploying large-scale GenAI solutions. The role involves leading ML infrastructure optimization, guiding data preparation and model optimization strategies, and working with state-of-the-art GenAI techniques like LLMs and Large Vision Models. Requires significant experience in software development, ML infrastructure, and GenAI.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 5d ago · AI score 8
Software Engineering Manager II, AI/ML, Google Cloud Compute
Software Engineering Manager II for Google Cloud Compute, responsible for leading teams, setting technical vision, and overseeing the design and implementation of ML solutions, including ML infrastructure optimization and model development strategies. Requires strong software development and ML experience, with a focus on leadership and people management.
Serve, Data · Engineering · Kirkland, WA +3 · 1w ago · AI score 8
Software Engineer, AI/ML, Google Research
Software Engineer role at Google Research focusing on implementing ML solutions, utilizing ML infrastructure, and contributing to model optimization and data processing. Requires experience in specialized ML areas like speech/audio, reinforcement learning, or ML infrastructure, with a focus on model deployment, evaluation, optimization, and data processing.
Serve, Data · Engineering · Mountain View, CA +1 · 1w ago · AI score 8
Senior Silicon System and Software Integration Engineer, Google Cloud
This role focuses on the hardware-software integration and validation of AI/ML accelerators (TPUs) for Google Cloud. The engineer will work on ASIC development, validation, software, tools, and methodologies to ensure the functionality and performance of these custom silicon solutions that power Google's AI/ML applications.
Serve · Engineering · Sunnyvale, CA +1 · 1w ago · AI score 8
Software Engineering Manager II, AI/ML, Google Cloud
Software Engineering Manager II for Google Cloud, focusing on AI/ML. This role involves technical leadership, people management, setting team priorities, developing roadmaps, designing systems, and leading the implementation of ML solutions, including ML infrastructure optimization, model optimization, and data processing strategies. Requires significant experience in software development, ML infrastructure, and technical/people leadership.
Serve, Data · Engineering · Sunnyvale, CA +3 · 1w ago · AI score 8
Silicon System and Software Integration Engineer, TPU Cloud
This role focuses on the integration and validation of AI/ML hardware accelerators (TPUs) for Google's cloud infrastructure. The engineer will work on ASIC development, firmware, RTL, and software integration to ensure the functionality and performance of these chips, which power Google's AI/ML applications and services.
Serve · Engineering · Sunnyvale, CA +1 · 2w ago · AI score 8
Software Engineer III, AI/ML, Proxybidder ML
Software Engineer III on the Proxybidder ML team at Google, responsible for the full machine learning model lifecycle including design, training, deployment, and serving in production for Google Ads. The role involves innovating on model design, analyzing experiments, enhancing model health, and collaborating with research and infrastructure teams. Requires experience with Python, C++, mathematical modeling, and ML infrastructure, with a focus on low-latency production systems.
Serve, Post-train · Engineering · New York, NY +1 · 2w ago · AI score 8
Senior Engineering Manager AI Inference Platform, Distributed Cloud
Senior Engineering Manager for AI Inference Platform, Distributed Cloud. Role focuses on architecting and optimizing the serving stack for models like Gemini in an on-prem cloud environment, improving speed, efficiency, and cost-effectiveness. Responsibilities include leading a team, defining technical vision for the LLM serving stack, overseeing performance analysis and benchmarking, and driving the design/implementation of advanced serving architectures.
Serve · Engineering · Sunnyvale, CA +1 · 2w ago · AI score 8
Software Engineering Manager II, AI/ML GenAI, Google Cloud Compute
Software Engineering Manager II for Google Cloud Compute, focusing on AI/ML GenAI. This role involves technical leadership, team management, and guiding the design and optimization of GenAI solutions, ML infrastructure, and data/model strategies. The position requires significant experience in software development, ML infrastructure optimization, technical leadership, and GenAI techniques.
Serve, Post-train · Engineering · Kirkland, WA +1 · 2w ago · AI score 8
ML Chip/IP Architect, DeepMind
This role focuses on defining the top-level SoC architecture and chiplet strategy for next-generation Machine Learning (ML) accelerators. The individual will lead the architecture and design of the chip top-level, manage interfaces, clocking, power, and integration of IP blocks, and architect specific accelerator components. Collaboration with micro-architecture, physical design, systems, and software teams is crucial to ensure a feasible and optimal design meeting product requirements.
Serve · Engineering · Mountain View, CA +1 · 3w ago · AI score 8
Staff AI/ML Software Engineer, YouTube Ads Creative Foundational Infrastructure
Staff AI/ML Software Engineer for YouTube Ads Creative Foundational Infrastructure. This role involves architecting, scaling, and steering next-generation infrastructure for AI/ML applications, specifically focusing on creative generation and optimization. Responsibilities include defining the technical roadmap, designing distributed systems for GenAI and media processing, building experiment and learning infrastructure, and partnering with various teams to align infrastructure with business goals.
Serve, Agent · Engineering · Mountain View, CA +1 · 3w ago · AI score 8
Senior Design and Integration Engineer, Cloud TPU
The role focuses on the design, integration, and verification of Google's next-generation Tensor Processing Units (TPUs), which are custom-built accelerators for AI and machine learning workloads. The engineer will work on microarchitecture, digital logic design, and optimization for performance, power, and area, collaborating with cross-functional teams to deliver cutting-edge hardware for AI/ML applications.
Serve · Engineering · Sunnyvale, CA +1 · 3w ago · AI score 8
Senior Security Engineer, AI/ML, National Security, Public Sector
Senior Security Engineer focused on securing AI/ML infrastructure, particularly LLM deployments, for Google Public Sector. Responsibilities include architecting secure deployments, protecting model weights and data, mitigating AI-specific threats, and developing automated defenses. Requires experience with AI/ML development, infrastructure, containerization, and Python, along with a Top Secret/SCI security clearance.
Serve, Agent · Engineering · Washington, DC +2 · 3w ago · AI score 8
Senior Software Engineer, AI/ML GenAI, Google Cloud
Senior Software Engineer role focused on designing and implementing GenAI solutions within Google Cloud, leveraging ML infrastructure and evaluating different techniques. Requires experience in Python/C++, ML infrastructure, software design, and state-of-the-art GenAI techniques.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 3w ago · AI score 8
Senior Staff Software Engineer, AI/ML, Google Cloud
Senior Staff Software Engineer on the AI and Infrastructure team at Google Cloud, focusing on delivering AI and Infrastructure at scale. The role involves designing, developing, and deploying large-scale software solutions, providing technical leadership, and driving ML infrastructure optimization across multiple ML areas. Requires extensive experience in ML infrastructure, design, architecture, and specific ML fields like speech/audio or reinforcement learning.
Serve, Post-train · Engineering · Seattle, WA +1 · 3w ago · AI score 8
Customer Engineer IV, AI Infrastructure, Google Public Sector
Customer Engineer role focused on accelerating AI initiatives for Google Public Sector clients by owning the technical relationship with ML research teams, guiding them through solution design, accelerator selection, and ramping AI workloads onto Google's AI infrastructure. The role involves advising on hardware (GPU/TPU), ML frameworks, and model building techniques, acting as a hybrid technical and business advisor.
Serve, Post-train · Engineering · Reston, VA +2 · 4w ago · AI score 8
Senior Staff Software Engineer, TPU Performance
Senior Staff Software Engineer focused on optimizing ML training and serving performance on Google's TPUs. This role involves identifying and maintaining benchmarks, driving performance improvements through compiler/runtime optimizations and algorithmic innovations, and co-designing TPU-friendly models. Experience with ML infrastructure, speech/audio, or reinforcement learning is required.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 4w ago · AI score 8
Senior TPU RTL Design Engineer, Networking, Inter-Chip Interconnects
Senior engineer to design and develop RTL for Google's next-generation Tensor Processing Units (TPUs), focusing on inter-chip interconnects for AI and networking accelerators. This role involves microarchitecture, RTL design, implementation, and collaboration with system architects and verification teams to ensure high-performance, power-efficient silicon solutions for AI workloads.
Serve · Engineering · Sunnyvale, CA +1 · 4w ago · AI score 8
RTL Design Engineer, Machine Learning Accelerators
This role focuses on the RTL design of Machine Learning Accelerators (TPUs) for Google's AI/ML applications. The engineer will design and verify complex digital designs with a focus on TPU architecture and its integration within AI/ML-driven systems, contributing to custom silicon solutions.
Serve · Engineering · Sunnyvale, CA +1 · 4w ago · AI score 8
Staff Software Engineering, YouTube ML Efficiency
Staff Software Engineer focused on ML efficiency for YouTube's recommendation systems, working on optimizing models for next-gen TPUs, enabling new architectures and training procedures, and reducing complexity in the ML training and serving ecosystem through automation.
Serve, Data · Engineering · San Bruno, CA +1 · 4w ago · AI score 8
Software Engineering Manager II, AI/ML GenAI, Google Cloud
Software Engineering Manager II for Google Cloud's AI/ML GenAI team, focusing on leading teams to deliver AI and Infrastructure at scale. The role involves setting team priorities, developing technical vision and roadmaps, guiding system designs, and leading the design of GenAI solutions, optimizing ML infrastructure, and guiding data preparation and model optimization strategies. Requires significant experience in software development, ML infrastructure optimization, technical leadership, and GenAI techniques.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 4w ago · AI score 8
Staff Software Engineer, AI/ML, Google Public Sector
Staff Software Engineer at Google Public Sector focused on architecting and deploying large-scale distributed data systems and advanced machine learning pipelines, optimizing inference workloads for specialized hardware accelerators, and leading technical direction for complex production software systems. The role involves managing petabyte-scale data ingestion, optimizing numerical operations, and implementing data life-cycle policies.
Serve, Agent · Engineering · Reston, VA +2 · 5w ago · AI score 8
Software Engineer III, AI/ML GenAI, Google Cloud Performance
Software Engineer III role focused on implementing GenAI solutions within Google Cloud, utilizing ML infrastructure, and contributing to data preparation, optimization, and performance enhancements. Requires experience with core GenAI concepts and text, image, video, or audio generation.
Serve · Engineering · Mountain View, CA +1 · 5w ago · AI score 8
Staff Software Engineer, AI Engines 3P TPU Inference
Staff Software Engineer focused on AI Engines 3P TPU Inference, working on ML infrastructure, compilers, and runtimes for Google's AI models. The role involves developing and optimizing ML software infrastructure, partnering with research, and ensuring high performance and scalability for inference.
Serve · Engineering · Mountain View, CA +1 · 5w ago · AI score 8
Staff Software Engineer, AI/ML, Google Cloud AI
Staff Software Engineer at Google Cloud AI, focusing on developing and deploying large-scale AI/ML solutions for customer experience applications powered by Gemini Enterprise. The role involves technical leadership, optimizing ML infrastructure, and guiding model optimization and data processing strategies.
Serve, Data · Engineering · Sunnyvale, CA +1 · 5w ago · AI score 8
Senior Software Engineer, AI/ML, AI and Infrastructure
Senior Software Engineer role on the AI and Infrastructure team at Google, focusing on developing and scaling AI/ML capabilities and infrastructure. The role involves writing and testing product/system code, collaborating on design and code reviews, contributing to documentation, triaging and debugging issues, and designing/implementing ML solutions leveraging ML infrastructure. Requires experience in Python/C++, software development, ML infrastructure, and specialization in areas like speech/audio, reinforcement learning, or other ML fields.
Serve, Post-train · Engineering · Mountain View, CA +3 · 6w ago · AI score 8
Software Engineer III, AI/ML GenAI, Google Cloud Compute
Software Engineer III role focused on implementing GenAI solutions within Google Cloud Compute, utilizing ML infrastructure, and contributing to data preparation, optimization, and performance enhancements. Requires experience with core GenAI concepts like LLMs, Multi-Modal, and Large Vision Models, and generation across text, image, video, or audio.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 6w ago · AI score 8
Machine Learning Hardware Architect, TPU
This role focuses on architecting and defining specifications for next-generation high-performance computing systems, specifically TPUs, to accelerate AI/ML applications. The individual will collaborate with software teams to define AI workload requirements, perform architecture studies for performance and efficiency, and influence technical roadmaps for hardware-software platforms. The goal is to drive the evolution of AI hardware for large-scale systems and data centers.
Serve · Engineering · Sunnyvale, CA +1 · 6w ago · AI score 8
Staff Software Engineer, AI/ML GenAI, Google Cloud
Staff Software Engineer role at Google Cloud focused on designing, developing, and deploying large-scale GenAI solutions. The role involves technical leadership, optimizing ML infrastructure, guiding data preparation and model optimization, and working with state-of-the-art GenAI techniques like LLMs, Multi-Modal, and Large Vision Models. Requires significant software development experience and specific experience in ML design and GenAI techniques.
Serve, Post-train · Engineering · New York, NY +1 · 7w ago · AI score 8
Staff Software Engineer, GPU Performance
Staff Software Engineer focused on optimizing GPU performance for LLM training and serving within Google Cloud. This role involves benchmarking, performance analysis, and implementing solutions at scale, working with cutting-edge AI accelerators and low-level GPU programming.
Serve · Engineering · Sunnyvale, CA +3 · 7w ago · AI score 8
Senior Software Engineer, AI/ML GenAI, Google Cloud Compute Infrastructure
Senior Software Engineer role focused on designing and implementing GenAI solutions within Google Cloud Compute Infrastructure. The role involves leveraging ML infrastructure, evaluating techniques, and working with state-of-the-art GenAI models like LLMs and Large Vision Models. Requires strong programming skills (Python/C++), experience with ML infrastructure, and software design/architecture.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 7w ago · AI score 8
Software Engineering Manager II, AI/ML GenAI, Google Cloud AI
Software Engineering Manager II for Google Cloud AI GenAI team, focusing on leading engineering teams, setting technical vision, and guiding the design and optimization of GenAI solutions, ML infrastructure, and data preparation strategies. The role involves people management, technical leadership, and contributing to product strategy within the AI Research team that aims to push state-of-the-art AI and collaborate with product teams.
Serve, Post-train · Engineering · Sunnyvale, CA +1 · 8w ago · AI score 8
Staff Software Engineer, TPU, Performance
Staff Software Engineer focused on optimizing the performance of ML models (including Gemini and OSS models) on TPU systems for both JAX and PyTorch platforms. The role involves identifying and maintaining ML benchmarks, analyzing performance metrics, and collaborating with compiler and runtime teams to improve performance. It also includes engaging with product teams and researchers to solve performance problems for large-scale ML training and serving.
Serve · Engineering · Sunnyvale, CA +1 · 8w ago · AI score 8