Forward Deployed Engineer Ii, Genai, Google Cloud

Google · Big Tech · São Paulo, State of São Paulo, Brazil

Google Cloud is seeking a Generative AI Forward Deployed Engineer II to build and deploy agentic AI solutions within customer environments. This role involves transitioning prototypes to production-grade workflows, architecting integrations, building evaluation and observability pipelines, and providing feedback to product teams. The ideal candidate has experience with Python, ML packages, applied AI, RAG, fine-tuning, and cloud platforms, with preferred experience in multi-agent systems and LLM optimization.

What you'd actually do

Serve as a developer for AI applications, transitioning from rapid prototypes to production-grade agentic workflows (e.g., multi-agent systems, model context protocol (MCP) servers) that drive measurable Return on Investment (ROI).
Architect and code the "connective tissue" between Google’s AI products and customers' live infrastructure, including APIs, legacy data silos, and security perimeters as part of an expert team.
Build high-performance evaluation pipelines and observability frameworks to ensure agentic systems meet requirements for accuracy, safety, and latency.
Identify repeatable field patterns and friction points in Google’s AI stack, converting them into reusable modules or formal product feature requests for the Engineering teams.
Co-build with customer engineering teams to instill Google-grade development best practices, ensuring long-term project success and high end-user adoption.

Skills

Required

Python
Keras
PyTorch
HF Transformers
prompt engineering
fine-tuning
Retrieval-augmented generation (RAG)
orchestrating model interactions with external tools
Google Cloud Platform

Nice to have

LangGraph
CrewAI
Google’s Agent Development Kit (ADK)
ReAct
self-reflection
hierarchical delegation
LLM-native metrics
state management
granular tracing

What the JD emphasized

production-grade reality
production
production-grade agentic workflows
production-grade

Other signals

building agentic solutions
production-grade reality
customer environments
address blockers to production
integration complexities
data readiness issues
state-management issues
white-glove deployment of AI systems
feedback loop
product roadmap
developer for AI applications
rapid prototypes to production-grade agentic workflows
multi-agent systems
model context protocol (MCP) servers
measurable Return on Investment (ROI)
architect and code the connective tissue
Google's AI products and customers' live infrastructure
APIs, legacy data silos, and security perimeters
build high-performance evaluation pipelines
observability frameworks
agentic systems meet requirements for accuracy, safety, and latency
identify repeatable field patterns and friction points
reusable modules or formal product feature requests
co-build with customer engineering teams
Google-grade development best practices
long-term project success
high end-user adoption
Python and relevant machine learning packages
applied AI
building systems around pretrained models
prompt engineering
fine-tuning
Retrieval-augmented generation (RAG)
orchestrating model interactions with external tools
managing solutions on a Cloud Platform
implementing multi-agent systems using frameworks
patterns like ReAct, self-reflection, and hierarchical delegation
LLM-native metrics
optimizing state management
granular tracing

Read full job description

****

As a Generative AI Forward Deployed Engineer (FDE) at Google Cloud, you will be an embedded builder who bridges the gap between frontier AI products and production-grade reality within customer environments. Unlike traditional advisory roles, you will function as an innovator-builder moving beyond high-level architecture to code, debug, and jointly ship bespoke agentic solutions directly within the customer’s environment.

In this role, you will address blockers to production, including solving the integration complexities, data readiness issues, and state-management issues that prevent AI from reaching enterprise-grade maturity. By embedding with accounts, you will serve a dual purpose providing white-glove deployment of AI systems and acting as a critical feedback loop, transforming real-world field insights into Google Cloud’s future product roadmap.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

Serve as a developer for AI applications, transitioning from rapid prototypes to production-grade agentic workflows (e.g., multi-agent systems, model context protocol (MCP) servers) that drive measurable Return on Investment (ROI).
Architect and code the "connective tissue" between Google’s AI products and customers' live infrastructure, including APIs, legacy data silos, and security perimeters as part of an expert team.
Build high-performance evaluation pipelines and observability frameworks to ensure agentic systems meet requirements for accuracy, safety, and latency.
Identify repeatable field patterns and friction points in Google’s AI stack, converting them into reusable modules or formal product feature requests for the Engineering teams.
Co-build with customer engineering teams to instill Google-grade development best practices, ensuring long-term project success and high end-user adoption.

Qualifications

Minimum qualifications:

Bachelor's degree in Science, Technology, Engineering, Mathematics, or equivalent practical experience.
3 years of experience in Python and relevant machine learning packages (e.g., Keras, PyTorch, HF Transformers).
Experience in applied AI, with a focus on building systems around pretrained models (e.g., prompt engineering, fine-tuning, Retrieval-augmented generation (RAG), orchestrating model interactions with external tools to deliver solutions).
Experience managing solutions on a Cloud Platform (e.g., Google Cloud Platform).

Preferred qualifications:

Master’s degree or PhD in AI, Computer Science, or a related technical field.
Experience implementing multi-agent systems using frameworks (e.g., LangGraph, CrewAI, or Google’s Agent Development Kit (ADK)) and patterns like ReAct, self-reflection, and hierarchical delegation.
Knowledge of (Large Language Model) LLM-native metrics (e.g., tokens/sec, cost-per-request) and techniques for optimizing state management and granular tracing.