Forward Deployed Engineer Ii, Genai, Google Cloud

Google Google · Big Tech · São Paulo, State of São Paulo, Brazil

Google Cloud is seeking a Generative AI Forward Deployed Engineer II to build and deploy agentic AI solutions within customer environments. This role involves transitioning prototypes to production-grade workflows, architecting integrations, building evaluation and observability pipelines, and providing feedback to product teams. The ideal candidate has experience with Python, ML packages, applied AI, RAG, fine-tuning, and cloud platforms, with preferred experience in multi-agent systems and LLM optimization.

What you'd actually do

  1. Serve as a developer for AI applications, transitioning from rapid prototypes to production-grade agentic workflows (e.g., multi-agent systems, model context protocol (MCP) servers) that drive measurable Return on Investment (ROI).
  2. Architect and code the "connective tissue" between Google’s AI products and customers' live infrastructure, including APIs, legacy data silos, and security perimeters as part of an expert team.
  3. Build high-performance evaluation pipelines and observability frameworks to ensure agentic systems meet requirements for accuracy, safety, and latency.
  4. Identify repeatable field patterns and friction points in Google’s AI stack, converting them into reusable modules or formal product feature requests for the Engineering teams.
  5. Co-build with customer engineering teams to instill Google-grade development best practices, ensuring long-term project success and high end-user adoption.

Skills

Required

  • Python
  • Keras
  • PyTorch
  • HF Transformers
  • prompt engineering
  • fine-tuning
  • Retrieval-augmented generation (RAG)
  • orchestrating model interactions with external tools
  • Google Cloud Platform

Nice to have

  • LangGraph
  • CrewAI
  • Google’s Agent Development Kit (ADK)
  • ReAct
  • self-reflection
  • hierarchical delegation
  • LLM-native metrics
  • state management
  • granular tracing

What the JD emphasized

  • production-grade reality
  • production
  • production-grade agentic workflows
  • production-grade

Other signals

  • building agentic solutions
  • production-grade reality
  • customer environments
  • address blockers to production
  • integration complexities
  • data readiness issues
  • state-management issues
  • white-glove deployment of AI systems
  • feedback loop
  • product roadmap
  • developer for AI applications
  • rapid prototypes to production-grade agentic workflows
  • multi-agent systems
  • model context protocol (MCP) servers
  • measurable Return on Investment (ROI)
  • architect and code the connective tissue
  • Google's AI products and customers' live infrastructure
  • APIs, legacy data silos, and security perimeters
  • build high-performance evaluation pipelines
  • observability frameworks
  • agentic systems meet requirements for accuracy, safety, and latency
  • identify repeatable field patterns and friction points
  • reusable modules or formal product feature requests
  • co-build with customer engineering teams
  • Google-grade development best practices
  • long-term project success
  • high end-user adoption
  • Python and relevant machine learning packages
  • applied AI
  • building systems around pretrained models
  • prompt engineering
  • fine-tuning
  • Retrieval-augmented generation (RAG)
  • orchestrating model interactions with external tools
  • managing solutions on a Cloud Platform
  • implementing multi-agent systems using frameworks
  • patterns like ReAct, self-reflection, and hierarchical delegation
  • LLM-native metrics
  • optimizing state management
  • granular tracing