Forward Deployed Engineer, Genai, Google Cloud

Google Google · Big Tech · Mexico City, CDMX, Mexico

This role involves building and deploying production-grade generative AI agentic solutions within customer environments on Google Cloud. It focuses on integrating Google's AI products with customer infrastructure, developing evaluation pipelines, and providing feedback to product teams. The role requires experience in applied AI, building systems around pre-trained models, and cloud platform deployment.

What you'd actually do

  1. Serve as a developer for AI applications, transitioning from rapid prototypes to production-grade agentic workflows (e.g., multi-agent systems, Model Context Protocol (MCP) servers) that drive measurable Return on Investment (ROI).
  2. Architect and engineer the "connective tissue" linking Google’s AI products to customers' live infrastructure, including APIs, legacy data silos, and security perimeters as part of an expert team.
  3. Build high-performance evaluation pipelines and observability frameworks to ensure agentic systems meet requirements for accuracy, safety, and latency.
  4. Identify recurring field patterns and friction points across Google’s AI stack, converting them into reusable modules or formal product feature requests for the Engineering teams.
  5. Collaborate with customer engineering teams to instill Google-grade development best practices, ensuring long-term project success and high end-user adoption.

Skills

Required

  • Python
  • Typescript
  • applied AI
  • building systems around pretrained models
  • prompt engineering
  • fine-tuning
  • Retrieval-augmented generation (RAG)
  • orchestrating model interactions with external tools
  • architecting solutions on a Cloud Platform
  • deploying solutions on a Cloud Platform
  • managing solutions on a Cloud Platform

Nice to have

  • Master’s degree or PhD in AI, Computer Science, or a related technical field
  • implementing multi-agent systems
  • LangGraph
  • CrewAI
  • Google’s ADK
  • ReAct
  • self-reflection
  • hierarchical delegation
  • LLM-native metrics
  • tokens/sec
  • cost-per-request
  • optimizing state management
  • granular tracing

What the JD emphasized

  • production-grade reality
  • production-grade agentic workflows
  • production-grade AI-driven solutions
  • production
  • production roadmap

Other signals

  • building bespoke agentic solutions
  • customer environments
  • feedback loop
  • production-grade reality