Forward Deployed Engineer, Genai, Google Cloud (japanese, English)

Google Google · Big Tech · Tokyo, Japan

This role involves deploying and integrating AI systems, specifically agentic workflows, into customer infrastructure on Google Cloud. The engineer will manage integration and data readiness issues, build evaluation pipelines and observability frameworks, and act as a feedback loop to influence the product roadmap. The role requires experience with applied AI, pre-trained models, RAG, and cloud platforms, with a focus on transitioning prototypes to production-grade solutions.

What you'd actually do

  1. Serve as a developer for Artificial Intelligence (AI) applications, transitioning from prototypes to production-grade agentic workflows (e.g., multi-agent systems, Model Context Protocol (MCP) servers) that drive Return on Investment (ROI).
  2. Architect and code the connection between Google’s AI products and customer's live infrastructure, including Application Programming Interfaces (APIs), legacy data silos, and security perimeters as part of a team.
  3. Build evaluation pipelines and observability frameworks to ensure agentic systems meet requirements for accuracy, safety and latency.
  4. Identify repeatable field patterns and friction points in Google’s AI stack, converting them into reusable modules or formal product feature requests for the Engineering teams.
  5. Co-build with customer engineering teams to instill Google-grade development best practices, ensuring project success and end-user adoption.

Skills

Required

  • Python
  • machine learning package (e.g., Keras, PyTorch, HF Transformers)
  • applied AI
  • building systems around pre-trained models
  • prompt engineering
  • fine-tuning
  • Retrieval-Augmented Generation (RAG)
  • orchestrating model interactions with external tools
  • architecting solutions on a Cloud Platform (e.g., Google Cloud Platform)
  • deploying solutions on a Cloud Platform (e.g., Google Cloud Platform)
  • managing solutions on a Cloud Platform (e.g., Google Cloud Platform)
  • Japanese
  • English

Nice to have

  • implementing multi-agent systems using frameworks (e.g., LangGraph, CrewAI, or Google’s ADK)
  • patterns like ReAct, self-reflection, and hierarchical delegation
  • Large Language Model (LLM)-native metrics (e.g., tokens/sec, cost-per-request)
  • techniques for optimizing state management
  • granular tracing

What the JD emphasized

  • production-grade agentic workflows
  • agentic systems
  • production-grade reality
  • enterprise-grade maturity
  • customer's live infrastructure
  • accuracy, safety and latency

Other signals

  • Deploying AI systems
  • Integrating AI products into customer infrastructure
  • Building agentic workflows
  • Feedback loop from field to product roadmap