Forward Deployed Engineer, Generative Ai, Google Cloud (korean, English)

Google Google · Big Tech · Seoul, South Korea

This role focuses on deploying and building generative AI solutions, specifically agentic systems, within customer environments on Google Cloud. The engineer will bridge the gap between frontier AI products and production reality, coding, debugging, and shipping bespoke solutions. Responsibilities include architecting integration, building evaluation pipelines, and providing feedback to product teams. The role requires experience with Python, cloud platforms, RAG, vector databases, and ideally multi-agent systems.

What you'd actually do

  1. Serve as a developer for complex AI applications, transitioning from rapid prototypes to production-grade agentic workflows (e.g., multi-agent systems, MCP servers) that drive measurable Return on Investment (ROI).
  2. Architect and code the "connective tissue" between Google’s AI products and customer's live infrastructure, including APIs, legacy data silos, and security perimeters as part of an expert team.
  3. Build high-performance evaluation pipelines and observability frameworks to ensure agentic systems meet requirements for accuracy, safety, and latency.
  4. Identify repeatable field patterns and friction points in Google’s AI stack, converting them into reusable modules or formal product feature requests for the Engineering teams.
  5. Co-build with customer engineering teams to instill Google-grade development best practices, ensuring long-term project success and high end-user adoption.

Skills

Required

  • software development using Python
  • architecting AI systems on cloud platforms (e.g., GCP)
  • building pipelines for structured and unstructured data
  • vector databases
  • RAG-like architectures
  • Korean and English fluency

Nice to have

  • Master’s degree or PhD in AI, Computer Science, or a related technical field
  • implementing multi-agent systems using frameworks (e.g., LangGraph, CrewAI, ADK)
  • complex patterns (e.g., ReAct, self-reflection, hierarchical delegation)
  • leading technical discovery sessions
  • LLM-native metrics (e.g., tokens/sec, cost-per-request)
  • techniques for optimizing state management and granular tracing

What the JD emphasized

  • production-grade reality
  • code, debug, and jointly ship bespoke agentic solutions
  • address blockers to production
  • solving the integration complexities
  • data readiness issues
  • state-management issues
  • white glove" deployment of complex AI systems
  • critical feedback loop
  • production-grade AI-driven solutions
  • architecting AI systems on cloud platforms
  • power enterprise AI solutions
  • implementing multi-agent systems
  • optimizing state management
  • granular tracing

Other signals

  • customer-facing AI deployment
  • building agentic systems
  • feedback loop to product
  • production-grade AI solutions