Forward Deployed Engineer, Generative AI (genmedia), Google Cloud

Google Google · Big Tech · New York, NY +3

Forward Deployed Engineer for Generative AI in the Media & Entertainment sector on Google Cloud. This role involves building, debugging, and shipping bespoke agentic solutions directly within customer environments, bridging the gap between frontier AI products and production reality. Responsibilities include developing complex AI applications, architecting integrations, building evaluation pipelines, and providing feedback to product teams. Requires experience in production-grade AI solutions, cloud platforms, data pipelines with vector databases and RAG, and leading technical discovery.

What you'd actually do

  1. Serve as a developer for complex AI applications, transitioning from rapid prototypes to production-grade agentic workflows (e.g., multi-agent systems, MCP servers) that drive measurable ROI.
  2. Architect and code the "connective tissue" between Google’s AI products and customer's live infrastructure, including APIs, legacy data silos, and security perimeters as part of an expert team.
  3. Build high-performance evaluation pipelines and observability frameworks to ensure agentic systems meet rigorous requirements for accuracy, safety, and latency.
  4. Identify repeatable field patterns and friction points in Google’s AI stack, converting them into reusable modules or formal product feature requests for engineering teams.
  5. Be able to co-build with customer engineering teams to instill Google-grade development best practices, ensuring long-term project success and high end-user adoption.

Skills

Required

  • Python
  • AI systems architecture
  • cloud platforms (GCP)
  • vector databases
  • RAG architectures
  • technical discovery sessions
  • production-grade AI solutions

Nice to have

  • Media & Entertainment use cases
  • multi-agent systems frameworks (LangGraph, CrewAI, ADK)
  • ReAct
  • self-reflection
  • hierarchical delegation
  • LLM-native metrics
  • state management optimization
  • granular tracing
  • MCP
  • tool-calling
  • OAuth-based authentication

What the JD emphasized

  • production-grade reality
  • code, debug, and jointly ship bespoke agentic solutions
  • address blockers to production
  • solving the integration complexities, data readiness issues, and state-management challenges
  • critical feedback loop
  • transforming real-world field insights into Google Cloud’s future product roadmap
  • production-grade AI-driven solutions from conception to launch to customers
  • building pipelines for structured and unstructured data using both vector databases and RAG-like architectures
  • implementing multi-agent systems using frameworks
  • complex patterns (e.g., ReAct, self-reflection, hierarchical delegation)
  • LLM-native" metrics
  • optimizing state management and granular tracing
  • implement secure agentic workflows incorporating MCP, tool-calling, and OAuth-based authentication

Other signals

  • customer-facing AI solutions
  • production-grade agentic workflows
  • feedback loop to product roadmap
  • Vertex AI platform
  • Gemini models