Senior Staff Software Engineer, ML Infrastructure, Agents Infrastructure

Google · Big Tech · Sunnyvale, CA +1

Senior Staff Software Engineer focused on building and scaling ML infrastructure for conversational agents in an enterprise AI context. The role involves designing and implementing back-end services, agent APIs, and infrastructure for autonomous tool use and multi-step goal achievement. It also includes developing analysis systems for massive data streams and robust monitoring and explainability frameworks, in collaboration with Google DeepMind and Vertex AI.

What you'd actually do

  1. Lead the design and implementation of back-end services and the Agent API, abstracting complex Google Cloud Platform (GCP) infrastructure into a modern, asynchronous, event-driven framework.
  2. Build the infrastructure for seamless communication and autonomous tool-use, enabling specialized agents to interact with APIs and databases to solve complex, multi-step goals.
  3. Develop "always-on" analysis systems that ingest and reason over massive data streams from hundreds of disparate sources, including Enterprise Resource Planning (ERP) systems, Customer Relationship Management (CRM) systems, and unstructured documents.
  4. Implement robust frameworks for monitoring, tracing, and explainability to ensure all agentic actions are transparent, auditable, and reliable for enterprise customers.
  5. Partner with Vertex AI and Google DeepMind to leverage, integrate, and influence the future roadmap of Google’s core agent frameworks and AI models.

Skills

Required

  • software development
  • technical project strategy
  • ML design
  • ML infrastructure
  • model deployment
  • model evaluation
  • data processing
  • debugging
  • fine tuning
  • design and architecture
  • testing/launching software products
  • GenAI techniques
  • LLMs
  • Multi-Modal
  • Large Vision Models
  • language modeling
  • computer vision
  • back-end services
  • Agent API
  • GCP infrastructure
  • asynchronous, event-driven framework
  • tool-use
  • APIs
  • databases
  • analysis systems
  • data streams
  • monitoring
  • tracing
  • explainability
  • Vertex AI
  • Google DeepMind

Nice to have

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field
  • data structures and algorithms
  • technical leadership role
  • complex, matrixed organization
  • cross-functional, or cross-business projects

What the JD emphasized

  • 8 years of experience in software development
  • 7 years of experience leading technical project strategy, ML design, and optimizing industry-scale ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning)
  • 5 years of experience with design and architecture, and with testing and launching software products
  • 2 years of experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision)
  • massive data streams
  • complex, multi-step goals
  • autonomous tool-use
  • agentic actions
  • core agent frameworks

Other signals

  • building agentic systems
  • large scale
  • enterprise customers