Staff Software Developer, ML Infrastructure, Core Infra

Google Google · Big Tech · Waterloo, ON +1

Staff Software Developer focused on ML Infrastructure for Core Infra, specifically for conversational AI agents. The role involves driving technical direction for data systems, implementing scalable data solutions on GCP, and leading critical infrastructure investments to support Generative AI development.

What you'd actually do

  1. Drive the technical direction and multi-year road map for conversational agent data systems, collaborating with Principal Engineers to align infrastructure capabilities with the evolution of Generative AI.
  2. Implement scalable, high-performance data solutions using Spanner and Google Cloud Platform (GCP), balancing the demands of real-time runtime storage with the complexity of massive-scale offline analytics.
  3. Act as the primary technical liaison for cross-functional partner teams, clarifying project scope and setting the direction for both the producers and consumers of core product data.
  4. Advocate a culture of production excellence by leading high-stakes design and code reviews across cross-site organizations.
  5. Lead critical infrastructure investments that address scalability bottlenecks, technical debt, and operations, guiding the team through complex migrations in a high-velocity environment.

Skills

Required

  • Software development
  • Software design and architecture
  • Data systems
  • ML infrastructure
  • Spanner
  • Google Cloud Platform (GCP)
  • Speech/audio
  • Reinforcement learning
  • Model deployment
  • Model evaluation
  • Data processing
  • Debugging
  • Fine tuning

Nice to have

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field
  • Technical leadership
  • Complex, matrixed organization experience

What the JD emphasized

  • 8 years of experience in software development
  • 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture
  • 5 years of experience with one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field.
  • 5 years of experience with ML design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).

Other signals

  • Generative AI
  • conversational AI
  • data systems
  • infrastructure
  • ML infrastructure