Tech Lead, Google Kubernetes Engine AI Platform

Google · Big Tech · Seattle, WA +1

Tech Lead for Google Kubernetes Engine (GKE) AI Platform, focusing on managing containerized AI/ML workloads on GPU/TPU infrastructure using Kubernetes. The role involves driving innovation in reliability, efficiency, and scale of AI infrastructure, engaging with customers, and leading technical direction for ML workload efficiency and optimization.

What you'd actually do

  1. Act as an AI Platform Tech Lead (TL), driving innovation in the reliability, efficiency, and scale of GKE AI/ML infrastructure.
  2. Engage with Megawhale customers to ensure their success/growth on GKE/Google Cloud Platform (GCP).
  3. Identify gaps and drive improvements across the entire GKE/Google Compute Engine (GCE) stack.
  4. Help shape the team's culture so that it is a high-executing team that is fun to work with.
  5. Set the technical direction and goals for GKE AI/ML workload efficiency and optimization.

Skills

Required

  • software development
  • cloud computing
  • operating systems
  • Kubernetes
  • GKE
  • AI infrastructure management
  • distributed systems

Nice to have

  • Machine Learning
  • data analytics
  • applied ML
  • GPU/TPU management
  • large-scale distributed systems
  • code and model tuning skills

What the JD emphasized

  • AI infrastructure
  • ML workload efficiency
  • optimization

Other signals

  • AI infrastructure
  • Kubernetes
  • GKE
  • ML workload efficiency
  • optimization