Tech Lead, Google Kubernetes Engine AI Platform

Google Google · Big Tech · Seattle, WA +1

Tech Lead for Google Kubernetes Engine (GKE) AI Platform, focusing on managing containerized AI/ML workloads on GPU/TPU infrastructure using Kubernetes. The role involves driving innovation in reliability, efficiency, and scale of AI infrastructure, engaging with customers, and leading technical direction for workload optimization.

What you'd actually do

  1. Act as an AI Platform TL, driving innovation on GKE AI/ML infra reliability, efficiency and scale.
  2. Engage with Megawhale customers to ensure their success/growth on GKE/Google Cloud Platform (GCP).
  3. Identify gaps and drive improvement across entire GKE/Google Compute Engine (GCE) stack.
  4. Help shape the culture of the team to be a high executing team that is fun to work with.
  5. Lead the technical goal for GKE AI/ML workload efficiency and optimization, setting the direction.

Skills

Required

  • software development
  • cloud computing
  • operating systems
  • distributed systems
  • data analytics
  • applied ML
  • AI infrastructure management
  • orchestration
  • machine learning infrastructure
  • large-scale distributed systems
  • Cloud
  • problem-solving
  • code tuning

Nice to have

  • Master's degree or PhD in Computer Science, Machine Learning, or a related field
  • model tuning

What the JD emphasized

  • AI infrastructure
  • GPU/TPU
  • Kubernetes
  • ML workload efficiency
  • optimization

Other signals

  • AI infrastructure
  • Kubernetes
  • containerized workloads
  • ML workload efficiency
  • optimization