Staff Software Engineer, Load Balancing and Agent Gateway

Google Google · Big Tech · Warsaw, Poland

Staff Software Engineer role focused on designing and leading feature development for the GKE Agent Gateway, enhancing capabilities for AI/ML workloads and contributing to the Kubernetes open-source project. The role involves collaboration with Load Balancing teams, ensuring optimal routing, load balancing, and autoscaling for inference serving, and improving observability, testability, and reliability of the Inference Gateway.

What you'd actually do

  1. Design, develop, test, and maintain features for the GKE Agent Gateway, enhancing its capabilities for AI/ML workloads and Contribute to the GKE Gateway controller OSS community as a lead on Inference.
  2. Collaborate with Load Balancing development teams on requirements and integrations with Google Cloud Load Balancers (e.g., Envoy features and Service Extensions).
  3. Engage with and contribute to the upstream OSS Kubernetes Gateway API project.
  4. Work with AI/ML platform teams to ensure optimal routing, load balancing, and autoscaling for inference serving.
  5. Develop solutions that span both the Kubernetes control plane and the load balancing data plane and constantly work on improving observability, testability, and reliability of the Inference Gateway.

Skills

Required

  • C++
  • Java
  • Python
  • Kotlin
  • Go
  • networking protocols
  • software design and architecture
  • technical leadership

Nice to have

  • Kubernetes
  • data structures/algorithms
  • complex, matrixed organization
  • cross-functional, or cross-business projects

What the JD emphasized

  • AI workloads
  • inference serving
  • Kubernetes Gateway API

Other signals

  • AI workloads
  • inference serving
  • Kubernetes Gateway API