Senior Software Engineer, Ai, Tpu, Infrastructure

Google Google · Big Tech · Taipei, Taiwan

The role focuses on designing and implementing software for AI infrastructure, specifically connecting Tensor Processing Unit (TPU) clusters and cloud systems. It involves building and integrating cloud compute software to support AI development and deployment at scale.

What you'd actually do

  1. Design and implement software to bootstrap compute or storage components, including Operating System (OS) image deployment and node-specific configuration.
  2. Build and integrate cloud compute software to establish an Artificial Intelligence (AI) infrastructure connecting Tensor Processing Unit (TPU) clusters and cloud systems.
  3. Partner with TPU hardware and software teams to design, develop, and validate systems for standalone deployment.
  4. Collaborate with cross-functional leadership to drive quality project delivery and shape the future of Machine Learning (ML) infrastructure.
  5. Provide technical guidance and mentorship to engineers, promoting professional growth and team development.

Skills

Required

  • software development
  • programming languages
  • software design
  • software architecture
  • OS image deployment
  • node-specific configuration

Nice to have

  • C++
  • cloud compute technologies
  • data center compute technologies
  • AI infrastructure software
  • reference OS
  • storage systems
  • machine learning
  • hardware and software stacks
  • technical leadership
  • cross-functional teams
  • communication skills
  • problem-solving skills
  • collaborative environments

What the JD emphasized

  • AI infrastructure
  • ML infrastructure

Other signals

  • AI infrastructure
  • TPU clusters
  • ML infrastructure