Senior GPU Software Engineer, System Software, AI Infrastructure

Google Google · Big Tech · Taipei, Taiwan

This role focuses on developing and maintaining the system software stack for GPUs, which are critical for powering various Google services including AI infrastructure. The engineer will provide technical leadership, drive integration of next-generation GPU accelerators, and ensure the reliability and performance of these systems within Google's data centers. While the role supports AI infrastructure, the core craft is in GPU system software engineering, not direct AI/ML model development.

What you'd actually do

  1. Design, develop and maintain the system software stack for Graphics Processing Unit (GPU) system software.
  2. Provide technical leadership to help formulate and drive software development plans.
  3. Help identify dependencies in cross-functional teams and drive New Product Introduction (NPI) execution with a focus on development velocity and quality.
  4. Set technical direction and priorities for a small number of adjacent engineers within and across teams, recommend solutions that consider both short and long-term goals.
  5. Drive System Software integration to enable next generation GPU Accelerators for Google Data Center.

Skills

Required

  • software development
  • software design and architecture
  • embedded operating systems
  • testing, maintaining, or launching software products

Nice to have

  • designing and developing device drivers for peripherals such as GPUs, Switches, and connectivity buses such as USB, PCIe, etc.
  • large-scale and secure fleet management of servers and components
  • software or hardware integration, system bring-up, and diagnostics
  • modularized servers of high performance and reliability
  • build tools to enable automation of tests to ship high quality systems

What the JD emphasized

  • GPU system software
  • System Software integration
  • GPU Accelerators