Staff Software Engineer, ML, Compute Platform

Google · Big Tech · Sunnyvale, CA +1

Staff Software Engineer role focused on developing embedded software, diagnostic tests, and tools for pre- and post-silicon validation of in-house SoCs, ML systems (TPU/GPU), and compute servers. The role involves system health verification, performance characterization, and solving reliability problems for Google's global data centers; this work supports hardware reliability and informs decisions about hardware/software design and deployment. It is part of the AI and Infrastructure team, which supports Google services including compute, search, machine learning, and image processing.

What you'd actually do

  1. Provide technical leadership on high-impact projects.
  2. Influence and coach a distributed team of engineers.
  3. Facilitate alignment and clarity across teams on goals, outcomes, and timelines.
  4. Manage project priorities, deadlines, and deliverables.
  5. Design, develop, test, deploy, maintain, and enhance large-scale software solutions.

Skills

Required

  • software development
  • testing
  • launching software products
  • embedded operating systems
  • software design
  • software architecture
  • embedded systems
  • ARM processor
  • Linux

Nice to have

  • data structures
  • algorithms
  • technical leadership
  • diagnostics
  • networking
  • data analysis

What the JD emphasized

  • ML Systems (TPU/GPU)
  • compute servers
  • Google Data Centers