Staff Swe, Compiler Architect, System Performance Modeling

Google Google · Big Tech · Sunnyvale, CA +3

Staff SWE role focused on architecting and owning the accuracy and fidelity of a critical co-design simulation platform for next-generation ML accelerators. The role involves establishing correlation infrastructure between simulated and physical hardware, evolving the simulation layer for complex workloads like LLMs, and identifying system-level bottlenecks at the pre-silicon stage to provide performance estimates for future ML systems.

What you'd actually do

  1. Establish and maintain high-confidence correlation infrastructure between simulated performance and physical hardware measurements (silicon).
  2. Architect and evolve the simulation layer to support deep exploration of complex, business-critical workloads (e.g., large language models, advanced kernels) and future system topologies.
  3. Identify and solve system-level hardware/software bottlenecks and optimization opportunities at the critical pre-silicon stage.
  4. Provide high-confidence lower-bound performance estimates for future ML systems and architectures.

Skills

Required

  • C++
  • Python
  • software products testing and launching
  • performance analysis
  • large-scale systems data analysis
  • visualization tools
  • debugging
  • software design and architecture

Nice to have

  • hardware/software co-design
  • performance analysis at pre-silicon stage
  • ML system architectures
  • compilers
  • Intermediate Representations (IRs)
  • hardware accelerators
  • enabling and optimizing large-scale ML models
  • technical strategy for complex systems
  • influencing simulation toolchains and hardware roadmaps
  • constructing custom IR dialects
  • open-source compiler frameworks (MLIR, XLA)
  • system level analysis
  • software-hardware mapping opportunities
  • architecting high-confidence, high-velocity system performance modeling
  • correlation infrastructure

What the JD emphasized

  • critical co-design simulation platform
  • pre-silicon stage
  • performance modeling and correlation infrastructure

Other signals

  • performance modeling
  • simulation technologies
  • ML accelerators
  • HW-SW co-design
  • system architecture roadmaps