Senior Software Engineer, ML Compilers, Edge Tpu

Google Google · Big Tech · Mountain View, CA +2

Senior Software Engineer on the EdgeTPU Compiler team responsible for building next-generation compiler optimizations to power machine learning (ML) models on custom hardware. This role involves translating framework-level code (JAX, PyTorch) into efficient instructions for the EdgeTPU, owning end-to-end features, triaging issues, implementing optimizations, and collaborating with model owners and stakeholders to accelerate ML deployment on edge devices.

What you'd actually do

  1. Design and build ML compiler optimizations for EdgeTPU hardware, and extend leading authoring frameworks (such as JAX and PyTorch) to enable seamless, high-performance compilation.
  2. Triage, root-cause, and resolve complex correctness and performance issues encountered when deploying state-of-the-art ML models on EdgeTPU.
  3. Propose, design, and implement robust compiler features and fixes to systematically address performance bottlenecks and hardware limitations.
  4. Partner closely with ML model owners to influence model architectures, ensuring they are designed for optimal, efficient execution on EdgeTPU systems.
  5. Own project execution from end to end, collaborating with cross-functional partners and stakeholders, and managing priorities/deadlines/deliverables for key compiler feature areas.

Skills

Required

  • software development
  • C++
  • software testing
  • software maintenance
  • software launching
  • software design
  • software architecture
  • compilers
  • optimization
  • parallelization

Nice to have

  • compiler development
  • accelerator-based architectures
  • low-level software
  • CPU
  • GPU
  • TPU
  • optimizing ML model inference on device

What the JD emphasized

  • ML compiler optimizations
  • EdgeTPU hardware
  • ML models
  • compiler optimizations
  • performance
  • hardware limitations
  • optimal, efficient execution

Other signals

  • ML Compilers
  • Edge TPU
  • hardware execution
  • ML models
  • compiler optimizations