Software Development Manager - Compiler, Aws Neuron, Annapurna Labs

Amazon Amazon · Big Tech · Cupertino, CA · Software Development

Software Development Manager for the NKI Compiler Team at Amazon Annapurna Labs, focusing on developing and optimizing a new compiler for AWS custom ML hardware (Inferentia and Trainium chips). The role involves leading a team of compiler engineers to deploy compiler optimization algorithms and a new compiler at scale, targeting AWS ML accelerators. The team works on the AWS Neuron SDK, including an ML compiler and runtime, integrating with frameworks like PyTorch and TensorFlow. The role requires strong leadership, technical expertise in compilers, and collaboration with AWS ML services teams.

What you'd actually do

  1. leading a team of experienced compiler engineers developing compiler optimization algorithms and deploying, at scale, a new compiler targeting AWS custom hardware.
  2. You’ll leverage your technical communications skills as a partner with AWS ML services teams, pre-silicon design, and bringing new products/features to market.
  3. You have knowledge of resource management, scheduling, code generation, optimization, and instruction architectures including CPU, NPU, GPU and novel forms of compute.
  4. 5+ years of engineering team management experience
  5. 9+ years of working directly within engineering teams experience

Skills

Required

  • engineering team management experience
  • working directly within engineering teams experience
  • designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
  • Experience partnering with product or program management teams
  • Understanding of compilers (resource management, instruction scheduling, code generation, and compute graph optimization)
  • Strong software design fundamentals
  • excellent system-level coding skills

Nice to have

  • M.S. or Ph.D. in Computer Science or related technical field

What the JD emphasized

  • compiler optimization algorithms
  • deploying, at scale
  • new compiler
  • AWS custom hardware
  • resource management
  • scheduling
  • code generation
  • optimization
  • instruction architectures