Software Development Manager - ML Performance Tooling and Benchmarking, Aws Neuron, Annapurna Labs

Amazon Amazon · Big Tech · CA, ON +1 · Software Development

Manager III leading a team of compiler engineers to develop, deploy, and scale a compiler targeting AWS Inferentia and Trainium ML accelerators. The role involves technical leadership, innovation, and collaboration with AWS ML services teams to ensure the Neuron SDK meets customer needs for high performance, low cost, and ease of use. Deep knowledge of resource management, scheduling, code generation, and optimization is required.

What you'd actually do

  1. leading a team of compiler engineers through developing, deploying, and scaling a compiler targeting AWS Inferentia and Trainium
  2. innovating on behalf of our customers
  3. leveraging your vision and technical communication skills as a hands-on partner to AWS ML services teams
  4. be involved in pre-silicon design, bring new products/optimizations/features to market
  5. ensure the Neuron SDK exceeds our customers' needs of high performance, low cost, and ease of use

Skills

Required

  • Compiler engineering
  • Resource management
  • Scheduling
  • Code generation
  • Optimization
  • New instruction architectures (CPU, NPU, GPU, novel compute)
  • Technical leadership
  • Technical communication
  • People management

Nice to have

  • PyTorch
  • TensorFlow
  • MxNet
  • ML frameworks
  • Pre-silicon design

What the JD emphasized

  • leading a team of compiler engineers
  • developing, deploying, and scaling a compiler
  • AWS Inferentia and Trainium
  • technically capable, credible and curious
  • innovating on behalf of our customers
  • high performance, low cost, and ease of use

Other signals

  • ML accelerators
  • Inferentia chip
  • Trainium chip
  • AWS Neuron SDK
  • ML compiler
  • runtime
  • optimizes performance
  • PyTorch
  • TensorFlow
  • MxNet