Sr. Machine Learning - Compiler Engineer III, AWS Neuron, Annapurna Labs

Amazon · Big Tech · Cupertino, CA · Software Development

This is a Sr. Machine Learning Compiler Engineer III role on the AWS Neuron team, focused on developing and scaling a compiler for ML accelerators. The work involves architecting and implementing features for a deep learning compiler stack that optimizes neural network performance on custom AWS hardware and integrates with frameworks like PyTorch and TensorFlow. The goal is to deliver significant performance improvements for large-scale ML workloads.

What you'd actually do

  1. Architect and implement business-critical features
  2. Publish cutting-edge research
  3. Mentor a brilliant team of experienced engineers
  4. Work as a hands-on partner to AWS ML services teams
  5. Take part in pre-silicon design and in bringing new products and features to market

Skills

Required

  • 5+ years of non-internship professional software development experience
  • 5+ years of programming experience with at least one software programming language
  • 5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
  • 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Experience as a mentor, tech lead or leading an engineering team

Nice to have

  • Bachelor's degree in computer science or equivalent
  • Machine Learning
  • AI accelerators

What the JD emphasized

  • world's largest ML workloads
  • scaling of a compiler
  • deep learning compiler stack
  • optimize the performance of complex neural net models
  • custom-built AWS hardware
  • quantum leap in performance

Other signals

  • AWS Machine Learning accelerators
  • Inferentia chip
  • Trainium
  • AWS Neuron Software Development Kit (SDK)
  • ML compiler
  • runtime
  • PyTorch, TensorFlow and MXNet
  • optimize the performance of complex neural net models on our custom-built AWS hardware
  • toolchain that will provide a quantum leap in performance