Software Development Manager - Compiler, AWS Neuron, Annapurna Labs

Amazon · Big Tech · Cupertino, CA · Software Development

Seeking a Software Development Manager to lead a team developing compiler optimization algorithms and deploying a new compiler for AWS custom hardware (Inferentia and Trainium chips). The role involves technical leadership, mentoring, and partnering with AWS ML services teams to improve deep learning model performance and developer productivity.

What you'd actually do

  1. Lead a team of experienced compiler engineers developing compiler optimization algorithms and deploying, at scale, a new compiler targeting AWS custom hardware.
  2. Serve as a technically capable, credible, and curious AWS Neuron manager, innovating on behalf of customers.
  3. Leverage your technical communication skills as a hands-on partner to AWS ML services teams, contributing to pre-silicon design and bringing new products and features to market.
  4. Build software that benefits the entire deep learning community.
  5. Apply deep knowledge of resource management, scheduling, code generation, optimization, and new instruction architectures, including CPU, NPU, GPU, and novel forms of compute.

Skills

Required

  • 5+ years of engineering team management experience
  • 9+ years of experience working directly within engineering teams
  • 4+ years of experience designing or architecting (design patterns, reliability, and scaling) new and existing systems
  • Experience partnering with product or program management teams
  • Deep understanding of compilers (resource management, instruction scheduling, code generation, and compute graph optimization)
  • Strong software design fundamentals
  • Excellent system-level coding skills with an emphasis on graph theory and performance techniques

Nice to have

  • PhD in computer science, computer engineering, or a related field, or an MS degree
  • Experience with general troubleshooting/debugging of hardware
  • Experience developing and deploying LLMs in production on GPUs, Neuron, TPUs, or other AI acceleration hardware
  • Experience with XLA, TVM, MLIR, LLVM, deep learning models and algorithms, and deep learning framework design
  • Interaction with open-source communities, in either a leadership or code-contributor role

What the JD emphasized

  • compiler optimization algorithms
  • deploying, at scale, a new compiler targeting AWS custom hardware
  • deep knowledge of resource management, scheduling, code generation, optimization, and new instruction architectures

Other signals

  • AWS Machine Learning accelerators
  • Inferentia chip
  • Trainium chip
  • AWS Neuron Software Development Kit (SDK)
  • ML compiler
  • runtime
  • PyTorch
  • JAX
  • deep learning community
  • custom hardware
  • deep learning models
  • compiler optimization algorithms
  • inference performance
  • training performance