Machine Learning Performance Engineer, Annapurna Labs

Amazon · Big Tech · Tel Aviv, IL · Software Development

This role focuses on optimizing the performance of the AWS Neuron software stack, which supports Generative AI and ML workloads on AWS's custom ML accelerators (Inferentia and Trainium). The engineer will analyze ML workloads, develop high-performance kernels, enhance the Neuron SDK, and collaborate with the compiler, frameworks, and hardware teams to maximize end-to-end performance. Responsibilities span instruction scheduling, memory management, parallelism, kernel optimization, and compiler enhancements, with a focus on ML inference and training performance.

What you'd actually do

  1. Optimizing system performance across the entire ML software stack
  2. Analyzing high-performance ML workloads running on Annapurna hardware
  3. Developing high-performance kernels for critical ML operations
  4. Enhancing the Neuron SDK to improve developer experience and system capabilities
  5. Collaborating across Compiler, Frameworks, and Hardware teams to maximize end-to-end performance

Skills

Required

  • Python
  • C++
  • TensorFlow
  • PyTorch
  • JAX
  • Performance optimization of LLM, vision, or other deep-learning models

Nice to have

  • Developing algorithms for simulation tools
  • Compiler optimization, kernel writing, or hardware-software co-design

What the JD emphasized

  • Performance optimization of LLM, vision, or other deep-learning models
  • Compiler optimization, kernel writing, or hardware-software co-design

Other signals

  • AWS Neuron software stack
  • Generative AI and other advanced ML workloads
  • AWS's custom-built ML accelerators
  • ML inference and training
  • ML systems performance and software
  • high-performance kernels
  • compiler enhancements
  • instruction scheduling
  • memory management
  • parallelism
  • kernel optimization
  • hardware-software co-design