What you'd actually do

Design and implement tooling for profiling, optimization, and resource management of ML workloads on custom accelerators.

Build high-impact solutions that ship to a large and growing customer base.

Participate in design discussions, code reviews, and cross-functional collaboration with hardware, software, and customer-facing teams.

Create metrics, implement automation, and resolve root causes of software defects.

Work in a startup-like environment where you're always focused on the most important problems.

Skills

Required

Experience with at least one modern language such as Java, Python, C++, or C# including object-oriented design
Experience with at least one general-purpose programming language such as Java, Python, C++, C#, Go, Rust, or TypeScript
Experience with data structure implementation, basic algorithm development, and/or object-oriented design principles
Proficiency in Java and at least one of Go, Python, or TypeScript.
Familiarity with Git and CI/CD pipelines.

Nice to have

Experience from a technical internship
Experience in optimization mathematics such as linear programming and nonlinear optimization
Experience with distributed, multi-tiered systems, algorithms, and relational databases
Work 40 hours/week, and overtime as required
Experience from previous technical internship(s) or demonstrated project experience
Experience with Cloud platforms (preferably AWS), database systems (SQL and NoSQL), AI tools for development productivity, contributing to open-source projects, and/or version control systems
Internship or project experience with AWS services (EKS, EC2, Lambda, S3, DynamoDB, or SQS).
Familiarity with distributed systems or big data architectures.
Experience with Linux systems and performance profiling.
Exposure to compiler toolchains, code generation, or instruction set architectures (CPU, NPU, GPU).

Annapurna Labs was a startup acquired by AWS in 2015 and is now fully integrated. If AWS is an infrastructure company, think of Annapurna Labs as the infrastructure provider of AWS. Our org spans silicon engineering, hardware design and verification, software, and operations. We've delivered AWS Nitro, ENA, EFA, Graviton, F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and scalable NVMe storage.

AWS Neuron is the complete software stack for AWS Inferentia and Trainium cloud-scale machine learning accelerators and the Trn1 and Inf1 servers that use them.

We're looking for a Software Development Engineer to help build and evolve machine learning tools that run, optimize, and analyze ML workloads on custom AI accelerators. You'll work across the stack, from infrastructure orchestration to developer-facing tooling - alongside hardware engineers, system architects, and ML researchers both within and outside Amazon.

Key job responsibilities

Design and implement tooling for profiling, optimization, and resource management of ML workloads on custom accelerators.
Build high-impact solutions that ship to a large and growing customer base.
Participate in design discussions, code reviews, and cross-functional collaboration with hardware, software, and customer-facing teams.
Create metrics, implement automation, and resolve root causes of software defects.
Work in a startup-like environment where you're always focused on the most important problems.

About the team This is a high-impact, high-visibility team where your work directly accelerates every Neuron team's ability to ship, effectively multiplying the output of 100+ engineers. We're a small, senior group actively building greenfield capabilities, which means significant design ownership for SDEs and the opportunity to own major components and drive architectural decisions. You'll work at the cutting edge of AI infrastructure, at the intersection of Kubernetes, custom silicon, and large-scale ML workloads.

Basic Qualifications

Are 18 years of age or older
Experience with at least one modern language such as Java, Python, C++, or C# including object-oriented design
Experience with at least one general-purpose programming language such as Java, Python, C++, C#, Go, Rust, or TypeScript
Experience with data structure implementation, basic algorithm development, and/or object-oriented design principles
Proficiency in Java and at least one of Go, Python, or TypeScript.
Familiarity with Git and CI/CD pipelines.

Preferred Qualifications

Experience from a technical internship
Experience in optimization mathematics such as linear programming and nonlinear optimization
Experience with distributed, multi-tiered systems, algorithms, and relational databases
Work 40 hours/week, and overtime as required
Experience from previous technical internship(s) or demonstrated project experience
Experience with Cloud platforms (preferably AWS), database systems (SQL and NoSQL), AI tools for development productivity, contributing to open-source projects, and/or version control systems
Internship or project experience with AWS services (EKS, EC2, Lambda, S3, DynamoDB, or SQS).
Familiarity with distributed systems or big data architectures.
Experience with Linux systems and performance profiling.
Exposure to compiler toolchains, code generation, or instruction set architectures (CPU, NPU, GPU).

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, CA, Cupertino - 127,100.00 - 185,000.00 USD annually

AWS Neuron is the complete software stack for AWS Inferentia and Trainium cloud-scale machine learning accelerators and the Trn1 and Inf1 servers that use them.

Key job responsibilities

Design and implement tooling for profiling, optimization, and resource management of ML workloads on custom accelerators.
Build high-impact solutions that ship to a large and growing customer base.
Participate in design discussions, code reviews, and cross-functional collaboration with hardware, software, and customer-facing teams.
Create metrics, implement automation, and resolve root causes of software defects.
Work in a startup-like environment where you're always focused on the most important problems.

Basic Qualifications

Are 18 years of age or older
Experience with at least one modern language such as Java, Python, C++, or C# including object-oriented design
Experience with at least one general-purpose programming language such as Java, Python, C++, C#, Go, Rust, or TypeScript
Experience with data structure implementation, basic algorithm development, and/or object-oriented design principles
Proficiency in Java and at least one of Go, Python, or TypeScript.
Familiarity with Git and CI/CD pipelines.

Preferred Qualifications

Experience from a technical internship
Experience in optimization mathematics such as linear programming and nonlinear optimization
Experience with distributed, multi-tiered systems, algorithms, and relational databases
Work 40 hours/week, and overtime as required
Experience from previous technical internship(s) or demonstrated project experience
Experience with Cloud platforms (preferably AWS), database systems (SQL and NoSQL), AI tools for development productivity, contributing to open-source projects, and/or version control systems
Internship or project experience with AWS services (EKS, EC2, Lambda, S3, DynamoDB, or SQS).
Familiarity with distributed systems or big data architectures.
Experience with Linux systems and performance profiling.
Exposure to compiler toolchains, code generation, or instruction set architectures (CPU, NPU, GPU).

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

USA, CA, Cupertino - 127,100.00 - 185,000.00 USD annually

Software Development Engineer I, ML Infra Services, Annapurna Labs

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Basic Qualifications

Preferred Qualifications

Basic Qualifications

Preferred Qualifications