Software Development Engineer I, ML Infra Services, Annapurna Labs

Amazon Amazon · Big Tech · Seattle, WA · Software Development

Software Development Engineer I role focused on building and evolving machine learning infrastructure services, specifically tooling for profiling, optimization, and resource management of ML workloads on custom AI accelerators. The role involves working across the stack from infrastructure orchestration to developer-facing tooling, with a focus on shipping solutions to a large customer base and contributing to the development of AI accelerators like AWS Neuron.

What you'd actually do

  1. Design and implement tooling for profiling, optimization, and resource management of ML workloads on custom accelerators.
  2. Build high-impact solutions that ship to a large and growing customer base.
  3. Participate in design discussions, code reviews, and cross-functional collaboration with hardware, software, and customer-facing teams.
  4. Create metrics, implement automation, and resolve root causes of software defects.
  5. Work in a startup-like environment where you're always focused on the most important problems.

Skills

Required

  • Kubernetes, Docker or containers ecosystem
  • deploying identity and access management systems
  • software development lifecycle
  • Java
  • Go, Python, or TypeScript
  • Git
  • CI/CD pipelines

Nice to have

  • AWS services (EKS, EC2, Lambda, S3, DynamoDB, or SQS)
  • distributed systems
  • big data architectures
  • Linux systems
  • performance profiling
  • compiler toolchains
  • code generation
  • instruction set architectures (CPU, NPU, GPU)

Other signals

  • ML Infra Services
  • custom AI accelerators
  • ML workloads
  • inference
  • optimization
  • resource management