Sr. Embedded Firmware Engineer, Annapurna Labs ML Acceleration Systems Software

Amazon · Big Tech · Austin, TX · Software Development

This role involves developing systems software and firmware for machine learning acceleration servers, focusing on initialization, monitoring, and management of custom accelerator chips. The work includes developing device drivers (I2C, SPI, PCIe) and ensuring the production software is scalable and maintainable for AWS infrastructure. The posting explicitly states that the role does not involve work on ML algorithms; the work is on the physical systems that execute them.

What you'd actually do

  1. Develop firmware to initialize, monitor, and manage custom machine learning accelerators at AWS scale.
  2. Work with the MLA Hardware, Test and Manufacturing teams to create a coordinated software package that enables both qualification and rapid deployment of software.
  3. Develop software (C, C++, Python, Lua) that can be maintained, improved upon, documented, tested, and reused.

Skills

Required

  • 5+ years of non-internship professional software development experience
  • 5+ years of programming experience with at least one programming language
  • 5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
  • 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Experience as a mentor, tech lead, or leader of an engineering team
  • Bachelor's degree in computer science or equivalent

Nice to have

  • Knowledge of computer architecture, operating systems, and parallel computing
  • Knowledge of system performance, memory management, and parallel computing principles