Cloud Hardware Dev Engineer (aws Generative AI & ML Servers), Aws Generative AI & ML Servers

Amazon Amazon · Big Tech · Seattle, WA · Hardware Development

This role focuses on the hardware engineering for AWS cloud servers, specifically those designed for AI/ML and HPC workloads. The engineer will be responsible for the design, development, testing, and operation of accelerated server segments, working with interdisciplinary teams and external partners to deliver these products at scale. The role involves understanding customer needs, architecting solutions, and overseeing the fleet's quality and performance post-launch.

What you'd actually do

  1. own and lead the design, development and root cause of a new segment of accelerated servers.
  2. work closely with our customers to understand their technical needs and business goals, leveraging your experience with server design and the knowledge of various teams to architect the solutions that we will deploy at scale.
  3. work with an interdisciplinary team of component, firmware, test, qualification, and integration engineers, and lead our design and manufacturing partners to bring these servers to the data center.
  4. oversee the fleet of servers you develop, monitoring their quality and how they are meeting the customer requirements.
  5. interfacing with our internal and external customers to understand project requirements and facilitate system development ontop of your server design.

Skills

Required

  • Experience working with interdisciplinary teams to execute product design from concept to production
  • 10+ years of new hardware products development experience
  • Experience developing and executing test procedures for mechanical or electrical systems/components based on design intent and approved equipment submissions
  • Experience in server technologies such as, thermal, mechanical, power, and signal integrity
  • Bachelor's degree in electrical engineering or equivalent
  • 7+ years of server hardware development and troubleshooting experience

Nice to have

  • Experience with the project management of technical projects
  • Experience in compute and storage server architecture and design for large scale applications
  • Master's degree or above in electrical engineering, computer engineering, or equivalent
  • 5+ years hardware development with a focus on system / server development in compute and/or storage server architecture and design for large scale applications
  • AI Infrastructure development experience, especially with next generation GPUs

What the JD emphasized

  • new hardware products development experience
  • server technologies such as, thermal, mechanical, power, and signal integrity
  • server hardware development and troubleshooting experience
  • AI Infrastructure development experience, especially with next generation GPUs