Cloud Hardware Dev Engineer (aws Generative AI & ML Servers), Aws Hardware Engineering Services

Amazon Amazon · Big Tech · Seattle, WA · Hardware Development

This role focuses on the hardware engineering for AWS cloud servers specifically designed for AI training and inference. The engineer will be responsible for the design, development, testing, and operation of these accelerated servers, working with interdisciplinary teams and manufacturing partners to deliver scalable solutions for AI/ML and HPC workloads. While the servers support AI, the role itself is in hardware engineering, not directly building AI models or agents.

What you'd actually do

  1. own and lead the design, development and root cause of a new segment of accelerated servers
  2. work closely with our customers to understand their technical needs and business goals, leveraging your experience with server design and the knowledge of various teams to architect the solutions that we will deploy at scale
  3. work with an interdisciplinary team of component, firmware, test, qualification, and integration engineers, and lead our design and manufacturing partners to bring these servers to the data center
  4. oversee the fleet of servers you develop, monitoring their quality and how they are meeting the customer requirements
  5. interfacing with our internal and external customers to understand project requirements and facilitate system development ontop of your server design

Skills

Required

  • Experience working with interdisciplinary teams to execute product design from concept to production
  • Experience developing and executing test procedures for mechanical or electrical systems/components based on design intent and approved equipment submissions
  • Knowledge of server hardware and components
  • Bachelor's degree in electrical engineering or equivalent
  • 2+ years of server hardware troubleshooting and repair experience
  • 4+ years of hardware design and validation of components, subsystems and systems experience

Nice to have

  • Master's degree or above in electrical engineering, computer engineering, or equivalent
  • Experience in compute and storage server architecture and design for large scale applications
  • AI infrastructure hardware development and debugging experience

What the JD emphasized

  • AI infrastructure hardware development and debugging experience