Product Test Engineer,, Trainium Manufacturing, Quality and Reliability

Amazon Amazon · Big Tech · TPE, Taiwan +1 · Hardware Development

This role focuses on designing and implementing system-level test solutions for ML acceleration hardware within AWS. It involves validating hardware and software for large-scale deployment, collaborating with cross-functional teams, and ensuring the quality and reliability of ML infrastructure.

What you'd actually do

  1. Design and implement system-level test strategies for ML acceleration products
  2. Develop comprehensive functional and performance tests for complete ML systems
  3. Create and maintain scalable test infrastructure for high-volume product validation
  4. Implement product bring-up and first-boot test procedures
  5. Drive improvements in test coverage, product quality, and manufacturing efficiency

Skills

Required

  • System-level test strategies
  • Functional and performance testing
  • Test infrastructure development
  • Product bring-up
  • Hardware/software integration
  • Debugging complex interactions

Nice to have

  • ML workloads
  • Data center environments

What the JD emphasized

  • ML acceleration hardware
  • system-level test solutions
  • large-scale deployment