System Thermal Design Staff Engineer

AMD AMD · Semiconductors · Penang, Malaysia · Engineering

AMD is seeking an experienced System Thermal Design Engineer to lead thermal design and validation for next-generation server, AI, and datacenter platforms. The role involves driving system thermal strategy, debugging thermal issues, and collaborating with cross-functional teams to ensure robust thermal performance for high-performance computing platforms.

What you'd actually do

  1. Lead system-level thermal design and validation activities for CPU/GPU server and AI platforms
  2. Develop and execute thermal validation plans, methodologies, and thermal design strategies
  3. Drive thermal characterization and thermal margin analysis across various workloads and environmental conditions
  4. Lead thermal issue debugging, root-cause analysis, and corrective actions
  5. Perform chamber testing, instrumentation setup, and thermal data analysis

Skills

Required

  • System-level thermal design and validation for CPU/GPU platforms
  • Server, datacenter, AI, or high-performance computing systems
  • Lead technical activities and drive cross-functional collaboration
  • Problem-solving and debugging skills
  • System thermal design and optimization
  • Thermal chamber testing
  • Thermal instrumentation and thermocouple setup
  • Thermal characterization and workload validation
  • Thermal margin analysis
  • Hotspot identification and airflow validation
  • Fan tuning and thermal optimization
  • CPU/GPU thermal validation
  • Thermal telemetry and data analysis
  • Validation methodology development
  • System bring-up and issue debugging
  • Mechanical Engineering
  • Thermal Engineering

Nice to have

  • FloTHERM
  • ANSYS Icepak
  • Thermal simulation and CFD correlation
  • Airflow and thermal modeling experience
  • AMD EPYC platforms
  • GPU server or AI systems
  • NVIDIA HGX or multi-GPU platforms
  • Familiarity with CFD tools and simulation correlation
  • Liquid cooling validation experience
  • Validation automation or scripting experience

What the JD emphasized

  • AI platforms
  • datacenter platforms
  • high-performance computing platforms
  • server
  • AI server thermal validation