AI Customer Engineering Customer Co‑validation Lead

AMD AMD · Semiconductors · Penang, Malaysia · Engineering

This role leads customer-facing validation strategy for AMD Instinct™ GPU platforms, focusing on co-developing end-to-end system validation with hyperscale, enterprise, and MDC partners. The lead will own technical relationships, drive debug across complex systems (GPU, CPU, NIC, firmware, ROCm, AI software stack), and translate learnings into product improvements. While the role uses AI tools and deals with AI performance, its core function is system-level validation and customer engineering, not direct AI model development.

What you'd actually do

  1. Lead Customer Co‑Validation Strategy, execution, engagements involving AI performance and GPU software stack issues and develop joint validation plans.
  2. Own the technical relationship for system‑level co‑validation with strategic customers.
  3. Drive execution of co‑validation cycles across single‑node, multi‑node, rack‑scale, and clusters systems.
  4. Ensure AMD and customer test coverage is complementary, efficient, and aligned to shared deployment goals.
  5. Lead cross‑domain debug spanning GPU, CPU, NIC, BMC/AMC, firmware, ROCm, OS, virtualization, orchestration, libraries, drivers, AI software stack and customer automation frameworks.

Skills

Required

  • Deep experience with Linux, OS/kernel/driver stacks, ROCm or CUDA‑like environments, and GPU/CPU/NIC integration.
  • Strong background in multi‑node / rack‑scale validation, orchestration, virtualization (KVM, VMware, Hyper‑V), and manageability (Redfish/BMC/AMC).
  • Skilled in system‑level debug across firmware, software, hardware, and automation layers.
  • Expertise in test plan architecture, debug methodologies, and multi-domain system validation.
  • Proficiency with Python or similar scripting for automation.
  • Bachelor’s or Master’s in Computer Engineering, Computer Science, Electronics, Electrical Engineering, or similar discipline.
  • 10+ years of relevant experience in system validation, customer engineering, platform bring‑up, or datacenter clusters/solution enablement roles.

Nice to have

  • Familiarity with AI concepts, frameworks, or applications.
  • Experience with customer co-validation, data center cluster deployment or AI performance tuning is a strong plus.
  • Comfortable leading technical discussions with senior customer engineers and decision-makers.
  • Exceptional communication skills, both technical and customer-facing.

What the JD emphasized

  • AI performance
  • system-level debug
  • customer co-validation
  • AI performance tuning