Lead System Debug Engineer - Server Validation

AMD AMD · Semiconductors · Austin, TX · Engineering

Lead System Debug Engineer for AMD EPYC Server & Instinct products, focusing on post-silicon validation, system and silicon debug, and driving improvements to debug methodology. This role involves leading complex debug efforts, managing technical issues, and communicating program status to stakeholders.

What you'd actually do

  1. Debug issues found during Silicon bring-up, Post-Silicon Validation, and production phases of SOC programs.
  2. Lead complex debug efforts for internal Silicon findings to identify root cause and resolution.
  3. Ensure issues are solved on time with quality.
  4. Manage and track technical issues, risks and priorities to un-gate program milestones.
  5. Publish debug program indicators to identify major roadblocks and drive changes to improve debug throughput.

Skills

Required

  • Extensive experience in validation roles involving debugging OS, FW, Silicon, and HW issues.
  • Understanding of PC industry standard busses and their software stack, such as PCIe, CXL.
  • Strong knowledge of X86 architecture, SoC design, memory, RAS & power management
  • Extensive knowledge of system architecture, technical debug, and validation strategy
  • Good understanding and experience in platform/ system level debug, Operating System, Device Drivers and System BIOS interactions.
  • Experience in technical program management.
  • A thorough understanding of datacenter industry technologies and their software stack.

Nice to have

  • Strong analytical/problem-solving skills and pronounced attention to details
  • Excellent communication and coordination skills.
  • Detailed oriented, highly organized, able to prioritize, and juggle multiple work streams to tight deadlines.
  • Must be a self-starter, and able to independently drive tasks to completion

What the JD emphasized

  • technical leadership skills
  • validation and debug expertise