System Design & Debug Manager – AI Customer Engineering

AMD · Semiconductors · Santa Clara, CA · Engineering

This role is a System Design & Debug Manager within AMD's AI Customer Engineering organization. The primary focus is driving complex silicon, system, and fleet-level issues to resolution for AI-related customer programs. It involves leading debug execution, coordinating cross-functional efforts, managing field failures, and improving debug processes. Despite the organization's name, the core responsibilities are system design and debug of hardware and infrastructure, not building or researching AI models.

What you'd actually do

  1. Lead debug execution across hyperscale, OEM, HPC, and enterprise customer programs. Own high‑impact, cross‑customer and systemic issues and maintain visibility into top risks and trends.
  2. Partner with Customer Program Managers to align debug execution with customer deliverables, platform readiness, and deployment schedules. Support escalations and executive‑level customer engagements.
  3. Drive cross‑functional debug efforts across design, validation, product engineering, and failure analysis. Align pre‑ and post‑silicon debug strategies and connect lab debug to real‑world customer environments.
  4. Lead resolution of field failures, fleet anomalies, and data center reliability issues. Aggregate fleet, RMA, and production signals and feed learnings back into design, validation, and manufacturing.
  5. Own debug tracking, prioritization, risk management, and executive reporting. Apply structured methodologies (8D, CAPA, FMEA) and drive continuous improvement in execution speed and consistency.

Skills

Required

  • Deep understanding of data center system architecture (CPU, GPU, FPGA, memory, connectivity, RAS, hotplug)
  • Familiarity with hardware bring up, validation, manufacturing, and test flows
  • Knowledge of reliability and quality metrics (yield, DPM, FIT)
  • Proven experience in the semiconductor industry
  • Deep hands-on experience with silicon debug (pre‑silicon and post‑silicon)
  • Strong background in product development, debug tools, validation, failure analysis, or customer engineering
  • Proven experience managing complex debug programs across multiple customer segments
  • Strong functional team and project management skills, with the ability to drive execution across global, cross-functional teams
  • Excellent written and verbal communication skills, including executive-level engagement
  • Bachelor’s degree in Electrical Engineering, Computer Engineering, Computer Science, or related field

Nice to have

  • Master’s degree

What the JD emphasized

  • deep technical expertise
  • strong cross-functional program leadership
  • deep expertise in pre- and post-silicon debug
  • proven track record of leading critical customer escalations
  • effectively influencing cross-functional teams without direct authority
  • excellent communicator
  • distill complex technical challenges into clear, concise, and decision-oriented messaging for executive leadership and customers
  • Proven experience managing complex debug programs