Senior Software Development Engineer

AMD AMD · Semiconductors · Singapore · Engineering

Develop and deploy diagnostic tests for AMD Data Center GPU products, covering silicon screening to server rack assembly. Responsibilities include designing and implementing tests, developing automation frameworks, debugging failures, optimizing coverage, and coordinating with cross-functional teams for root cause analysis and issue resolution. Requires strong Python and C++ skills, experience with hardware/silicon validation, and familiarity with server/datacenter systems.

What you'd actually do

  1. Design and implement diagnostic tests for AMD silicon and server platforms
  2. Develop test automation frameworks and infrastructure
  3. Debug test failures and hardware issues across production stages
  4. Optimize test coverage and execution time
  5. Lead root cause analysis and debug efforts for failures on production systems, often in time-sensitive and urgent scenarios

Skills

Required

  • Python
  • C++
  • Linux system administration
  • shell scripting
  • Git version control
  • code review practices
  • diagnostic tools
  • hardware debugging methodologies
  • ROCm/CUDA/OpenCL/Vulkan/OpenGL
  • BS in Computer Science, Computer Engineering, Electrical Engineering

Nice to have

  • hardware/silicon validation
  • manufacturing test environments
  • low-level hardware/software systems
  • server or datacenter systems architecture
  • silicon validation processes
  • test methodologies
  • manufacturing workflows
  • production test environments
  • server architectures
  • BMC
  • firmware
  • system integration
  • GPU/accelerator performance metrics
  • computational throughput
  • memory bandwidth
  • power efficiency
  • thermal characteristics
  • whole-system performance
  • AMD GPU or CPU technologies
  • SQL
  • Snowflake
  • ROCm
  • technical designs
  • test plans
  • procedures
  • cross-functional teams

What the JD emphasized

  • excellent written and verbal communication skills is an absolute