Software Development Engineer - GPU Debugger

AMD · Semiconductors · Austin, TX · Engineering

Join the AI Developer Tools team as a Software Development Engineer, designing and developing advanced GPU debugging tools for HPC, ML, and AI workloads within the ROCm ecosystem. The role involves collaborating with architecture, driver, and runtime teams, implementing new features, optimizing tools, and ensuring profiling support for new ASICs.

What you'd actually do

  1. Design, develop, and maintain ROCm Debugger components for GPU debugging
  2. Collaborate with architecture, driver, and runtime teams to enable profiling for next-generation AMD GPUs, including pre-silicon and post-silicon activities
  3. Implement new features and APIs to enhance debugging capabilities for AI and HPC workloads
  4. Optimize profiling tools for accuracy, scalability, and minimal overhead
  5. Debug and resolve issues in profiling workflows and improve tool reliability

Skills

Required

  • C++
  • computer architecture
  • multithreading
  • concurrency
  • Linux development environments
  • Git-based workflows

Nice to have

  • GDB
  • LLDB
  • HIP
  • OpenCL
  • CUDA
  • ROCm ecosystem
  • Windows experience