Sr. Software Development Engineer

AMD · Semiconductors · San Jose, CA · Engineering

AMD is seeking a Sr. Software Development Engineer to research, design, develop, and/or test operating systems-level software, compilers, and/or network distribution software for semiconductor operations. The role involves developing software and tools for design, infrastructure, and technology platforms, with a focus on AI and next-generation computing experiences. It requires experience with ML model serving frameworks, multi-GPU inference, and GPU architecture.

What you'd actually do

  1. Research, design, develop, and/or test operating systems-level software, compilers, and/or network distribution software for semiconductor operations, applying principles and techniques of computer science, engineering, and mathematical analysis.
  2. Design, develop, troubleshoot, and debug software programs for enhancements and new products.
  3. Develop software and tools in support of design, infrastructure, and technology platforms, including operating systems, compilers, routers, networks, utilities, databases, and cloud-based and Internet-related tools.
  4. Determine hardware compatibility and/or influence hardware design.
  5. Work in an area of specialization to develop systems-level software, working on problems of complex scope where analysis of situations or data requires a review of a variety of factors.

Skills

Required

  • Model parallelism techniques
  • Existing ML model serving frameworks, including vLLM and TensorRT
  • Multi-GPU inference
  • Collectives and communication mechanisms
  • SHARK, IREE, MLIR, and PyTorch
  • Open-source development ethos
  • Python
  • Bash
  • CUDA
  • GPU architecture and programming
  • macOS software development environments
  • Server systems
  • Object-oriented design
  • Software debugging (macOS or Linux/Unix)
  • Git
  • GNU toolchain
  • x86-64 architecture
  • Big Data Analysis
  • Kubernetes
  • Natural Language Processing

What the JD emphasized

  • Model parallelism techniques
  • Existing ML model serving frameworks, including vLLM and TensorRT
  • Multi-GPU inference
  • Collectives and communication mechanisms
  • SHARK, IREE, MLIR, and PyTorch
  • Open-source development ethos
  • Python, Bash, and CUDA
  • GPU architecture and programming
  • macOS software development environments
  • Server systems
  • Object-oriented design
  • Software debugging (macOS or Linux/Unix)
  • Git
  • GNU toolchain
  • x86-64 architecture
  • Big Data Analysis
  • Kubernetes
  • Natural Language Processing

Other signals

  • ML model serving frameworks
  • Multi-GPU inference
  • GPU architecture and programming