Senior GPU Memory System Architect

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Seeking a Senior GPU Memory System Architect to design and optimize GPU memory systems, memory management, virtualization, and security for leading-edge silicon processes. Responsibilities include developing architecture, creating performance models, analyzing workloads, and defining specifications for NVIDIA's AI, data center, and autonomous vehicle solutions.

What you'd actually do

  1. Developing architecture and micro-architecture to improve the state-of-the-art in GPU memory system, Memory management, Virtualization & security optimizing along the axes of performance, power efficiency, complexity, area, effort, and schedule.
  2. Participating in performance simulation of features to improve translation efficiency to access memory and io using protocol like PCIe.
  3. Implementing and maintaining high-level functional and performance models.
  4. Analyzing benchmarks, application workloads and performance simulation results to identify areas for microarchitecture optimizations.
  5. Defining and performing experiments to study the machine in action, presenting experiment results to the larger group and proposing mechanisms for improvement.

Skills

Required

  • Master degree or equivalent experience in Electrical Engineering, Computer Science, Computer Engineering or related field
  • 3+ years of meaningful work experience in GPU or CPU Architecture and development specifically in the area interconnects, QoS. Memory hierarchy, Memory model & ordering, multifunction HW accelerator, software, virtualization, and security.
  • Strong communication and interpersonal skills

Nice to have

  • PhD with a focus in computer architecture
  • successfully mentoring junior engineers and interns
  • hardware memory management unit
  • prefetching
  • accelerator IPs
  • memory subsystem hierarchy
  • multi-core systems
  • coherent interconnects
  • Industry IO protocol like PCIe/CXL
  • encryption
  • compression
  • confidential compute
  • virtualization & security

What the JD emphasized

  • optimizing performance, area, complexity, and power