Senior Cpu Performance Architect

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +2

NVIDIA is seeking a Senior CPU Performance Architect to drive the development of CPU technology for architectures used in AI/DL, HPC, CSP, gaming, VR, and autonomous vehicles. The role involves workload bring-up, performance analysis, identifying bottlenecks, and collaborating with architects to improve future designs. The candidate will also benchmark NVIDIA's CPUs against competitors and suggest improvements.

What you'd actually do

  1. Work on workload bring-up and performance analysis/projection, both on silicon and full-system simulator.
  2. Study workloads for a wide range of markets, including AI/DL, CSP, HPC, and autonomous vehicles.
  3. Study real world use-cases and identify critical application behavior and reduce to directed test cases.
  4. Analyze and debug performance scaling bottlenecks on multi-core and multi-socket CPU and CPU/GPU systems.
  5. Work with CPU and interconnect architects to improve future CPU and system designs based on your findings.

Skills

Required

  • BS/MS in Electrical Engineering, Computer Science, Computer Engineering, or equivalent experience
  • CPU workloads and performance analysis
  • Performance test development and benchmarking for CPU and I/O
  • CPU microarchitecture and system architecture

Nice to have

  • ARM instruction set architecture (ISA)
  • PhD or Research experience
  • GPU driver experience
  • Knowledge of GPU-accelerated workloads and modeling performance of accelerated workloads
  • Experience with performance optimization of AI frameworks such as PyTorch

What the JD emphasized

  • 7+ years of relevant experience
  • Experience with CPU workloads and performance analysis
  • Knowledge of performance test development and benchmarking for CPU and I/O
  • Deep knowledge of CPU microarchitecture and system architecture