Senior Datacenter GPU Power Architect

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +1

NVIDIA is seeking a Senior Datacenter GPU Power Architect to design energy-efficient GPU and SOC architectures for AI, HPC, Automotive, and other products. The role involves developing power estimation models and tools, exploring energy efficiency at GPU and datacenter levels, setting power targets, and deploying machine learning techniques for power and performance modeling. The architect will also perform pre-silicon thermal analysis and model cutting-edge technologies, collaborating with cross-functional teams to influence the product roadmap.

What you'd actually do

  1. You will be responsible for conceiving flows/methodologies/KPIs and building the power estimation models/tools for GPU products and systems like NVIDIA DGX/HGX based datacenters.
  2. Early GPU & System Architecture exploration with focus on energy efficiency and TCO improvements at GPU and Datacenter level.
  3. You will set power targets for future projects, track ASIC milestones and drive Performance vs Power Analysis to enable impactful product roadmap decisions.
  4. Deploy machine learning techniques to develop highly accurate power and performance models of our GPUs, CPUs, Switches, and platforms.
  5. Understand the workload characteristics for GenAI/HPC workloads at Datacenter Scale to drive new HW/SW features for Perf@Watt improvements.

Skills

Required

  • MSEE/MSCE, preferably PhD, or equivalent experience related to Power / Performance estimation and optimization techniques.
  • 5+ years of experience
  • Strong knowledge of energy efficient chip/system design fundamentals and related tradeoffs.
  • Familiarity with low power design techniques such as multi-VT, Clock gating, Power gating, and Dynamic Voltage-Frequency Scaling (DVFS).
  • Deeper understanding of processors (GPU is a plus), servers, and system-SW architectures, and their performance/power modeling techniques.
  • Familiarity with Python and Si data analysis
  • Familiarity with performance monitors/simulators used in modern processor architectures.
  • Strong interpersonal and organizational skills and the ability & desire to work as a great teammate.

Nice to have

  • GPU
  • PhD

What the JD emphasized

  • energy efficiency
  • Perf/Watt
  • power estimation models
  • performance models
  • GenAI/HPC workloads

Other signals

  • AI
  • GPU
  • Datacenter
  • Power Architecture
  • Performance Modeling