Platform Architecture Engineer, GeForce NOW

NVIDIA · Semiconductors · Santa Clara, CA

This role focuses on architecting and optimizing cloud infrastructure for AI workloads, specifically for the GeForce NOW service. The engineer will perform deep performance and power analysis of GPU/CPU microarchitecture for AI inference, deploy and optimize AI/gaming kernels, and build models to guide platform decisions balancing performance, power, and cost. The role requires strong programming skills and experience with AI models and performance analysis methodologies.

What you'd actually do

  1. Architect next-generation cloud infrastructure optimized for AI workloads alongside gaming
  2. Perform deep performance and power analysis of GPU/CPU microarchitecture features for AI inference and gaming workloads
  3. Deploy, optimize, and benchmark AI/gaming kernels in the cloud across various system configurations
  4. Build models and tools to guide platform decisions balancing performance, power, and cost
  5. Present findings to cross-functional teams including product, engineering, and executives

Skills

Required

  • Bachelor's, Master's, or PhD degree, or equivalent experience, in a relevant field (Computer Engineering, Computer Science, Electrical Engineering, AI)
  • 8+ years of experience in CPU or GPU performance, working on microarchitecture bottleneck analysis
  • Demonstrated expertise in conducting hardware power and performance analysis with an understanding of microarchitecture design trade-offs
  • Experience characterizing and optimizing AI, gaming, and/or cloud workloads, including software and compiler-level optimizations
  • Strong programming skills in C, C++, Python, and scripting languages, with hands-on experience configuring, deploying, and running AI models
  • Good understanding of performance analysis methodologies, including code instrumentation, sampling, and roofline analysis
  • Demonstrated problem-solving skills
  • Ability to analyze complex data, draw insightful conclusions, and form hypotheses
  • Strong presentation skills

What the JD emphasized

  • 8+ years of experience in CPU or GPU performance, working on microarchitecture bottleneck analysis
  • Demonstrated expertise in conducting hardware power and performance analysis with an understanding of microarchitecture design trade-offs
  • Experience characterizing and optimizing AI, gaming, and/or cloud workloads, including software and compiler-level optimizations
  • Strong programming skills in C, C++, Python, and scripting languages, with hands-on experience configuring, deploying, and running AI models

Other signals

  • architecting cloud infrastructure for AI workloads
  • performance and power analysis of GPU/CPU microarchitecture for AI inference
  • deploying, optimizing, and benchmarking AI/gaming kernels
  • building models and tools to guide platform decisions