Senior Solutions Architect, Csp System

NVIDIA NVIDIA · Semiconductors · Beijing, China +1

Senior Solutions Architect for NVIDIA's CSP team in China, focusing on CPU technologies to optimize AI data center performance and drive the adoption of NVIDIA's integrated CPU-GPU-DPU platforms.

What you'd actually do

  1. Work with Sales, BD and CPM team to introduce NVIDIA technologies into assigned accounts and grow business accordingly.
  2. Serve as the primary technical authority on CPU technologies for NVIDIA’s Chinese CSP customers, providing expert consultation on CPU selection, architecture design, and integration with NVIDIA’s AI infrastructure (including Grace/Vera CPUs and NVL72 platforms).
  3. Lead CPU-focused technical engagements with CSPs, collaborating with their R&D, infrastructure, and AI teams to understand workload requirements (e.g., AI data preprocessing, HPC, distributed computing) and design optimized CPU-GPU integrated solutions.
  4. Drive CPU performance optimization for CSP workloads, conducting in-depth analysis of bottlenecks, implementing tuning strategies (including SIMD instruction set optimization and low-level intrinsics), and delivering reference implementations to unlock full platform potential.
  5. Act as a liaison between CSP customers and NVIDIA’s global engineering, product, and R&D teams, advocating for customer-specific CPU requirements, providing feedback on product roadmaps, and ensuring alignment with NVIDIA’s technical strategy and export compliance guidelines.

Skills

Required

  • CPU architecture
  • performance optimization
  • data center infrastructure
  • high-performance computing (HPC)
  • AI workloads
  • CPU microarchitecture (e.g., x86, ARM)
  • performance analysis tools
  • CPU benchmarking
  • bottleneck-driven tuning
  • C/C++
  • Python
  • low-level software optimization
  • compiler toolchains
  • performance libraries
  • working with major Chinese CSPs or global hyperscalers
  • technical communication
  • presentation skills
  • cross-functional collaboration

Nice to have

  • NVIDIA Grace/Vera CPUs
  • ARM-based high-performance CPUs
  • integrated CPU-GPU-DPU platforms
  • CPU in Agentic AI
  • CPU in Post-Training
  • AI/ML workload optimization
  • data preprocessing
  • distributed training
  • inference pipelines on CPU platforms
  • open-source performance tools
  • HPC frameworks
  • CPU optimization libraries
  • leading technical programs
  • cross-functional initiatives for CSP customers
  • PoC delivery
  • large-scale deployment support
  • NVIDIA data center products (GPUs, DPUs, CPUs)
  • NVIDIA software stacks
  • AI factory concepts
  • large-scale data center deployment

What the JD emphasized

  • Hands-on ability is mandatory.