Senior Solutions Architect, Datacenter Cpus

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

NVIDIA is seeking a Senior Solutions Architect experienced in Arm-based server CPUs to join their team in Santa Clara, CA. The role involves acting as a technical liaison for NVIDIA’s CPU portfolio with cloud partners, encouraging adoption and improving next-generation cloud instances and services. Responsibilities include architecting and validating multi-tenant cloud infrastructure, guiding customers through workload migration and optimization, performing performance analysis, and creating technical content for evangelism. The ideal candidate will have a Bachelor’s degree, 8+ years of experience in a technical customer-facing role, a solid understanding of server CPU architecture, and hands-on experience with Arm-based processors and the Arm software ecosystem.

What you'd actually do

  1. Workload Migration & Ecosystem Readiness: We will guide customers through the porting and optimization of x86 workloads to ARM while collaborating with ISVs and open-source communities to guarantee application readiness across our entire software stack.
  2. Performance Tuning & Issue Resolution: We will implement rigorous performance analysis and benchmarking of key cloud workloads, partnering closely with our internal engineering teams to solve complex scalability and reliability issues across the CPU, memory, and networking levels.
  3. Technical Evangelism & Content Creation: We need you to prepare highly sought-after technical content and present at customer build reviews, executive briefings, and industry events to help us clearly articulate the technical and business value of our NVIDIA CPUs.

Skills

Required

  • Bachelor’s degree in Computer Engineering, Electrical Engineering, Computer Science, or equivalent experience in the field.
  • 8+ years in solution architecture, systems engineering, performance engineering, or similar technical customer-facing role.
  • Solid understanding of server CPU architecture and microarchitecture.
  • Hands-on experience with Arm-based server processors and the Arm software ecosystem.
  • Experience migrating applications from x86 or other ARM platforms, resolving ISA compatibility issues, recompiling dependencies, and applying targeted optimizations to improve performance on the target device.
  • Skill in constructing credible head-to-head comparisons against competing silicon by controlling for process node, memory configuration, and compiler flags to produce publishable performance claims.
  • Measuring real-world efficiency and thermal performance to prove superior TCO advantages over competitors.
  • Deep familiarity with Linux on Arm, including kernel parameters, packaging, and system bring-up for data center platforms.
  • Strong proficiency in C/C++ and at least one scripting language for tooling, automation, and performance experiments.

Nice to have

  • Hands-on experience configuring bootloaders, kernel drivers, and OS distributions (Linux, RTOS) to rapidly stand up a working software environment on production ARM hardware.
  • Practical experience operating production workloads on major public cloud providers, supported by a solid grasp of modern data center architectures and the interaction between CPUs, GPUs, and networking.
  • Hand-on experience with benchmark suites (SPEC CPU, MLPerf, custom traces) to characterize customer workloads and produce compelling performance comparisons.
  • Superb communication and presentation skills, with the ability to engage both deep technical and executive audiences.

What the JD emphasized

  • Solid understanding of server CPU architecture and microarchitecture
  • Hands-on experience with Arm-based server processors and the Arm software ecosystem
  • Experience migrating applications from x86 or other ARM platforms, resolving ISA compatibility issues, recompiling dependencies, and applying targeted optimizations to improve performance on the target device
  • Skill in constructing credible head-to-head comparisons against competing silicon by controlling for process node, memory configuration, and compiler flags to produce publishable performance claims
  • Measuring real-world efficiency and thermal performance to prove superior TCO advantages over competitors
  • Deep familiarity with Linux on Arm, including kernel parameters, packaging, and system bring-up for data center platforms
  • Hands-on experience with benchmark suites (SPEC CPU, MLPerf, custom traces) to characterize customer workloads and produce compelling performance comparisons