AI Factory CPU-Focused Solutions Architect

NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

This role focuses on designing, building, and maintaining large-scale HPC and AI infrastructure, specifically CPU-based solutions within the NVIDIA AI Factory. The Solutions Architect will help customers adopt end-to-end AI solutions, operationalize large compute resources, and overcome adoption barriers. The role requires a deep technical understanding of NVIDIA's stacks and AI workflows.

What you'd actually do

  1. Our day-to-day work involves helping our partners be successful in their adoption of end-to-end AI solutions using NVIDIA's compute, networking, and software stacks.
  2. For this particular role, that means having a deep technical understanding of NVIDIA Reference Architectures, and using that understanding to help customers adopt our CPU-based solutions as part of the overall NVIDIA AI Factory.
  3. This is a multi-faceted role that requires being comfortable working not just with hardware and software elements, but also with the larger AI workflow and the operationalization of large-scale compute resources.
  4. We succeed when we help our customers overcome barriers to adopting our best known methods.
  5. As the technical leader for the CPU components within the NVIDIA AI Factory, you will play an instrumental role in driving that success.

Skills

Required

  • Experience with defining, deploying, and testing large-scale reference architectures for High Performance Computing and AI.
  • A track record of defining and using MLOps and AI workflow tools and processes.
  • 6 or more years of hands-on expertise with modern data center architectures and the interaction between CPUs, GPUs, and networking.
  • Strong foundational expertise and a BS, MS, or equivalent experience in Engineering.
  • Strong analytical and problem-solving skills.
  • Ability to clearly articulate technical knowledge to others.
  • Ability to multitask efficiently in a multifaceted environment.
  • Experience organizing, presenting, and discussing technical material with groups spanning a range of technical backgrounds.
  • Flexibility to adapt in fluid situations, especially with partners or customers.
  • Comfortable with occasional travel to customer sites.

Nice to have

  • Hands-on experience with Arm-based server processors and the Arm software ecosystem.
  • Proficiency with tooling, automation, and performance testing for large-scale clusters, preferably using AI tools.
  • Deep understanding of Agentic AI and inference workflows.
  • Experience building, using, and explaining reinforcement learning.
  • Willingness and ability to learn quickly as we address sophisticated problems, and an understanding of how all elements of the AI Factory interact with each other.

What the JD emphasized

  • large-scale HPC and AI infrastructure
  • deploy and operationalize AI solutions at scale
  • NVIDIA Reference Architectures
  • CPU-based solutions
  • larger AI workflow
  • operationalization of large scale compute resources
  • defining, deploying, and testing large scale reference architectures for High Performance Computing and AI
  • defining and using MLOps and AI workflow tools and processes
  • modern data center architectures
  • interaction between CPUs, GPUs, and networking
  • Agentic AI and inference workflows

Other signals

  • designing, building, and maintaining large-scale HPC and AI infrastructure
  • technical leader for the CPU components within the NVIDIA AI Factory