Senior Network Solution Architect – AI … at NVIDIA

What you'd actually do

Partner with AI-native / consumer internet customers on large data center GPU and networking deployments. Guide architecture decisions across network, compute, and storage, including fabric design. Support on-site bring-up of server, network, and cluster infrastructure in customer data centers.

Demonstrate expertise on advanced GPU and network systems (Spectrum-X, BlueField DPU, InfiniBand/RoCE, etc.) for key accounts. Run regular technical account reviews covering roadmap alignment, cluster issues, feature discussions, and new technology introductions. Capture customer-specific requirements and translate them into concrete feedback for product, architecture, and engineering teams.

Analyze and debug configuration and performance issues in RoCE and InfiniBand environments. Work across NICs, switches, Linux, and system software to deliver performant, reliable AI clusters.

Identify and shape new project opportunities for NVIDIA GPUs, networking, and software in AI and data center use cases. Collaborate closely with Systems Engineering, Product Management, and Sales to align solutions with customer outcomes. Build targeted POCs that showcase the value of NVIDIA’s networking stack (e.g., Spectrum-X fabrics, BlueField DPUs) in real customer environments.

Skills

Required

BS/MS/PhD in Electrical/Computer Engineering, Computer Science, or other Engineering fields or equivalent experience
6+ years of hands-on network engineering experience in data center or cloud environments
Proven, expert-level troubleshooting of data center networks (packet-level, control plane, and fabric behavior)
Deep protocol knowledge of BGP, OSPF, and L2/L3 switching in large-scale data center or cloud networks (ECMP, Clos/leaf–spine)
Experience with high-density switching at cloud or hyperscale is strongly preferred
Experience with InfiniBand or RoCE is a major plus
Solid understanding of CPU/GPU server architecture, NICs, Linux, system software, and kernel drivers
Strong time management and ability to context-switch across multiple customers and projects
Excellent written and verbal communication, including clear design docs, customer presentations, and root-cause summaries

Nice to have

Advanced certifications: CCIE, JNCIE, or equivalent expert-level certifications
Automation & tooling: Experience in Python, Bash, or C/C++ for automating network workflows, validation, and debug
NVIDIA platform experience: Hands-on work with NVIDIA GPUs, NICs, DPUs, or ARM-based CPU platforms
Customer-facing background: Pre-sales, post-sales, field engineering, or consulting experience with external enterprise or cloud customers
Large-scale deployments: Direct experience bringing up and operating large clusters or supercomputing environments
Virtualization / cloud: Familiarity with virtualization, containers, and cloud networking concepts

NVIDIA is looking for an experienced Network Solutions Architect Engineer to help bring our next-generation AI networking platforms into production at customer data centers. Do you want to be part of a team that brings new AI hardware and software technologies to production in customer data centers? As part of the NVIDIA SA organization, you will be driving deployment of our end-to-end technology solutions integration at some of NVIDIA's most strategic technology customers, as well as offering recommendations to business and engineering teams on our product roadmap.

What you will be doing:

Partner with AI-native / consumer internet customers on large data center GPU and networking deployments. Guide architecture decisions across network, compute, and storage, including fabric design. Support on-site bring-up of server, network, and cluster infrastructure in customer data centers.
Demonstrate expertise on advanced GPU and network systems (Spectrum-X, BlueField DPU, InfiniBand/RoCE, etc.) for key accounts. Run regular technical account reviews covering roadmap alignment, cluster issues, feature discussions, and new technology introductions. Capture customer-specific requirements and translate them into concrete feedback for product, architecture, and engineering teams.
Analyze and debug configuration and performance issues in RoCE and InfiniBand environments. Work across NICs, switches, Linux, and system software to deliver performant, reliable AI clusters.
Identify and shape new project opportunities for NVIDIA GPUs, networking, and software in AI and data center use cases. Collaborate closely with Systems Engineering, Product Management, and Sales to align solutions with customer outcomes. Build targeted POCs that showcase the value of NVIDIA’s networking stack (e.g., Spectrum-X fabrics, BlueField DPUs) in real customer environments.

What we need to see:

BS/MS/PhD in Electrical/Computer Engineering, Computer Science, or other Engineering fields or equivalent experience.
6+ years of hands-on network engineering experience in data center or cloud environments.
Proven, expert-level troubleshooting of data center networks (packet-level, control plane, and fabric behavior).
Deep protocol knowledge of BGP, OSPF, and L2/L3 switching in large-scale data center or cloud networks (ECMP, Clos/leaf–spine). Experience with high-density switching at cloud or hyperscale is strongly preferred. Experience with InfiniBand or RoCE is a major plus.
Solid understanding of CPU/GPU server architecture, NICs, Linux, system software, and kernel drivers
Strong time management and ability to context-switch across multiple customers and projects.
Excellent written and verbal communication, including clear design docs, customer presentations, and root-cause summaries.

Ways to stand out from the crowd:

Advanced certifications: CCIE, JNCIE, or equivalent expert-level certifications.
Automation & tooling: Experience in Python, Bash, or C/C++ for automating network workflows, validation, and debug.
NVIDIA platform experience: Hands-on work with NVIDIA GPUs, NICs, DPUs, or ARM-based CPU platforms.
Customer-facing background: Pre-sales, post-sales, field engineering, or consulting experience with external enterprise or cloud customers.
Large-scale deployments: Direct experience bringing up and operating large clusters or supercomputing environments.
Virtualization / cloud: Familiarity with virtualization, containers, and cloud networking concepts.

We make extensive use of conferencing tools, but occasional (20%) travel is required for on-site visit to customers and industry events. We are open to remote work location and look forward to have you join our team!

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 2, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.