Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| SoC Product Architect, Telecom AI RAN NVIDIA is seeking a Lead SoC Product Architect for their Telecom AI RAN platform, focusing on defining the architecture and roadmap for radio and distributed unit products. The role involves analyzing workloads, driving competitive analysis, synthesizing customer requirements, and collaborating with engineering teams to ensure efficient implementation of AI-native RAN applications. The ideal candidate will have extensive experience in wireless RAN/baseband architecture or SoC product definition, with a strong understanding of 3GPP RAN standards and L1/PHY algorithms. | Serve | 7 |
| Senior System Software Engineer - Neural Graphics Performance Senior System Software Engineer focused on optimizing neural graphics performance, specifically Gaussian Splatting and neural reconstruction algorithms, for applications in robotics, healthcare, and AV development. The role involves implementing and optimizing reconstruction/rendering algorithms using CUDA and Slang, optimizing data processing pipelines, and influencing software architecture for performance. |
| ServeData |
| 7 |
| Developer Relations Manager, Federal Government Developer Relations Manager for NVIDIA's Federal Government sector, focusing on autonomous drone systems. The role involves technical engagement with ISVs, primes, and venture-backed companies to promote NVIDIA's accelerated computing stack for edge inference, multi-agent coordination, and synthetic training environments. Requires deep technical credibility in robotics, onboard autonomy, or flight software, with a proven record of influencing partner roadmaps and delivering on DoD timelines. | ServeData | 7 |
| Senior System Software Engineer - Dynamo-Triton Inference Server Senior System Software Engineer to work on Dynamo-Triton Inference Server, a GPU-accelerated AI inference serving platform. The role involves developing high-performance inference software, contributing to feature development, driving customer adoption, and optimizing throughput and latency for both LLM and non-LLM workloads. | Serve | 7 |
| Senior Developer Technology Engineer - Windows AI Platform Senior Developer Technology Engineer focused on optimizing AI GPU deployment on the NVIDIA RTX platform for enterprise and consumer AI applications. This role involves profiling, debugging, training, and enhancing open-source LLM and GenAI software on Windows, collaborating with internal teams and external partners to improve performance and user experience. | Serve | 7 |
| Senior Solutions Architect, NVIDIA Cloud Partners - Mexico This role involves acting as a technical advisor and driver for customers and partners in designing, implementing, and deploying large-scale AI/HPC GPU infrastructure and applications. It focuses on integrating libraries, frameworks, and models, and delivering GenAI, AI, and ML solutions to production, with practical expertise in fine-tuning and deploying models. | ServePost-train | 7 |
| Senior Software Performance Engineer - AV Platform Senior Software Performance Engineer for Autonomous Vehicles platform, focusing on optimizing latency and throughput of L2/L3/L4 autonomous driving solutions on NVIDIA's heterogeneous hardware architectures. Requires strong C++ skills, parallel programming, performance analysis, and experience with GPGPU/CUDA. | ServeAgent | 7 |
| Software Engineering Intern, JAX - Fall 2026 Internship role focused on developing performance optimizations for deep learning frameworks using JAX, contributing to core components and tools for the NVIDIA AI platform. | Serve | 7 |
| Senior AI and ML HPC Cluster Engineer This role focuses on designing, implementing, and managing large-scale GPU compute clusters for AI/ML and HPC workloads. It involves infrastructure engineering, automation, and supporting researchers with performance analysis and optimization. The role requires expertise in cluster management, Linux administration, container technologies, scripting, and MPI workflows. | Serve | 7 |
| Developer Technology Engineer – AI NVIDIA Developer Technology Engineer focused on optimizing deep learning and machine learning workloads on NVIDIA's accelerated computing platform (GPU, CPU, DPU) for key customers. Requires strong C/C++ and CUDA experience, with an MS/PhD in CS or related field. | Serve | 7 |
| Manager, Solutions Architecture - Continuous Bringup and Optimization Manager of Solutions Architecture focused on leading a team to consult, optimize, and improve the resiliency of customer AI factory infrastructures, including GPU-accelerated systems and AI workloads. The role involves hands-on infrastructure analysis, tuning, and establishing optimization/monitoring methodologies for large-scale AI/HPC systems. | Serve | 7 |
| Manager, Software Architecture Manager for a systems and networking engineering team focused on building distributed AI communication systems (libraries, frameworks, system integrations) for GPUs, nodes, and storage. The role involves setting technical direction, leading execution, and fostering technical excellence within the team, with a focus on AI infrastructure problems. | Serve | 7 |
| Senior Performance Engineer Senior Performance Engineer at NVIDIA focusing on optimizing AI and HPC workloads on GPU/CPU clusters. Responsibilities include profiling, benchmarking, identifying bottlenecks, and developing performance analysis tools, with a strong emphasis on high-performance networking and telemetry. | Serve | 7 |
| AI Factory CPU focused Solutions Architect This role focuses on designing, building, and maintaining large-scale HPC and AI infrastructure, specifically CPU-based solutions within the NVIDIA AI Factory. The Solutions Architect will enable customers in adopting end-to-end AI solutions, operationalizing large compute resources, and overcoming adoption barriers. The role involves deep technical understanding of NVIDIA's stacks and AI workflows. | ServeAgent | 7 |
| Senior Solutions Architect, AI Factory Infrastructure This role focuses on designing and implementing AI infrastructure solutions for customers, with a strong emphasis on AI inference at scale and physical AI simulations. It involves full-stack design, including hardware, workload orchestration, and application performance tuning, for hybrid cloud and on-prem deployments. | ServeAgent | 7 |
| Senior Software Engineer - Verification AI Infrastructure Senior Software Engineer focused on building and optimizing scalable software automation systems with AI/ML integration for NVIDIA's Data Center environments. The role involves developing automation and validation tools, improving system performance, and troubleshooting complex issues in distributed systems. | Serve | 7 |
| Senior Solutions Architect, Cloud Infrastructure and DevOps NVIDIA is seeking a Senior Cloud Infrastructure and DevOps Solutions Architect to advise on and guide the implementation of large-scale computational and AI infrastructure, focusing on Kubernetes-based platforms and automation for AI/HPC systems. | Serve | 7 |
| Senior Software Architect, AI Systems and Networking This role focuses on building and optimizing systems-level software for high-performance communication and memory management libraries essential for distributed AI workloads. It involves hardware-software co-optimization, profiling data movement, and integrating networking capabilities into AI serving stacks, bridging applied research and production engineering. | Serve | 7 |
| Senior Solutions Architect, AdTech and Media NVIDIA Solutions Architect focused on AdTech and Media, helping customers adopt NVIDIA's full-stack accelerated computing platform. This role involves technical advisory, proof-of-concept evaluations, deep analysis and optimization of AI/ML models and recommender systems, and translating customer feedback into product insights. The role requires strong Python/C++ coding, understanding of AdTech/MarTech, ML/DL frameworks, and deploying models at scale on cloud or on-premise environments. | ServeAgent | 7 |
| Deep Learning Kernel Software Performance Architect - New College Grad 2026 NVIDIA is seeking a Deep Learning Kernel Software Performance Architect to develop and analyze processor and system architectures that accelerate machine learning and data analytics applications. The role involves debugging deep learning software, developing analysis tools, and collaborating with various NVIDIA teams to optimize performance. | Serve | 7 |
| Solutions Architect, Inference Deployments NVIDIA is seeking a Solutions Architect to deploy and enhance AI inference solutions at scale using GPU technology and Kubernetes. The role involves building inference pipelines, orchestrating disaggregated inference, accelerating inference with various backends, and providing technical leadership to customers for enterprise AI deployments. | Serve | 7 |
| Senior Computer Vision and Deep Learning Hardware Architect NVIDIA is seeking an Autonomous Vehicle Performance Architecture Engineer to design, model, and verify state-of-the-art programmable vision accelerators (PVA) for automotive and robotics. The role involves optimizing software for autonomous driving solutions, analyzing and prototyping applications, building performance models for future architectures, and collaborating with teams to enhance PVA architecture. Requires a Masters/PhD, 3+ years of relevant experience, strong C/C++ and computer architecture skills, and performance modeling/optimization expertise. Experience in DSP programming, autonomous vehicle software, deep learning, computer vision, and self-driving cars is a plus. | ServePost-train | 7 |
| Senior Solutions Architect, NVIDIA Cloud Partners NVIDIA is seeking a Senior Solutions Architect to advise partners on deploying large-scale AI/HPC GPU infrastructure, integrating libraries, frameworks, and models, and delivering GenAI/AI/ML solutions to production. The role involves end-to-end technology solution integration and providing product strategy recommendations. | ServePost-train | 7 |
| Senior Solutions Architect, Financial Services Banking Senior Solutions Architect for Financial Services Banking at NVIDIA, focusing on accelerating High-Performance Computing and AI workloads. The role involves partnering with engineering, product, and sales teams, performing proof-of-concepts, optimizing ML/DL models on GPU architectures, and building collateral for finance industry use cases. Requires deep experience in ML/DL algorithms, frameworks, and deploying models at scale. | ServeAgent | 7 |
| Solutions Architect, AI Cloud Partner Performance NVIDIA Solutions Architect focused on enabling cloud partners to achieve elite performance and reliability for AI workloads, particularly LLM training and inference, by adopting reference architectures and optimizing GPU clusters. | Serve | 7 |
| Senior Software Engineer, NCCL Senior Software Engineer role focused on designing, implementing, and maintaining highly-optimized communication runtimes for Deep Learning frameworks and HPC programming interfaces on GPU clusters. This involves system software development, parallel programming interface contributions, and proof-of-concept creation for new designs and hardware features. | Serve | 7 |
| Manager, Solutions Architecture - Data Center Specialists Manager for a team of infrastructure experts focused on delivering NVIDIA-powered AI Factories, advising partners on large-scale AI/HPC projects, and understanding AI workloads in relation to data center infrastructure. | Serve | 7 |
| Senior Solutions Architect, Financial Services Capital Markets NVIDIA is seeking a Senior Solutions Architect for Financial Services Capital Markets to work with clients on High-Performance Computing and AI workloads. The role involves performing proof-of-concepts, optimizing ML/DL models on GPU architectures, and building collateral for finance industry use cases. Requires strong Python/C++ coding, experience with ML frameworks, and deploying models at scale. | ServePost-train | 7 |
| Senior Software Solutions Architect - NVIS Senior Software Solutions Architect at NVIDIA, focusing on helping customers deploy and integrate NVIDIA's AI and machine learning software stacks (like NVIDIA AI, Run:ai, Mission Control) into their existing MLOps environments. This involves collaborating with infrastructure admins, data scientists, and ML engineers, developing integration scripts, diagnosing performance issues, and improving deep learning model performance within cloud-native environments. | Serve | 7 |
| Senior Manager, Site Reliability Engineering Senior Manager of Site Reliability Engineering to lead and reshape IT operations at scale, building AI-powered systems for reliability, speed, and employee experience. Focuses on transforming Incident, Problem, and Change Management using observability, AI insights, and orchestration to move towards predictive and autonomous operations. | Serve | 7 |
| Senior Solutions Architect, AI Compute – NPN Senior Solutions Architect for AI Compute at NVIDIA, focusing on deploying, managing, and validating AI Compute/HPC infrastructure for enterprise customers and partners. Requires strong Linux system administration, scripting, and cluster management skills, with experience in benchmarking tools and Kubernetes. | Serve | 7 |
| Senior AI Compute Engineer - NVIS This role focuses on deploying, managing, and validating AI Compute/HPC infrastructure in Linux environments for NVIDIA's customers. It involves system design, networking, automation, and customer interaction to support large-scale AI projects. The role requires strong Linux system administration, scripting, and experience with cluster management and benchmarking tools like MLPerf. | Serve | 7 |
| Solutions Architect - AI Networking and Storage Solutions Architect role focused on helping OEM customers build enterprise AI solutions using NVIDIA's AI technology, specifically focusing on the networking and storage demands for Generative AI, LLMs, and Deep Learning. The role involves architecting storage solutions, training partners, and acting as a technical authority on NVIDIA products, with an emphasis on high-performance storage systems and large-scale cluster bring-up. | ServeAgent | 7 |
| Senior Solutions Architect, GPU Performance and LLM - Cloud Service Providers Senior Solutions Architect at NVIDIA focused on helping large customers build and optimize AI/ML and HPC software solutions, particularly involving LLM training and inference on NVIDIA's hardware and software stack. The role involves deep technical engagement with customers, performance analysis, and solution development. | ServePretrain | 7 |
| Manager, AI Networking Performance Research and Analysis Manager for AI Networking Performance Research and Analysis at NVIDIA, focusing on optimizing networking technologies (NIC, Switch) for AI workloads like LLM training and inference. The role involves end-to-end performance strategy, from pre-silicon modeling to GA, and building telemetry frameworks and dashboards for performance tracking and root cause analysis. Requires strong experience in high-performance networking, cluster performance, and managing engineering teams, with a focus on Python, Bash, and C/C++. | ServeAgent | 7 |
| Senior Software Engineer, AI Inference Senior Software Engineer focused on optimizing and scaling AI inference for large language models, working with customers and contributing to open-source projects like vLLM. | Serve | 7 |
| Senior Software Engineer, Machine Learning Inference Senior Software Engineer role focused on designing and implementing inference software optimizations for NVIDIA TensorRT and TensorRT-LLM to accelerate AI applications on NVIDIA GPUs. Involves C++, Python, and CUDA development, collaboration with AI experts, and optimization of deep learning frameworks and compilers. | Serve | 7 |
| Senior Math Libraries Engineer - Sparsity in AI Software engineer to design and develop C++ libraries and tools for unstructured sparsity in Deep Learning (DL) and High-Performance Computing (HPC) on NVIDIA GPUs. This involves DSL specifications, on-demand code generation, and enabling the system in Python/PyTorch. The role focuses on performance evaluation, library quality, and collaboration with product management. | Serve | 7 |
| Senior Software Engineer, JAX Senior Software Engineer focused on performance optimizations for JAX, a deep learning framework, to build a scalable platform for data, training, and analysis. The role involves developing core JAX components, working with AI researchers, and building tools to improve AI system development efficiency. | Serve | 7 |
| Senior AI and FSI Developer Technology Engineer Senior AI and FSI Developer Technology Engineer at NVIDIA focused on optimizing AI and HPC workloads on NVIDIA CPUs and GPUs for the financial services industry. The role involves researching, designing, and developing techniques to accelerate these workloads, profiling and eliminating performance bottlenecks, and collaborating with internal and external experts to influence future hardware and software designs. The engineer will also publish and present their work. | Serve | 7 |
| Developer Relations Manager, Financial Services NVIDIA is looking for a Principal Developer Relations Manager for Financial Services to lead strategic engagement with developers and software providers to accelerate the adoption of NVIDIA's AI and computing platforms. The role involves acting as a technical advisor, integrating NVIDIA's software stack into partner products, guiding partners through onboarding, analyzing the developer ecosystem, and collaborating with internal teams and partners to drive AI adoption in financial services. | Serve | 7 |
| Senior HPC Solutions Architect This role focuses on supporting NVIDIA's AI factory deployments by assisting with the deployment, debugging, and optimization of AI workloads on NVIDIA platforms. The Senior HPC Solutions Architect will work with customers to identify and resolve cluster performance and stability issues, benchmark framework features, and guide customers in scaling workloads on NVIDIA GPUs. The role requires strong networking and system-level understanding, with experience in large-scale training workloads and parallel applications. | Serve | 7 |
| Senior Software Engineer, DL Compilers Senior Software Engineer role focused on building the code generation backend for NVIDIA's deep learning compilers, connecting ML frontends to GPU compilation for high-performance kernel generation. | Serve | 7 |
| Senior Software Engineer - NIM Factory Container and Cloud Infrastructure Senior Software Engineer role focused on container and cloud infrastructure for NVIDIA Inference Microservices (NIMs) and hosted services. The role involves designing and implementing container strategies, building enterprise-grade software for container build, packaging, and deployment, and improving reliability, performance, and scale across thousands of GPUs, with a focus on disaggregated LLM inference. | Serve | 7 |
| Developer Technology Engineer, HPC and AI NVIDIA is seeking a Developer Technology Engineer to research, develop, and optimize deep learning, machine learning, and HPC workloads on NVIDIA's accelerated computing platform. The role involves working with customers and internal teams to address real-world use cases and performance challenges. | Serve | 7 |
| Senior Machine Learning Applications and Compiler Engineer, LPX NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to map neural network workloads onto future NVIDIA platforms. Responsibilities include building and maintaining high-performance runtime and compiler components for end-to-end inference optimization, defining workload mappings, extending the SW ecosystem, benchmarking, profiling, and collaborating with hardware architects. The role involves prototyping new compilation and runtime techniques and publishing technical work. | Serve | 7 |
| Solution Architect – Accelerated Computing Libraries NVIDIA is seeking a Solution Architect to drive the adoption of their AI and accelerated computing libraries across industries. The role involves understanding customer workloads, designing solutions using NVIDIA libraries for LLM inference and training acceleration, and collaborating with product teams to improve features and performance. The candidate will also build technical assets and analyze industry trends. | Serve | 7 |
| Senior Math Libraries Engineer – AI and HPC Senior engineer to join NVIDIA's Math Libraries team, focusing on kernel generation for AI and HPC, specifically matrix operations, JITing, and fusions. The role involves designing and implementing high-performance numerical dense linear algebra software on GPUs, providing technical leadership, and collaborating with product management. | Serve | 7 |
| Senior HPC Cluster Administrator - Deep Learning Frameworks Infrastructure NVIDIA is seeking a Senior HPC Cluster Administrator to manage large-scale GPU compute clusters for deep learning training, inference, and HPC workloads. The role involves full lifecycle management, automation, performance optimization, and collaboration with ML engineers and software teams. | Serve | 7 |
| Senior Manager, GPU Cloud Infrastructure - GeForce NOW Senior Manager to lead the design, scaling, and operations of high-performance networking for GPU-based cloud infrastructure, critical for cloud gaming, AI/ML training, and inference platforms. | Serve | 7 |