NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Deep Learning Hardware Modeling Architect - LPU NVIDIA is seeking a Senior Deep Learning Hardware Modeling Architect to optimize AI inference speed and efficiency. The role involves driving architectural specifications, developing written specifications for component-level and system-level designs, and embodying these specifications in an executable model. The candidate will ensure high performance using C++ software practices, solid algorithms, and parallelism, and resolve performance and correctness issues across chip and hardware subsystems. | Serve | 7 |
| Senior AI Infrastructure Engineer - DGX Cloud Senior AI Infrastructure Engineer responsible for designing, building, and maintaining large-scale production systems for NVIDIA's DGX Cloud, focusing on AI training and inferencing platforms. This role involves infrastructure automation, distributed systems, performance characterization, and ensuring reliability and availability of GPU cloud services. | Serve |
| 7 |
| Senior Compiler Engineer - DL NVIDIA is seeking a Senior Compiler Engineer for its Deep Learning Compiler (DLC) team. This role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and hardware teams to accelerate deep learning inference performance. The compiler is critical for data centers, personal devices, automotive, and robotics, aiming for leading inference performance, fast build times, and reduced memory footprints. | Serve | 7 |
| Deep Learning Performance Architect, CUTLASS DSL NVIDIA is seeking an engineer to develop and optimize CUTLASS DSL, a Python-native language for GPU kernel development, and its associated MLIR dialects and lowering passes. The role involves accelerating kernel compilation for NVIDIA's next-generation AI platforms, aiming for performance comparable to CUTLASS C++. | Serve | 7 |
| Senior Software Architect, GPU Networking Research NVIDIA is seeking a Senior Software Architect to focus on GPU Networking Research for accelerating AI workloads and building AI data centers. The role involves leading vision, architecture, design, and proof-of-concept development for future GPU Networking offerings, identifying new technologies, and working with the community. Requires M.Sc./Ph.D. or equivalent experience, 8+ years in systems architecture, and experience in virtualization, networking, storage, and OS drivers. Experience in performance profiling, optimization, and HW offloads is crucial. A research track record and knowledge of Deep Learning frameworks are desirable. | Serve | 7 |
| Senior Software Engineer, CUTLASS Kernels Senior Software Engineer to develop and optimize high-performance deep learning kernels (e.g., GEMM, attention, convolution) using CUTLASS CUDA C++ and Python DSL for NVIDIA GPUs and future architectures. The role involves optimizing kernels for peak throughput, collaborating with various NVIDIA teams (architecture, compiler, libraries, DL frameworks), and requires strong C++ and CUDA experience, understanding of computer architecture, and experience with parallel programming languages targeting accelerators. | Serve | 7 |
| Senior Software Engineer, CUTLASS Performance Senior Software Engineer role focused on optimizing the performance of CUTLASS, a high-performance linear algebra and Tensor Core primitive ecosystem for NVIDIA GPUs. The role involves benchmarking deep learning models, identifying performance gaps, developing tooling for optimization, and acting as a performance representative across NVIDIA teams. | Serve | 7 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architecture, analyze performance of deep learning algorithms on different architectures, identify bottlenecks, and explore new features and hardware capabilities. Requires a strong background in computer architecture and experience with deep learning platforms and frameworks. | Serve | 7 |
| Principal Architect, System Software - Orbital Data Center NVIDIA is seeking a Principal Architect to lead the system software architecture for their Orbital Data Center (ODC) modules, specifically Space-1. This role involves designing and implementing a resilient, production-ready inference platform for the harsh environment of low-Earth orbit, covering the full stack from firmware to AI workloads. The architect will collaborate with hardware teams, drive customer use cases, and ensure the platform operates reliably for 5-year missions, enabling AI adoption in space. | Serve | 7 |
| Software Engineer, TensorRT Specialized Platforms - New College Grad 2025 Software Engineer role focused on developing and optimizing high-performance deep learning inference software (TensorRT) for specialized platforms. Requires strong C++ skills, familiarity with deep learning frameworks, and interest in performance optimization and systems programming. | Serve | 7 |
| Senior Datacenter Performance Model Engineer Develops datacenter-scale performance modeling and prediction tools for AI researchers running AI workloads on GPU clusters. Involves building production tools, automating workflows, and partnering with architects. | Serve | 7 |
| Deep Learning Compiler Engineer - CUDA NVIDIA is seeking a Deep Learning Compiler Engineer to design and implement DSLs and compiler cores for emerging GPU architectures, focusing on optimizing performance for AI/LLM workloads and integrating with AI/ML frameworks. | Serve | 7 |
| Developer Technology Engineer, AI NVIDIA Developer Technology Engineer focused on optimizing AI and deep learning applications on GPU architectures, working with customers to provide AI solutions, and collaborating with internal teams to influence future hardware and software design. | Serve | 7 |
| Senior HPC AI Cluster Engineer NVIDIA is seeking an experienced HPC-AI Engineer to join their Networking Clusters Solutions Infrastructure team. The role involves designing, implementing, and maintaining large-scale HPC/AI clusters, managing job schedulers, developing CI/CD pipelines, and automating infrastructure deployment and monitoring. The engineer will work with cutting-edge hardware and software, support R&D, and engage in POCs for future improvements. | Serve | 7 |
| Senior Power Analysis and Optimization Engineer This role focuses on applying AI, ML, and LLMs to optimize power efficiency in NVIDIA's GPUs and SoCs. The engineer will develop and productionize ML/RL-based models for power analysis and optimization, design and train custom LLMs for interpreting power data and recommending improvements, and apply AI to tune power-efficient configurations. The role involves analyzing power data, partnering with cross-functional teams, and automating flows. | ServeData | 7 |
| Senior Software Engineer — cuEquivariance Senior Software Engineer to join the cuEquivariance team, which builds and ships production GPU kernels and software interfaces for equivariant deep learning. The role involves CUDA kernel engineering, Python library development (PyTorch/JAX), and collaboration with research teams and external framework developers to accelerate geometric neural networks on NVIDIA GPUs. | Serve | 7 |
| Senior System Software Engineer - AI Performance and Efficiency Tools NVIDIA is seeking a Senior System Software Engineer to develop tools for AI researchers and SW/HW teams running AI workloads on GPU clusters. The role involves building internal profiling, analysis, debugging, benchmarking, and simulation tools to improve the performance and efficiency of AI workloads and systems. This includes partnering with HW architects and understanding deep learning frameworks, distributed training/inference, and GPU cluster technologies. | ServeData | 7 |
| Software Solutions Engineer NVIDIA is seeking a Software Solutions Engineer to support NVIDIA AI Enterprise customers. This role involves end-to-end customer issue resolution and building software features, automation, and deployment tooling to enhance product readiness and scalability in cloud and datacenter environments. The engineer will work with compute, cloud-native technologies, and GPU-accelerated AI frameworks, requiring strong debugging, communication, and ownership skills. | ServeAgent | 7 |
| Senior Systems Software Engineer - GPU Performance at Scale Senior Systems Software Engineer focused on GPU performance at scale for AI workloads. This role involves leading performance practices, aligning AI workloads with hardware, developing insights into AI workload performance, debugging complex issues, and collaborating with various software and firmware teams to optimize AI workload performance on NVIDIA GPUs. | Serve | 7 |
| Machine Learning Systems Engineer, Networking Machine Learning Systems Engineer focused on building and optimizing ML algorithms for real-time anomaly detection and health scoring in a large-scale AI data center AIOps platform. The role emphasizes production implementation in systems languages (Go, C/C++, Rust, Scala) for latency-sensitive and resource-constrained environments, processing massive telemetry streams. | Serve | 7 |
| Senior Software Engineer, AI Resiliency Senior Software Engineer to lead the development of AI software resiliency for large-scale AI supercomputers (100,000+ GPUs), focusing on features like fast checkpoint-recovery, error detection/isolation, and straggler/hang detection to minimize cluster downtime. The role involves hands-on C++ and Python coding, debugging, fault tolerance, and collaboration with AI researchers and hardware/software teams, integrating resiliency into AI frameworks like PyTorch and JAX/XLA. Experience with distributed systems, fault tolerance, AI frameworks, and debugging tools is required, with a preference for experience in training models, CUDA/NCCL/MPI, checkpointing strategies, and large-scale AI clusters/HPC. | Serve | 7 |
| Senior Networking Performances Architect NVIDIA is seeking a Senior Networking Performances Architect to shape the future of high-performance and ML/AI computing. This role will analyze network feature performance for AI workloads on large-scale HPC clusters, develop network behavior models, and generate insights for next-generation products. The ideal candidate will have a strong background in system engineering/architecture, performance research, Python, and a good understanding of AI models and large-scale networks. | Serve | 7 |
| Systems Software Engineer - New College Grad 2026 Systems Software Engineer role focused on applying AI and computational methods to accelerate semiconductor manufacturing and design using GPUs. The role involves developing and optimizing complex software solutions, with a strong emphasis on performance and parallel programming. | Serve | 7 |
| Senior Deep Learning Systems Engineer, Datacenters Senior Deep Learning Systems Engineer focused on analyzing and optimizing the performance and power consumption of deep learning applications on datacenter hardware, influencing the design of future AI systems and software stacks. This role involves developing software infrastructure, analysis tools, and profiling methodologies for DL workloads, with a strong emphasis on system architecture and performance analysis. | Serve | 7 |
| Senior HPC and AI Operation Engineer NVIDIA is seeking a Senior HPC and AI Operation Engineer to manage and maintain large-scale HPC/AI clusters, including job scheduling, CI/CD pipelines, and troubleshooting from bare metal to application level. The role involves supporting R&D activities and engaging in POCs, requiring strong Linux administration, scripting, and knowledge of HPC/AI technologies, storage, and networking. | Serve | 7 |
| Senior Developer Relations Manager NVIDIA is seeking a Senior Developer Relations Manager to engage with the China industrial and research community, focusing on integrating GPU-accelerated computing solutions, particularly in Generative AI, Agentic AI, and AI Storage. The role involves understanding community requirements, promoting NVIDIA tools, architecting solutions, and driving adoption of new products within the AI storage ecosystem. | ServeAgent | 7 |
| Senior System Software Engineer - AI Performance and Efficiency Tools Develops internal profiling, analysis, debugging, benchmarking, and simulation tools for AI workloads running on GPU clusters, supporting AI researchers and SW/HW teams to improve performance and efficiency. | ServeData | 7 |
| Senior Developer Technology Engineer - Windows AI Platform Senior Developer Technology Engineer focused on optimizing and deploying AI/GenAI applications on NVIDIA RTX platforms, particularly LLMs on Windows. This role involves working with internal teams and external developers, analyzing performance, conducting training, and improving user experience with OSS software like Llama.cpp and Ollama. Collaboration with driver and architecture teams is key to influencing future GPU features. | ServeAgent | 7 |
| Senior Deep Learning Tools Engineer – CUDA Tile Senior Deep Learning Tools Engineer at NVIDIA focused on performance validation, analysis, and tracking for AI workloads accelerated by CUDA Tile compiler technologies and GPU systems. The role involves designing and developing performance testing frameworks, building automated CI/CD pipelines, implementing benchmarking systems, analyzing performance trends, and collaborating with compiler and architecture teams to resolve performance issues. Requires strong programming skills in Python, experience with CI/CD, deep learning frameworks, and hardware-aware performance analysis. | Serve | 7 |
| Solution Architect, Financial Services NVIDIA is seeking a Solutions Architect for Financial Services to act as a trusted technical advisor to customers, enabling their productivity and driving adoption of NVIDIA's AI technologies. The role involves working with financial institutions, providing technical guidance, developing solution prototypes, and staying updated on industry trends. Requires a BS/MS/PhD in a technical field, 5+ years of AI experience, financial services background, and expertise in coding for NVIDIA GPUs. | Serve | 7 |
| Solution Architect, Energy Solution Architect with deep expertise in AI solutions for the Energy Industry, focusing on efficient use of compute platforms. The role involves being a trusted technical advisor, embedding NVIDIA software into customer architectures, improving application performance, and establishing foundations for next-gen AI systems. Responsibilities include supporting business development, working directly with developers and customers, analyzing architectures for acceleration, providing feedback to engineering/product/research, and delivering trainings/hackathons. | Serve | 7 |
| Senior Systems Software Engineer - GPU Performance at Scale Senior Systems Software Engineer focused on GPU performance at scale for AI workloads, involving collaboration with various hardware and software teams to optimize large-scale computing platforms and deliver insights into AI workload performance. | Serve | 7 |
| Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026 NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to optimize how neural network workloads map onto future NVIDIA platforms. | Serve | 7 |
| Senior Software Developer Senior Software Developer to work on an open-source AI networking acceleration library, focusing on performance-oriented low-level infrastructure for inference, utilizing hardware offloads, GPU Kernels, and RDMA. Requires strong C++/C/Rust, Linux, and networking stack knowledge, with advantages in LLM inference, distributed storage, Linux internals, CUDA, and parallel programming. | Serve | 7 |
| Senior AI Frameworks Engineer NVIDIA is seeking a Senior AI Frameworks Engineer to contribute to the CUTLASS project, focusing on developing a Pythonic interface for high-performance GPU computations. The role involves designing APIs, building compilation infrastructure, optimizing developer experience, and managing production-grade delivery for the open-source community. | Serve | 7 |
| Senior Compiler Engineer - AI NVIDIA is seeking a Senior Compiler Engineer with expertise in machine learning and compiler technologies to focus on applied AI and ML within compilers and development tools. The role involves working with Python, C/C++, Julia, and Lisp/Scheme, with a strong foundation in compilers, code generation, and GPU architecture. Experience with LLVM is a plus. | Serve | 7 |
| Distinguished Software Architect - Deep Learning and HPC Communications Distinguished Software Architect role focused on designing and researching next-generation communication libraries and platforms for Deep Learning and High Performance Computing at NVIDIA. The role involves co-designing HW/SW solutions with GPU, Networking, and SW architects, driving adoption of new communication technologies, and keeping up with DL research. Requires deep expertise in HPC, parallel programming, communication runtimes, system/GPU architecture, and networking, with strong programming skills in C/C++. | Serve | 7 |
| Manager, Next-Generation AI Cluster Architecture Manager for Next-Generation AI Cluster Architecture at NVIDIA, focusing on developing and leading teams to build large-scale AI supercomputing systems, including GPU cluster architectures, networking, and system software. The role involves authoring reference architectures and collaborating on system bring-up and integration. | ServeAgent | 7 |
| Manager, Next-Gen AI Cluster Validation Manager to lead a team developing and validating next-generation NVIDIA AI supercomputing systems, integrating new compute, networking, storage, and software. Focus on building a platform for software development, automation, and performance engineering, and supporting large-scale deployments for AI and HPC. | Serve | 7 |
| GPU Power Architect - New College Grad 2026 NVIDIA is seeking a New College Grad Datacenter GPU Power Architect to contribute to the research and development of energy-efficient GPU and SOC architectures. The role involves developing power estimation models and tools, exploring energy efficiency at GPU and Datacenter levels, and deploying machine learning techniques to model GPU, CPU, Switch, and platform performance and power. The candidate will understand GenAI/HPC workload characteristics to drive HW/SW features for Perf@Watt improvements. | Serve | 7 |
| Senior Software Engineer, Data Center Workloads – Infrastructure Senior Software Engineer focused on developing and executing software-driven characterization workflows for AI workloads on NVIDIA rack-scale systems. The role involves analyzing, characterizing, and optimizing power, performance, and drive behavior across the full stack, including GPUs, CPUs, networking, and system software. Key responsibilities include building automated frameworks for data collection and analysis, investigating system behavior, and supporting new platform bring-up. | Serve | 7 |
| Senior Deep Learning Compiler Engineer NVIDIA is seeking a Senior Deep Learning Compiler Engineer to develop compiler optimization algorithms for deep learning networks. This role involves collaborating with deep learning software framework and hardware architecture teams to accelerate next-generation deep learning software, focusing on public APIs, performance, and compiler infrastructure for neural networks. | Serve | 7 |
| Senior AI Compiler Engineer, Algorithms and Code-Generation NVIDIA is seeking a Senior AI Compiler Engineer to develop compiler optimization algorithms for AI workloads, focusing on delivering leading inference performance on GPUs. The role involves analyzing deep learning networks, optimizing compiler techniques, and working with CUDA and various compiler technologies. | Serve | 7 |
| Senior Software Engineer - AI Research Clusters Senior Software Engineer to build and maintain GPU clusters for internal AI researchers, focusing on reliability, performance, and self-service. The role involves engineering solutions for cluster validation, monitoring, and operation, with an emphasis on AIOps and Agentic AI to reduce operational toil. | Serve | 7 |
| Senior Technical Product Manager - AI Infrastructure Senior Technical Product Manager at NVIDIA focused on AI Infrastructure, driving the roadmap for Enterprise Infrastructure products like NVIDIA DGX and Networking. The role involves defining product strategy, creating go-to-market collateral, and engaging with customers and internal teams to improve AI developer efficiency and data center solutions. Requires deep understanding of AI/ML infrastructure, HPC, cloud technologies, and AI/ML concepts like LLMs and Gen AI. | Serve | 7 |
| Senior Deep Learning Compiler Verification Engineer NVIDIA is seeking a Senior Deep Learning Compiler Verification Engineer to design and build systems for verifying correctness in deep learning compilers, focusing on graph transformations, IR lowering, and GPU execution. The role involves analyzing and validating optimizations, engineering test generation systems using deep learning solutions, and defining quality metrics for evolving models, compiler stacks, and hardware. | Serve | 7 |
| Senior Software Engineer - Deep Learning Compiler CI Infrastructure Senior Software Engineer to own and evolve CI/CD infrastructure for NVIDIA's deep learning compiler stacks. Responsibilities include designing and operating scalable CI systems for ML workloads, delivering performance signals, and applying AI/agent-based workflows to improve developer efficiency and triage. | Serve | 7 |
| Senior Solutions Architect, AI Infrastructure Senior Solutions Architect role at NVIDIA focused on architecting and scaling AI infrastructure, particularly GPU-based systems for deep learning inference, for customers. The role involves technical customer engagement, supporting sales, and providing onsite support. | Serve | 7 |
| Software Engineer, Neural Graphics Developer Tools Software Engineer role focused on developing developer tools for NVIDIA GPUs, specifically bridging AI and graphics for next-generation workflows. The role involves improving existing tools, creating new ones, and collaborating with research and architecture teams to enhance rendering performance and visual quality using AI techniques. | Serve | 7 |
| Senior Software Developer, AI Networking Senior Software Developer focused on AI Networking at NVIDIA, developing communication frameworks, production tools, and benchmarks for large-scale AI training and inference systems. The role involves enabling new AI models, analyzing workloads, designing automation, and collaborating with hardware teams. | ServeData | 7 |