Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Deep Learning Performance Software Engineer Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks. | Serve | 9 |
| Research Scientist, AI Accelerator Design and VLSI - New College Grad 2026 Research Scientist role focused on AI Accelerator Design and VLSI, involving AI HW/SW Co-Design, quantization, and applying generative AI to hardware design. Requires a PhD and experience in VLSI, computer architecture, or numerical algorithms for AI. Collaborates on research prototypes and publishes findings. | Serve | 9 |
| Senior DGX Cloud AI Infrastructure Software Engineer |
| ServePost-train |
| 9 |
| Senior GPU Networking Architect This role focuses on building and optimizing GPU communication kernels for large-scale AI systems, linking GPU computing with networking. The Senior GPU Networking Architect will leverage deep knowledge of GPU architecture to improve kernel efficiency, minimize latency, and overlap computation with communication. Responsibilities include developing GPU-resident communication primitives, profiling and tuning kernels, and collaborating with various teams to co-design communication strategies. The role requires strong CUDA programming, GPU architecture fundamentals, and systems-level C/C++ development. | Serve | 9 |
| Senior Software Architect, AI Networking NVIDIA is looking for a Senior Software Architect to design and optimize inference infrastructure for large language models running on GPU clusters. The role involves working across software and hardware domains to define deployment and scaling strategies, optimize latency and throughput, and collaborate with various teams to ensure high-performance solutions. | Serve | 9 |
| Senior Deep Learning Algorithm Engineer Senior Deep Learning Algorithm Engineer at NVIDIA focused on optimizing deep learning training and inference workloads on state-of-the-art hardware and software platforms. The role involves performance analysis, profiling, and implementation of production-quality software, with a focus on squeezing performance from hardware and software stacks. | ServePost-train | 9 |
| Research Scientist, ML Systems - PhD New College Grad 2026 Research Scientist role focused on ML Systems, contributing to hardware, software, and infrastructure for ML systems at various scales. The role involves understanding and developing solutions for efficiency, scaling, and resilience in ML systems, with a focus on co-design of algorithms and systems. Requires a PhD and expertise in areas like OS, distributed systems, inference/training systems, data management, cloud computing, or computer architecture. | ServePost-train | 9 |
| Senior GPU Architect, Deep Learning NVIDIA is seeking a Senior GPU Architect to design and enhance GPU architecture features specifically for deep learning workloads, covering both training and inference. The role involves developing simulators, mapping deep learning algorithms to hardware, and advancing parallel computation. Requires strong C++, C++, Perl, Python programming, and a background in computer architecture and high-performance computing. | Serve | 9 |
| Senior Deep Learning Computer Architect NVIDIA is seeking a Senior Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics algorithms. The role involves analyzing deep learning methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and core deep learning kernels. | Serve | 9 |
| Senior Deep Learning Performance Architect Senior Deep Learning Performance Architect role at NVIDIA focused on developing and analyzing next-generation architectures for AI and HPC applications. This involves performance modeling, simulation, and understanding the interplay of hardware and software for deep learning training and inference. | ServePost-train | 9 |
| Senior Deep Learning Software Engineer, Inference Senior Software Engineer specializing in Deep Learning Inference, focusing on optimizing GPU-accelerated software for large-scale model serving and inference using frameworks like SGLang and vLLM. The role involves performance tuning, implementing latest algorithms, and scaling performance across NVIDIA accelerators. | Serve | 9 |
| Research Scientist, ML Systems - PhD New College Grad 2026 Research Scientist role focusing on ML Systems, contributing to hardware, software, and infrastructure for training, fine-tuning, and serving ML models at scale. Requires a PhD and expertise in systems research areas. | ServePost-train | 9 |
| Senior Software Architect, AI Networking Senior Software Architect role focused on designing and optimizing large-scale LLM inference infrastructure on GPU clusters, involving system-level optimizations for latency, throughput, and cost-efficiency. | Serve | 9 |
| Senior Software Research Architect, AI Networking NVIDIA is seeking a Senior Software Research Architect to improve the framework for large-scale LLM learning and prediction. This role focuses on designing and optimizing systems for generative AI workloads on advanced GPU clusters, specifically leveraging the NVIDIA Spectrum-X Networking Platform to define deployment and scaling strategies. The architect will work on inter-node communication, compute scheduling, and system-level optimization, collaborating with engineers and researchers to enable generative AI technologies in real-world applications. | ServePretrain | 9 |
| AI Computing Software Development Engineer, TensorRT-LLM NVIDIA is seeking a Software Development Engineer for its TensorRT-LLM team to develop and optimize LLM inference software for various platforms. The role involves performance analysis, tuning, and contributing to the architecture and hardware design, with a focus on scaling inference capabilities. | Serve | 9 |
| AI Computing Software Development Engineer, LLM Inference Software Development Engineer focused on LLM inference software (TensorRT LLM and TensorRT Edge LLM) at NVIDIA, involving crafting, scaling, performance analysis, optimization, and tuning of inferencing software for GPUs. The role requires strong C/C++ skills, experience with deep learning frameworks, and collaboration across teams. | Serve | 8 |
| Senior Inference Engineer, AIConfigurator for Dynamo Senior Inference Engineer role focused on optimizing LLM inference deployment configurations using AIConfigurator, integrating GPU systems, model serving, and performance modeling for NVIDIA platforms. | Serve | 8 |
| AI Computing Software Development Engineer, TensorRT NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust, scalable inferencing software for GPUs. The role involves performance analysis, optimization, tuning, and collaborating with various teams to guide the direction of machine learning inferencing. Requires a Masters or higher degree, 2+ years of software development experience, strong C/C++ skills, and familiarity with deep learning frameworks. | Serve | 8 |
| AI Computing Development Engineer, TensorRT and TensorRT-LLM AIGV NVIDIA is seeking software engineers to develop and optimize inferencing software (TensorRT/TensorRT-LLM) for AI computing. The role involves performance analysis, tuning, integrating AI advancements, and collaborating across teams to shape machine learning inferencing on NVIDIA platforms. Requires strong programming skills, experience with deep learning frameworks, and a proactive approach. | Serve | 8 |
| DL System Software Engineer - AI Platform NVIDIA is seeking a DL System Software Engineer to join their AI Platform team. The role involves developing and building solutions for scheduling large-scale AI training and inference workloads on GPU clusters, optimizing performance and efficiency for large models. The engineer will work on core infrastructure, resource management, and GPU scheduling, contributing to NVIDIA's AI platform. | ServePost-train | 8 |
| Software Engineer, AI Networking Architect NVIDIA is seeking an AI Networking Architect to optimize AI workload performance by analyzing AI models, distributed training, and inference workloads, and translating research insights into software, hardware, and networking architecture requirements. The role involves building platforms and simulations to evaluate trade-offs and influence future NVIDIA product roadmaps. | ServeAgent | 8 |
| GPU Performance Engineer - Neural Reconstruction GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads, involving PyTorch, CUDA, and GPU profiling to improve training and rendering performance. | ServePost-train | 8 |
| Developer Technology Engineer - AI NVIDIA is seeking an AI Developer Technology Engineer to study and develop cutting-edge deep learning techniques, analyze and optimize performance on GPU architectures, and work with customers to provide AI solutions using GPUs. The role involves close collaboration with internal NVIDIA teams to influence future architectures and software platforms. | Serve | 8 |
| Systems Performance Engineer, Agentic AI Workloads – New College Grad 2026 This role focuses on modeling, simulating, and analyzing the system-level performance of agentic AI workloads in datacenter environments. The engineer will develop simulators, characterize LLM serving traffic, identify performance bottlenecks, and provide architectural recommendations for next-generation AI systems. The role requires strong programming skills in C++ and Python, a solid understanding of queueing theory, traffic modeling, and statistics, as well as fundamentals of deep learning and LLM inference serving. | ServeAgent | 8 |
| Deep Learning Computer Architect - New College Grad 2026 NVIDIA is seeking a Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics. The role involves analyzing DL methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and deep learning kernels. | Serve | 8 |
| Senior Manager, Artificial Intelligence - Machine Learning Platform Senior Manager for AI/ML Platform at NVIDIA, leading the development and management of tools and services for the entire AI/ML project lifecycle, focusing on large-scale model training and deployment efficiency. Requires extensive experience in AI/ML infrastructure, team leadership, and strategic vision for AI platforms. | ServePost-train | 8 |
| Manager, Deep Learning Algorithms Manager to lead engineering activities for productizing Deep Learning models, focusing on implementing and optimizing state-of-the-art algorithms for GPU-accelerated platforms. The role involves leading a team, collaborating with internal partners on roadmap development, and deploying training and inference workloads. | ServeData | 8 |
| Engineering Manager, Inference Benchmarking — AI Perf Engineering Manager for NVIDIA's AIPerf platform, a standard for assessing LLM serving performance. The role involves leading a team to build and advance the platform, focusing on core infrastructure, accuracy of benchmark results, and advising on upstream engine integrations for various AI workloads (LLM, multimodal, diffusion, computer vision). Requires strong systems engineering, inference infrastructure, and open-source community experience. | Serve | 8 |
| AI Computing Development Engineer, TensorRT and TensorRT-LLM NVIDIA is seeking software engineers to develop and optimize AI inference software (TensorRT/TensorRT-LLM) for GPUs. The role involves performance analysis, tuning, integrating new advancements, and collaborating across teams to shape the future of machine learning inferencing. | Serve | 8 |
| GPU Performance Engineer - Neural Reconstruction GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads. This role involves profiling, identifying bottlenecks, and improving performance in CUDA, PyTorch, and C++ for training and rendering, while ensuring reconstruction quality is maintained. It requires strong programming, GPU optimization, and performance analysis skills, with collaboration across research and engineering teams. | ServeData | 8 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to develop and optimize GPU-accelerated deep learning inference software, focusing on highly optimized kernels, performance analysis, and tuning. The role involves collaboration across various domains like automotive, image, and speech understanding, and requires strong C/C++ skills and GPU programming experience. | Serve | 8 |
| Senior DGX Cloud AI Infrastructure Software Engineer NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to design, build, and maintain AI infrastructure for large-scale AI training and inferencing. The role involves optimizing efficiency and resiliency of AI workloads, developing scalable AI and Data infrastructure tools, and ensuring high availability of AI systems. | ServeData | 8 |
| AI Software Engineer, Kernel Libraries - New College Grad 2026 AI Software Engineer focused on developing inference systems software stack, including libraries, code generators, and GPU kernels for NVIDIA's hardware. The role involves innovating for efficient AI inference, optimizing kernels, designing abstractions for LLM serving engines, and building JIT compilers and runtimes. Collaboration with internal teams and contributions to open-source projects like FlashInfer, vLLM, and SGLang are expected. | Serve | 8 |
| Senior AI Infrastructure Software Engineer - DGX Cloud NVIDIA is seeking a Senior AI Infrastructure Software Engineer to design, build, and maintain AI platforms for large-scale AI training, inferencing, fine-tuning, and Agentic AI in production. The role involves developing platform and tools for AI/ML workload efficiency, resiliency, and observability, with a focus on distributed systems and Kubernetes. | Serve | 8 |
| Software Engineer - AI Research Clusters Software Engineer to build and maintain GPU clusters for internal AI researchers, focusing on reliability, performance, and self-service. The role involves applying AIOps and Agentic AI to reduce operational toil and support the training, fine-tuning, and deployment of advanced ML models. | Serve | 8 |
| Senior Performance Compiler Engineer - Triton Senior Performance Compiler Engineer to work on the open-source Triton compiler project, focusing on using compilers to improve AI performance on NVIDIA GPUs for large language models, agents, and other AI applications. The role involves investigating GPU hardware, designing and implementing compiler technology using MLIR to optimize kernel descriptions for efficient GPU code generation, and collaborating with internal teams. | Serve | 8 |
| Machine Learning Intern - 2026 NVIDIA is seeking a Machine Learning Intern to assist with developing demonstrations using NVIDIA SDKs, algorithmic development, and AI software development. The role involves keeping up with the latest NVIDIA technology, building demos, and engaging the AI community through workshops. | Serve | 8 |
| Solutions Architect - AI for Drug Discovery NVIDIA seeks a Solutions Architect for their EMEA team to drive AI adoption in drug discovery within the biopharma industry. The role involves acting as a technical advisor to pharmaceutical companies, biotechs, and research organizations, leveraging NVIDIA's computing platform. Responsibilities include building proof-of-concept demonstrations, scaling AI deployments, and supporting business development by guiding customers on production-grade inference, model training, RL, and post-training algorithms. The role also involves exploring foundation models, agentic LLM applications, and physical AI in biopharma, providing feedback to internal teams, and documenting/teaching NVIDIA solutions. | ServePost-train | 8 |
| Senior GPU System Architect Seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out systems for AI and HPC datacenters. The role involves defining system architectures that integrate GPU compute, memory, and interconnects for optimal AI performance and scalability. Requires deep experience in system-level fabric/networking architecture and hardware-software co-design. | Serve | 8 |
| Solution Architect, Generative AI NVIDIA is seeking a Solution Architect to promote adoption and provide technical support for their GPU-accelerated computing solutions, focusing on generative AI, machine learning, and deep learning for enterprise clients in Japan. The role involves pre-sales activities, technical support for model training and deployment, and developing solutions for inference and agent-based systems. | ServeAgent | 8 |
| Senior Deep Learning Performance Architect Senior Deep Learning Performance Architect at NVIDIA to design and evaluate hardware architectures for AI/HPC applications, focusing on LLM inference and training performance, and optimizing system bottlenecks. | ServePost-train | 8 |
| Senior Data Center Performance Engineer - Benchmarking and Optimization Senior Data Center Performance Engineer at NVIDIA focused on benchmarking and optimizing data center platforms for AI training, inference, and HPC workloads. Responsibilities include designing benchmarks, characterizing workloads, identifying bottlenecks, and driving performance improvements through system tuning and architectural recommendations. | Serve | 8 |
| NCX Engineer, AI Accelerator This role focuses on engineering and deploying AI infrastructure and solutions for strategic customers, optimizing large-scale training and inference workloads on NVIDIA's AI platform. It involves MLOps, Kubernetes, GPU scheduling, and performance tuning, with a strong emphasis on customer-facing technical support and collaboration. | ServePost-train | 8 |
| Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026 NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to optimize neural network workloads on future NVIDIA platforms. The role involves building and maintaining high-performance runtime and compiler components, defining workload mappings, integrating with the SW ecosystem, benchmarking, profiling, and collaborating with hardware teams. It also includes prototyping new compilation techniques and publishing technical work. | Serve | 8 |
| Senior AI Solutions Architect NVIDIA is seeking an AI Solutions Architect with deep expertise in AI solutions and scalable data center infrastructure. The role involves embedding NVIDIA software into customer architectures, improving application performance, and establishing technical foundations for next-generation AI systems. Responsibilities include supporting business development, working directly with developers and customers, analyzing architectures for acceleration opportunities, and delivering trainings. | ServeAgent | 8 |
| Senior Deep Learning Framework Communications Engineer Senior Deep Learning Framework Communications Engineer at NVIDIA, focusing on integrating and optimizing communication libraries (NCCL, NVSHMEM) within AI frameworks (PyTorch, TRT-LLM, vLLM, JAX) to enhance performance for large-scale AI training and inference. The role involves deep analysis of AI workloads, compiler improvements, and kernel authoring for multi-GPU systems. | Serve | 8 |
| Director, System Software Engineering - Metropolis Accelerated and Inferencing Software NVIDIA is seeking a Director of System Software Engineering to lead teams responsible for the full lifecycle of Vision AI strategy, from model onboarding to production deployment. The role focuses on transforming foundation models into real-time, GPU-accelerated video intelligence systems, scaling multimodal reasoning, and enabling agentic development workflows. Key responsibilities include architecting and operationalizing inference acceleration, driving implementations of frameworks like TensorRT and VLLM, collaborating with partners on custom models, and ensuring performance benchmarking. The ideal candidate has extensive experience in deep learning, GPU optimization, and leading engineering teams in embedded and enterprise platforms. | ServeAgent | 8 |
| Senior Software Architect - Deep Learning and HPC Communications Senior Software Architect role at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software for AI and HPC workloads. The role involves investigating performance bottlenecks, developing new communication technologies, exploring hardware/software co-design, and building proofs-of-concept to drive innovation in large-scale GPU clusters. | Serve | 8 |
| Senior Solutions Architect - Deep Learning Senior Solutions Architect focused on Deep Learning and Agentic AI tools, collaborating with customers to build solutions using NVIDIA technology. Responsibilities include technical sales support, integrating NVIDIA tech into HPC, championing Deep Learning internally, and developing demo solutions. | ServeAgent | 8 |
| Senior Solutions Architect - AI Factory Deployment Senior Solutions Architect focused on deploying and validating AI factories, specifically running and debugging AI/LLM workloads on GPU clusters. Responsibilities include setting up environments, executing benchmarks, resolving performance issues, building observability, and recommending optimizations. | Serve | 8 |