Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
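The change figure above can be reproduced with a short calculation. This is a minimal sketch, assuming the index uses the ordinary relative-change formula and rounds to a whole percent; the function name `percent_change` is hypothetical and not part of any published tooling.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the relative change from `previous` to `current`, in percent."""
    if previous == 0:
        # Relative change is undefined when the baseline is zero.
        raise ValueError("previous must be non-zero")
    return (current - previous) / previous * 100.0


# Posting counts quoted above: 218 in the prior 30 days, 110 in the latest 30 days.
change = percent_change(218, 110)
print(f"{change:.1f}%")      # -49.5%
print(f"{round(change)}%")   # -50%
```

The unrounded value is about -49.5%, which is consistent with the -50% shown once rounded to a whole percent.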
| Title | Stage | AI score |
|---|---|---|
| System Software Architect, AI and GPU Networking NVIDIA is seeking a System Software Architect to research and develop advanced networking solutions for AI data centers, focusing on accelerating AI workloads, inference, and model serving. The role involves enhancing GPU networking offerings, designing optimizations for data movement, and evaluating new technologies. | Serve, Post-train | 8 |
| Developer Technology Engineer - AI NVIDIA is seeking an AI Developer Technology Engineer to collaborate with developers, optimize AI workloads on GPUs, research innovative AI techniques, and ensure peak performance on GPU architectures. The role involves developing and optimizing parallel algorithms and data structures, influencing next-gen architectures, and requires proficiency in C++, AI algorithms, and specific domains like multi-modal models or RL for LLMs. | Serve, Post-train | 8 |
| Senior Software Engineer, Robotics - Isaac Lab Senior Software Engineer to join the Isaac Lab team, focusing on developing a platform for robot learning, including perception-in-the-loop reinforcement learning, multi-agent/multi-task learning, and VLA & RL integration. The role involves sim-to-real efforts, defining training workflows, and collaborating with research teams to advance humanoid robots. | Ship, Data | 8 |
| Senior HPC Performance Engineer - AI for Science at Scale Senior HPC Performance Engineer focused on optimizing large-scale, CUDA-backed ML training frameworks for AI in Science applications, particularly in digital biology and chemistry. The role involves kernel design, GPU porting, distributed learning, and algorithmic improvements within HPC software stacks. | Serve, Post-train | 8 |
| Software Engineer, Metropolis Vision AI Software Engineer for NVIDIA's Metropolis Vision AI team, focusing on building and optimizing large-scale distributed Vision AI platforms for real-time and streaming scenarios. The role involves implementing high-performance pipelines, developing distributed services for video/image/3D data processing, enhancing multi-modal perception, using simulation/synthetic data, and profiling GPU-accelerated inference. Requires strong C++/Python, Linux, computer vision, deep learning, and distributed systems experience, with practical experience in PyTorch for training and deployment. | Serve, Data | 8 |
| Deeplearning Software Engineer -- Neural 3D reconstruction Software Engineer role focused on deep learning for neural 3D reconstruction, involving research, design, implementation, optimization, and deployment of DNN models. The role requires C++, PyTorch, and ML/DL techniques, with a preference for experience in DNN development and network acceleration. | Serve, Post-train | 8 |
| Senior AI Application Developer - GPU and SOC Architecture Modeling Senior AI Application Developer role focused on developing and deploying scalable GenAI applications to accelerate GPU/SOC architecture modeling. The role involves integrating LLMs into existing workflows, collaborating with hardware architects and infrastructure engineers, and researching emerging AI technologies. Requires proficiency in C++, Python, ML frameworks, and hands-on experience with LLMs and multimodal models. | Agent | 8 |
| Senior HPC and AI Networking Performance Research and Analysis Engineer Research Engineer focused on analyzing and optimizing the performance of large-scale distributed Deep Learning LLM training and inference, with a strong emphasis on networking aspects on NVIDIA supercomputers. | Pretrain, Serve | 8 |
| Architect, AI Solutions Engineering NVIDIA is looking for an AI Solutions Architect to scale internal AI platforms and solutions for thousands of developers. The role involves identifying AI opportunities, setting system outcomes, optimizing performance and cost, and collaborating with AI product vendors. Requires strong experience in building large-scale distributed systems and hands-on experience with LLMs, RAG, fine-tuning, and agentic orchestration. | Agent, Serve | 8 |
| Senior High-Performance System Architect NVIDIA is seeking a Senior High-Performance System Architect to define and research NVL system architecture for large-scale, high-performance computing clusters used to train advanced AI models. The role involves working across algorithms, software, firmware, and hardware, collaborating with cross-functional teams, and analyzing simulation results. | Serve, Pretrain | 8 |
| Manager, Deep Learning Algorithms Manager for Deep Learning Algorithms at NVIDIA, focusing on productizing DL models, optimizing inference, and leading engineering teams. The role involves working with LLMs/VLMs, inference optimization, and collaborating across NVIDIA to develop state-of-the-art algorithms for GPU-accelerated platforms. | Serve | 8 |
| Senior Deep Learning Engineer - AI for Wireless Systems NVIDIA is seeking a Senior Deep Learning Engineer to develop AI-native wireless networks, integrating deep learning into signal processing and radio access technologies. The role involves designing, prototyping, implementing, training, and optimizing deep learning models for real-time inference and deployment on GPU platforms, collaborating with researchers and system engineers. | Serve, Post-train | 8 |
| Engineering Manager - AI for RAN and 6G Wireless Systems NVIDIA is seeking an Engineering Manager to lead a team developing AI/ML models for 6G wireless networks. The role involves guiding model development, training, evaluation, and deployment, with a focus on integrating deep learning into signal processing and radio access technologies. Experience with Python, PyTorch/TensorFlow, and leading engineering teams is required. | Serve, Post-train | 8 |
| System Software Engineer - Deep Learning System Software Engineer at NVIDIA focused on accelerating deep learning inference for autonomous driving systems using NVIDIA GPUs and DL accelerators. The role involves developing SDKs/frameworks for LLMs and state-of-the-art models, benchmarking, and optimizing for latency, accuracy, and power consumption. Requires experience with deep learning frameworks, DNN optimization, and C/C++. | Serve, Post-train | 8 |
| Senior AI Infrastructure Software Engineer Senior AI Infrastructure Software Engineer at NVIDIA, focusing on building and scaling infrastructure for AI agents and applications in chip design. The role involves designing, developing, and improving scalable infrastructure, driving performance and reliability improvements, and collaborating with research and hardware teams. Requires expertise in Python, distributed systems, microservices, and integrating LLMs/agent frameworks. | Agent, Serve | 8 |
| Senior LLM Agents Architect Senior LLM Agents Architect at NVIDIA to build and deploy agentic systems integrating LLMs with domain tools for HW/SW engineering workflows. Focus on developing end-to-end agent flows for simulation analysis, kernel optimization, and developer efficiency, including prototyping, integration, evaluation, and mentoring. | Agent | 8 |
| Distinguished Engineer, JAX Distinguished Engineer to develop NVIDIA's AI platform, focusing on performance optimizations in deep learning frameworks using JAX. The role involves designing and implementing core JAX components, driving peak performance on NVIDIA products, and building tools to increase the efficiency of AI-based system development teams. It bridges numerical computing, simulation, and deep learning research with real-world applications. | Serve | 8 |
| Senior Deep Learning Compiler Engineer - PyTorch Senior Deep Learning Compiler Engineer to develop and optimize PyTorch models for NVIDIA GPUs using compiler technology like Thunder, TorchDynamo, and TorchInductor. Focus on performance analysis and contributing to open-source AI ecosystem. | Serve | 8 |
| Senior Software Architect - Deep Learning and HPC Communications Senior Software Architect at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software to accelerate AI and HPC workloads. The role involves investigating performance bottlenecks, exploring HW/SW co-design, building proofs-of-concept, and performing quantitative modeling for large GPU clusters. | Serve | 8 |
| Distinguished Engineer - Dynamo Distinguished Engineer role focused on NVIDIA Dynamo, an AI inferencing platform. The role involves technical leadership, driving product direction, and contributing to open-source projects to achieve state-of-the-art performance and scalability for AI inference across modalities on NVIDIA hardware. | Serve | 8 |
| Principal Software Engineer - Dynamo Principal Software Engineer for NVIDIA Dynamo, an open-source platform for efficient, scalable inference of large language and reasoning models in distributed GPU environments. Focuses on Kubernetes serving, scalability, disaggregated serving, dynamic GPU scheduling, intelligent routing, and distributed KV cache management. | Serve, Agent | 8 |
| Principal Software Engineer – Large-Scale LLM Memory and Storage Systems NVIDIA is seeking a Principal Systems Engineer to design and evolve a unified memory layer for large-scale LLM inference, focusing on KV-cache offload, reuse, and sharing across heterogeneous clusters. The role involves deep integration with LLM serving engines and optimizing performance across GPU, CPU, and storage tiers. | Serve | 8 |
| Senior Software Engineer, Deep Learning - MLIR TRT Senior Software Engineer focused on developing and productizing deep learning solutions for autonomous driving vehicles, specifically involving compiler technology to optimize deep learning inference on NVIDIA hardware. The role requires expertise in deep learning frameworks, compiler technologies, and GPU programming. | Serve | 8 |
| Senior Software Engineer, Real-Time AI and Rendering - Holoscan SDK Senior Software Engineer at NVIDIA to build the future of real-time AI for sensor-driven applications using the Holoscan Platform. The role involves architecting APIs, prototyping GPU-accelerated algorithms for computer vision, imaging, sensor fusion, and low-latency rendering, and integrating generative models and multimodal foundation models into real-time pipelines. Focus on enabling GPU-resident generative methods for perception, simulation, and robotics. | Agent, Serve | 8 |
| Manager, Deep Learning Algorithms Manager for Deep Learning Algorithms at NVIDIA, focusing on leading engineering efforts for productizing DL models, optimizing inference, and collaborating with research teams to implement and improve algorithms. The role involves managing a team, aligning priorities, and developing the GPU-accelerated DL platform. | Serve | 8 |
| Senior Software Architect - Deep Learning and HPC Communications This role focuses on architecting and implementing next-generation communication software and platforms for deep learning and high-performance computing (HPC) applications, specifically targeting the efficient scaling of GPU clusters. The work involves identifying performance bottlenecks, designing new communication technologies, exploring hardware/software co-design, and using simulation to evaluate performance at massive scales. | Serve | 8 |
| Senior Deep Learning Performance Architect NVIDIA is seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures for AI and high-performance computing. Responsibilities include developing HW architectures for performance and energy efficiency, benchmarking AI workloads, creating simulation tools, and evaluating hardware features. Requires MS/PhD or equivalent experience with 4+ years in parallel computing architectures, GPU/ASIC architecture evaluation for training/inference, and strong Python/C++ skills. | Serve | 8 |
| Senior Research Engineer, Simulation NVIDIA is seeking a Senior Research Engineer specializing in physics simulation for their General Embodied Agent Research (GEAR) group, focusing on Project GR00T, an initiative to build foundation models and full-stack technology for humanoid robots. The role involves developing and optimizing simulation environments, implementing control algorithms, building procedural generation pipelines, and deploying learned models to physical robots, with a strong emphasis on sim2real transfer. | Data, Ship | 8 |
| Senior Software Architect, Advanced Development Senior Software Architect focused on accelerating networking and building AI data centers, researching transport functions for AI workloads, and leading architectural efforts in distributed AI, deep learning, HPC, SDN, virtualization, and storage. | Serve | 8 |
| Senior Manager, Deep Learning Performance Architecture NVIDIA is seeking an Engineering Manager to lead a Deep Learning Performance Architect Team. This role involves managing a team focused on analyzing deep learning networks and advancing deep learning computing systems through hardware/software co-design. Responsibilities include establishing team objectives, collaborating with software framework and hardware architecture teams, characterizing deep learning workloads, performance tuning, optimizing software stacks, and driving the evolution of next-generation hardware and software architectures. | Serve | 8 |
| Deep Learning Performance Architect NVIDIA is seeking Software Engineers to join their Deep Learning Inference team, focusing on developing and optimizing GPU-accelerated deep learning kernels for inference. The role involves performance analysis, tuning, and collaboration with cross-functional teams on innovative solutions. | Serve | 8 |
| AI Developer Technology Engineer NVIDIA is seeking an AI Developer Technology Engineer to work on optimizing AI techniques on GPU architectures and collaborate with customers and internal teams to influence future designs. The role involves studying and developing cutting-edge deep learning, graphs, and machine learning techniques, with a focus on performance analysis and optimization for GPUs. The engineer will also work with customers to understand their problems and provide AI solutions using GPUs, and collaborate with NVIDIA's internal teams to shape next-generation architectures and software platforms. | Serve | 8 |
| Director, Engineering – Software Engineering and AI Inferencing Platforms NVIDIA is seeking an Engineering Director to lead and scale software engineering teams in Vietnam, focusing on AI Inferencing Platforms and AI data/factory initiatives. The role involves driving the design, architecture, and delivery of high-performance system software platforms, collaborating with global teams, and overseeing the development and optimization of AI delivery platforms like NIMs and Blueprints. Experience with cloud, data, accelerated computing, and managing large AI/ML product teams is required. | Serve, Data | 8 |
| Senior System Software Architect, HPC and AI Networking NVIDIA is seeking a Senior System Software Architect to design and prototype scalable software systems for distributed AI training and inference, focusing on optimizing throughput, latency, and memory efficiency. The role involves developing and evaluating communication libraries, collaborating with AI framework teams, co-designing hardware features for AI acceleration, and contributing to runtime systems and protocol layers. | Serve, Post-train | 8 |
| Software Engineer, LLM Inference Software Engineer focused on developing and optimizing LLM inference software and frameworks, working with GPU-accelerated libraries and deep learning frameworks. | Serve | 8 |
| Compute Architecture Software Engineer NVIDIA is seeking an LLM Inference Software Engineer to accelerate LLM inference using GPU technology on the TRTLLM project. The role involves developing and optimizing software solutions, implementing GPU-based algorithms, and improving performance across diverse computing environments. | Serve | 8 |
| Software Engineer, cuDNN - Deep Learning Software Engineer role focused on developing and optimizing cuDNN, a GPU-accelerated library for deep neural networks, including LLM support. The role involves performance analysis, tuning, and collaboration with cross-functional teams to innovate across various AI applications. | Serve | 8 |
| Senior AI Storage Software Architect NVIDIA is seeking a Senior AI Storage Software Architect to define and design the next generation of storage solutions for AI workloads, including training, inferencing, KV cache, and RAG. The role involves researching AI storage workloads, optimizing them, designing the storage software stack and APIs, leading POCs, and driving hardware features for DPUs and NICs. Requires 5+ years of storage experience and familiarity with AI applications and technologies. | Serve, Data | 8 |
| Senior Manager, Engineering - AI Developer Tools Senior Engineering Manager to lead a team building and evolving AI developer tools and technology for local and cloud GPUs, focusing on the developer experience for AI workflows and managing AI workloads on accelerated infrastructure. | Serve, Agent | 7 |
| Senior Software Engineer, AI Developer Tools Senior Software Engineer to craft intuitive AI developer tools that make advanced AI workflows accessible and scalable across diverse accelerated infrastructure. | Agent | 7 |
| Senior DL Compiler Engineer -CUDA Tile NVIDIA is hiring a Senior DL Compiler Engineer for the CUDA Tile team. This role involves designing and implementing compiler transformations, developing MLIR-based dialects and lowering passes, and optimizing performance for tile-based kernels on NVIDIA GPUs. The CUDA Tile programming model is a new addition to CUDA, shipped with CUDA 13.1. | Serve | 7 |
| Senior Software Engineer - Storage Software Engineer role focused on designing, building, and operating exascale infrastructure for AI research and development at NVIDIA. The role involves managing distributed systems, large-scale storage, compute orchestration, and automation to support AI workloads across thousands of GPUs and petabytes of storage. | Serve | 7 |
| Principal Developer, AI Networking This role focuses on optimizing AI workloads, specifically LLM training and inference, on large-scale GPU and CPU clusters. The core responsibility is to profile, analyze, and optimize the performance of distributed systems with a strong emphasis on high-performance networking and communication libraries. The engineer will develop tools for performance analysis and collaborate across hardware and software teams to identify and resolve bottlenecks. | Serve, Pretrain | 7 |
| AI Automation Engineer, Security NVIDIA is seeking an AI Automation Engineer to build an AI-native, agent-enabled security organization. The role involves developing AI agents for security programs, building and maintaining infrastructure for agent workflows, translating business needs into agent solutions, architecting integrations for agents to interact with data systems, owning ETL and agentic data pipelines, and ensuring data security and governance. The engineer will also monitor and optimize data infrastructure, pipelines, and agents, and mentor other engineers. | Agent, Data | 7 |
| Software R&D Engineer, RTL Optimization Tools Software R&D Engineer at NVIDIA focused on developing internal EDA tools for RTL optimization. The role involves fusing parallel computing, machine learning, and novel algorithms to improve hardware design productivity. It explores the use of LLMs, GNNs, GANs, and Reinforcement Learning for optimization tasks, and requires strong C++ development skills with a focus on graph-based algorithms and optimization. | Serve | 7 |
| Senior Software Engineer, AI Speed Infrastructure Senior Software Engineer to build AI speed infrastructure for Tegra, focusing on a fast build, test, and validation system. The role involves designing AI-native, self-healing CI workflows, integrating reasoning agents for failure triage and automation, and optimizing the entire code-to-merge pipeline for C/C++ codebases, with a strong emphasis on performance engineering and developer experience. | Agent | 7 |
| Senior Systems Software Engineer, Kubernetes Scale - DGX Cloud Senior Systems Software Engineer focused on scaling NVIDIA DGX Cloud's AI infrastructure, specifically optimizing Kubernetes and distributed inference serving for performance, cost, and reliability. The role involves end-to-end performance characterization, developing automated tests for AI workloads, debugging complex distributed systems, and contributing to open-source communities. | Serve, Agent | 7 |
| Senior Software Engineer, Mapping - Autonomous Vehicles NVIDIA is seeking a Senior Software Engineer for their Autonomous Vehicles mapping team. The role involves designing and developing algorithms for map-based driving products, including architecture design, efficient C++ development, and integrating algorithmic solutions. Key responsibilities include researching and developing transformer-based models for graphs, implementing evaluation frameworks for LLMs, fine-tuning pre-trained models, and building automated map content analysis and map-building workflows. The role requires a background in computer vision, 3D geometry, and machine learning, with heavy AI tool usage for development, and strong prompt-crafting skills. The position is focused on building AI-powered solutions for self-driving cars, with a primary focus on agentic systems for navigation and map content, and secondary involvement in model fine-tuning. | Agent, Post-train | 7 |
| GPU Architect - New College Grad 2026 NVIDIA is seeking new college graduates for its GPU Architecture Group to design and validate GPU profiling and performance telemetry features. The role involves hardware modeling, test development, and infrastructure, with a focus on the world's leading AI platform. Responsibilities include building and maintaining hardware models, writing and executing test plans, contributing to development infrastructure, and collaborating with cross-functional teams. | Serve | 7 |
| GPU System Architect NVIDIA is seeking a GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves defining system architectures that tightly couple GPU compute, memory, and interconnects for optimal AI performance, scalability, and resilience. Responsibilities include architecting system topologies, defining high-speed interconnects, collaborating on RDMA hardware, using system models for analysis, and enabling hardware-software co-design. | Serve | 7 |