NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Software Engineer - AI Inference Senior Software Engineer focused on optimizing and contributing to open-source LLM inference serving engines like vLLM and SGLang to run efficiently on NVIDIA GPUs, focusing on high-throughput, low-latency inference at scale. | Serve | 9 |
| Senior Software Engineer, RAG and Agentic AI Senior Software Engineer to join the AI Blueprints team to build multimodal, scalable, production-grade reference RAG solutions using Agentic AI and NVIDIA Nemotron models. Develop orchestration layers to dynamically interact with proprietary unstructured and structured data sources. The role involves planning, building, and refining RAG workflows, designing and implementing AI agents for reasoning, planning, and multi-step execution, running POCs, hardening patterns, building and deploying end-to-end RAG pipelines using microservices architecture, and driving continuous improvement through evaluation and performance analysis. |
| 9 |
| Senior Solutions Architect, Autonomous Vehicles - Data Center NVIDIA is seeking a Senior Solutions Architect for Autonomous Vehicles and Robotics to help customers accelerate Physical AI workloads using NVIDIA's full-stack technologies. The role involves engaging with customers to optimize training, simulations, and synthetic data generation for AV perception and planning models, providing technical expertise, and driving full-stack adoption. The candidate will analyze and optimize AI models for GPU performance, build collateral for various AI workflows, and provide technical leadership. Requires 8+ years of ML/DL Infra experience in AVs, proficiency in Python, CUDA/C++, Linux, DevOps tools, and a strong understanding of AV models and simulations. Experience with model deployment at scale and robotics model development is a plus. The role focuses on the data and infrastructure aspects of AI model development and deployment in the AV domain. | DataServe | 9 |
| Solutions Architect, Model Builder - LATAM Solutions Architect focused on building and deploying agentic AI applications and enterprise agents, with a strong emphasis on localization, performance optimization, and leveraging NVIDIA's AI infrastructure. | AgentServe | 9 |
| Principal Engineer - AI Agents and Systems Principal Engineer to lead the deployment of advanced AI agent frameworks and local runtimes on Windows and NVIDIA GPUs, focusing on open-source agents, local inference, privacy, and security for consumer PCs. | AgentServe | 9 |
| Research Scientist, Generative AI for Physical AI - PhD New College Grad 2026 Research Scientist role focused on Generative AI for Physical AI, developing advanced video generative and video-language models, and scaling large-scale training systems for foundation models. Requires a PhD and expertise in PyTorch, diffusion, vision-language, reasoning models, RL, and physics simulation. | PretrainPost-train | 9 |
| Inference Optimization Architect, Speech AI NVIDIA is seeking an Inference Optimization Architect to accelerate and scale Speech AI models, focusing on reducing inference latency, improving throughput, and optimizing resource utilization across AI infrastructure. The role involves implementing model compression techniques, developing custom kernels, designing serving infrastructure, and optimizing inference across diverse GPU platforms. | Serve | 9 |
| Senior Director - GenAI Data Strategy Senior Director role focused on defining and executing a comprehensive data strategy for foundation models, encompassing multi-modal data acquisition, curation, synthetic generation, and alignment techniques like RLHF. This role bridges research insights with data collection to improve model performance and safety, and engages with customers to translate deployment gaps into data priorities. | DataPost-train | 9 |
| Senior Software Engineer - Agentic Memory Senior Software Engineer role focused on developing and researching agentic memory systems, including designing benchmarks, generating synthetic data, running experiments, and contributing to open-source evaluation tools. The role involves partnering with other NVIDIA teams deploying agents and advancing the state of the art in agentic memory evaluation. | AgentEval Gate | 9 |
| Senior Machine Learning Engineer, Perception - Autonomous Driving NVIDIA is seeking a Senior Machine Learning Engineer for their autonomous driving perception team. The role involves designing and developing end-to-end deep learning solutions for perception modules, focusing on road layout detection and other critical driving components. Responsibilities include applied research, data-driven development, and productizing solutions with a focus on safety, latency, and robustness. Experience with deep learning frameworks, Python/C++, and perception for autonomous driving or robotics is required. | ShipData | 9 |
| Senior High-Performance LLM Training Engineer NVIDIA is seeking an experienced Senior High-Performance LLM Training Engineer to optimize LLM training workloads on advanced computing systems. The role focuses on improving the efficiency of NVIDIA's high-performance LLM software stack using frameworks like PyTorch and JAX for training on thousands of GPUs, and influencing future hardware roadmaps. | Data | 9 |
| Senior Robotics Research Engineer, Robotics and AI for Drug Discovery Senior Robotics Research Engineer focused on building physical AI for drug discovery labs, involving robotics simulation, perception, task and motion planning, and training robots for manipulation tasks using imitation and reinforcement learning. | AgentData | 9 |
| Senior Solutions Architect, Generative AI Specialist Senior Solutions Architect specializing in Generative AI, focusing on building and architecting enterprise-grade agentic AI systems, RAG pipelines, and multi-modal workflows. The role involves leading prototyping, proof-of-concept collaborations, and providing technical advisory to sophisticated AI partners, with a strong emphasis on GPU-accelerated inference at scale, production optimization, and creating reusable technical assets. Responsibilities include problem-solving across the AI stack, collaborating with internal teams, and contributing to team growth and practice building. | AgentServe | 9 |
| Solutions Architect, LLM Model Builder Solutions Architect focused on enabling partners to build, benchmark, fine-tune, optimize, and deploy foundation model solutions for customer workloads, with an emphasis on reasoning, multimodal, and production inference. | ServePost-train | 9 |
| Solutions Architect, LLM Model Builder Solutions Architect focused on enabling partners to build, benchmark, fine-tune, optimize, and deploy foundation model solutions for customer workloads, with a strong emphasis on production inference and reasoning/multimodal models. | ServePost-train | 9 |
| Senior Solutions Architect, Generative AI Specialist Senior Solutions Architect specializing in Generative AI, focusing on building and deploying enterprise-grade agentic AI systems, RAG pipelines, and multi-modal workflows with GPU-accelerated inference at scale. The role involves acting as a technical advisor, leading prototyping, architecting solutions, and resolving complex system issues for NVIDIA's advanced AI partners. | AgentServe | 9 |
| Deep Learning Solution Architect NVIDIA is seeking a Deep Learning Solution Architect to drive the research, development, and optimization of Reinforcement Learning algorithms and infrastructure for LLMs and multimodal models. The role involves collaborating with internal teams, improving customer engagements with NVIDIA RL technologies, and developing toolchains and documentation. Requires MS/PhD, 5+ years of experience in RL, LLM training, or multimodal learning, proficiency in PyTorch, and strong engineering skills in distributed training or orchestration. | Post-trainAgent | 9 |
| Solutions Architect, Applied AI Builder This role focuses on building production-grade AI applications and agent systems for enterprises, involving design, orchestration, integration, observability, and deployment on NVIDIA's platforms. The candidate will lead by example as a hands-on developer, creating proof-of-concept solutions and deployable single-agent and multi-agent systems to solve real business problems. | Agent | 9 |
| Senior HPC and AI Networking Performance Research and Analysis Engineer Research and analysis engineer focused on optimizing AI networking performance for large-scale LLM training on distributed GPU clusters, involving profiling, analysis, tool development, and collaboration across hardware and software teams. | PretrainServe | 9 |
| Senior HPC and AI Network Software Architect NVIDIA is seeking a Senior HPC and AI Network Software Architect to design and build scalable AI infrastructure for distributed training and inference. The role involves developing software and hardware approaches to optimize communication efficiency and performance across large-scale systems, collaborating with AI framework teams and hardware teams. | ServePost-train | 9 |
| Senior Manager, Software Engineering - JAX Senior Engineering Manager to define and drive NVIDIA's JAX strategy, coordinating multiple teams to ensure JAX delivers peak performance across heterogeneous hardware (GPUs, CPUs, LPUs). The role involves supporting emerging needs across training, post-training, inference, and robotics, bridging new hardware capabilities with AI trends. Key responsibilities include driving engineering contribution strategy, promoting teamwork, building partnerships with open-source projects, designing processes, and leading a high-performing engineering organization. | ServePost-train | 9 |
| Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026 NVIDIA is seeking a Deep Learning Software Engineer to analyze and improve the performance of their inference ecosystem, focusing on TensorRT and related frameworks. The role involves optimizing inference solutions for various NVIDIA accelerators, developing new model pipelines, and collaborating with cross-functional teams on generative AI, robotics, and vision/speech understanding applications. | Serve | 9 |
| Deep Learning Solution Architect NVIDIA is seeking a Deep Learning Solution Architect to design and optimize production-grade generative AI solutions for enterprise customers, focusing on LLM training, RAG, and agentic inference using NVIDIA's ecosystem. | ServeAgent | 9 |
| Senior Deep Learning Engineer Senior Deep Learning Engineer at NVIDIA to optimize and deploy foundation models for physical AI applications (AVs, robots, video analytics) on GPU platforms, focusing on high-performance inference. | ServePost-train | 9 |
| Senior Applied Scientist - Sovereign AI Senior Applied Scientist/AI Engineer at NVIDIA focusing on Sovereign AI efforts. The role involves end-to-end model training (pre-training, CPT, SFT, alignment), rigorous evaluation and benchmarking, and inference optimization using tools like TensorRT-LLM and NIM. Requires strong Python, PyTorch, and experience with large-scale ML frameworks. | Post-trainServe | 9 |
| Deep Learning Senior Engineer, End-To-End Autonomous Driving NVIDIA is looking for a Deep Learning Senior Engineer to design, implement, and deploy end-to-end autonomous driving systems. The role focuses on leveraging LLMs, VLMs, and VLAs for reasoning and planning, involving model training, pre-training, fine-tuning, and integration into safety-critical vehicle firmware. Experience with production-grade ML models and C++ for deployment is required. | Post-trainAgent | 9 |
| Manager, Large Language Model Inference Manager for Large Language Model Inference at NVIDIA, focusing on developing and optimizing LLM/VLM/VLA inference software for NVIDIA GPUs and hardware platforms. The role involves leading a team in specialized kernel development, runtime optimizations, and frameworks for LLM inference, with a strong emphasis on delivering production-grade, high-performance software. | Serve | 9 |
| Senior Deep Learning Software Engineer, TensorRT Performance NVIDIA is seeking a Senior Deep Learning Software Engineer to analyze and improve the performance of their deep learning inference ecosystem, specifically focusing on TensorRT. The role involves optimizing inference solutions for various NVIDIA accelerators, contributing to inference frameworks, and developing new model pipelines for generative AI and other applications. | Serve | 9 |
| Senior Research Scientist NVIDIA is seeking a Senior Research Scientist to join their applied research team focused on building next-generation Conversational AI systems. The role involves developing new Deep Learning models for ASR, speech synthesis, NMT, and NLP, designing large-scale training algorithms, and open-sourcing models using NeMo. The position requires a PhD, significant research experience in speech recognition or NLP, strong Python and PyTorch skills, and a proven publication record. | Post-trainPretrain | 9 |
| Senior Perception Engineer, Obstacle Foundation Models - Autonomous Vehicles NVIDIA is seeking a Senior Perception Engineer to design and productize its next-generation autonomous driving perception stack. The role focuses on the core 3D obstacle perception pipeline, involving architecture and algorithm design, and hands-on implementation using transformer-based, multi-modal, and vision-language techniques. Responsibilities include developing perception models, building production-grade deep learning models with pretraining and fine-tuning, defining KPI frameworks, contributing to data strategy, and collaborating with safety and systems teams. Requires a PhD/MS/BS with significant relevant experience, proficiency in PyTorch, Python/C++, and experience in data-driven development. Experience with autonomous driving/robotics perception, embedded platforms, optimization, and publications in leading conferences are desirable. | ShipPost-train | 9 |
| Principal Deep Learning Senior Engineer, End-To-End Autonomous Driving NVIDIA is seeking a Principal Deep Learning Senior Engineer to design, implement, and deploy end-to-end autonomous driving systems. The role focuses on leveraging LLMs, VLMs, and VLAs for advanced reasoning and planning in vehicles and robotics, involving model training, pre-training, fine-tuning, and integration into safety-critical systems. | Post-trainAgent | 9 |
| Principal Deep Learning Engineer – Perception, Autonomous Driving Principal Deep Learning Engineer for NVIDIA's Autonomous Driving Perception team, focusing on developing, training, and deploying state-of-the-art perception systems (detection, segmentation, tracking) for vehicles. The role involves leading the end-to-end productization of these models, ensuring high quality and safety, defining data strategy, and providing technical leadership. Requires extensive experience in deep learning for computer vision and shipping commercial DL products. | ShipServe | 9 |
| Senior Deep Learning and Computer Vision Engineer - Autonomous Vehicles Senior Deep Learning and Computer Vision Engineer for Autonomous Vehicles team, focusing on applying state-of-the-art techniques to build ground truth, train deep neural networks, and develop training pipelines and real-time inference run-times for self-driving cars. | DataServe | 9 |
| Senior Software Engineer, AI Inference Systems NVIDIA is seeking a Senior Software Engineer to build and optimize AI inference systems for large-scale models, focusing on extreme efficiency and performance across multi-GPU, multi-node, and multi-cloud environments. The role involves architecting inference stacks, optimizing GPU kernels and compilers, driving benchmarks (MLPerf), and orchestrating large-scale deployments. | Serve | 9 |
| Deep Learning Engineer - LLM and VLM Model Compression NVIDIA is seeking a Deep Learning Engineer with 8+ years of experience to build deep learning frameworks for LLM and VLM model compression. The role involves designing and implementing algorithms for pruning, NAS, and distillation, experimenting with model compression, and collaborating with researchers. Experience with PyTorch, LLM/VLM training or inference, and DL fundamentals are required. Experience with model compression techniques, building DL frameworks, and GPU programming are preferred. The role is based in Poland or Switzerland, with a salary range of 292,500 PLN - 650,000 PLN. | Post-trainServe | 9 |
| Machine Learning Engineer, GeForce G-Assist Machine Learning Engineer at NVIDIA focused on building GeForce G-Assist, an on-device AI assistant. The role involves evaluating and improving SLMs and VLMs, optimizing local inference (e.g., llama.cpp), designing RAG systems, and supporting agentic AI workflows. Requires strong C/C++ and Python skills, experience with local inference frameworks, and knowledge of SLM/VLM architectures and agentic AI patterns. | AgentServe | 9 |
| Principal Perception Engineer, Obstacle Foundation Models - Autonomous Vehicles Principal Perception Engineer at NVIDIA for Autonomous Vehicles, focusing on designing and productizing next-generation 3D obstacle perception stacks using deep learning, transformers, and multi-modal techniques. The role involves technical leadership, hands-on algorithm development, production-grade model development, data strategy, and collaboration with safety and systems teams for large-scale deployment. | AgentData | 9 |
| Senior Deep Learning Communication Architect Senior Deep Learning Communication Architect role focused on optimizing communication performance for large-scale distributed deep learning training and inference. This involves identifying bottlenecks, designing efficient protocols, collaborating on hardware/software co-design, and exploring new communication technologies. The role requires deep understanding of parallelism techniques and experience with DNN frameworks and GPU computing. | ServePost-train | 9 |
| Senior Deep Learning Performance Architect - LPU NVIDIA is seeking a Senior Deep Learning Performance Architect to focus on hardware-software co-design for AI Inference performance. The role involves designing GPU and system architectures, analyzing deep learning algorithms, building performance models, and collaborating with various teams to guide AI direction. | Serve | 9 |
| Senior Systems Software Engineer - Deep Learning Solutions Senior Systems Software Engineer focused on optimizing deep learning inference for autonomous vehicles and robotics on edge devices. Requires deep understanding of model architectures, kernel trace analysis, and evaluation of modern architectures on GPUs/SOCs, with a focus on TensorRT and compiler technology for embedded hardware. | ServePost-train | 9 |
| AI Inference Performance Engineer This role focuses on optimizing and benchmarking Generative AI inference performance on NVIDIA's hardware accelerators, specifically working with frameworks like TensorRT-LLM, SGLang, and vLLM. The engineer will drive industry benchmark results by implementing optimizations in quantization, scheduling, memory management, and distributed inference. They will also define and optimize cutting-edge workloads, architect distributed inference systems from single-GPU to rack-scale, establish performance methodology using profiling, and contribute to open-source projects. The role requires strong programming skills (Python/C++), expertise in DL frameworks, and a deep understanding of LLM/VLM architectures and inference mechanics. | Serve | 9 |
| Senior Deep Learning Scientist, Multimodal Conversational AI Senior Deep Learning Scientist role focused on developing, training, fine-tuning, and deploying streaming multimodal conversational AI systems. This includes speech, audio, vision, voice chat, and action, as well as human-AI interaction. The role involves applying research to define algorithmic improvements and scale them through the Nemotron platform, working on high-impact LLM products. | Post-trainAgent | 9 |
| Senior Deep Learning Engineer - Model Evaluation & AI Systems Senior/Principal Deep Learning Engineer focused on building evaluation methodologies and infrastructure for AI models (LLMs, RAG, agents, vision/multimodal), including contributing to an open-source platform and collaborating with the community. The role involves working with model training, inference, and product teams to provide evaluation signals for release and optimization decisions. | Eval GateAgent | 9 |
| Senior Deep Learning Engineer Senior Deep Learning Engineer at NVIDIA focused on optimizing inference for next-generation AI workloads including multi-agent systems and generative multimodal models. The role involves characterizing emerging workloads and developing novel optimization methods across the inference stack, from algorithmic to system level, on NVIDIA hardware. Collaboration with research, framework development, and silicon architecture teams is key. | ServeAgent | 9 |
| Senior Deep Learning Architect, LLM Inference Senior Deep Learning Architect focused on LLM inference performance optimization, benchmarking, and contributing to deep learning software projects like PyTorch, TRT-LLM, vLLM, and SGLang. Requires strong knowledge of deep learning inference serving, PyTorch, profiling, and GPU microarchitecture. | Serve | 9 |
| Lead Principal Engineer, Enterprise Agentic AI Platform Lead Principal Engineer for Enterprise Agentic AI Platform at NVIDIA, focusing on building and scaling production-grade agentic AI systems, including multi-agent orchestration, memory systems, and evaluation pipelines. Requires deep expertise in distributed systems, Kubernetes, GPU inference, and hands-on coding in Python/Go. | AgentServe | 9 |
| Senior Systems Software Engineer - Deep Learning Solutions Senior Systems Software Engineer focused on deep learning inference optimization for autonomous vehicles and robotics on edge hardware. The role involves analyzing and improving deep learning models on NVIDIA platforms, benchmarking performance, evaluating emerging model architectures, and collaborating with compiler, runtime, and hardware teams to deliver inference solutions. | Serve | 9 |
| Senior Deep Learning Compiler Engineer - XLA Senior Deep Learning Compiler Engineer focused on optimizing inference and training performance for JAX and OpenXLA on NVIDIA GPUs. Develops compiler optimization algorithms, graph partitioning, tensor sharding, and code generation using MLIR, LLVM, and Triton. | ServePost-train | 9 |
| Principal Software Engineer - AI Inference Principal Software Engineer focused on advancing open-source LLM serving, specifically contributing to inference engines like vLLM and SGLang, optimizing them for NVIDIA GPUs and systems to achieve high-throughput, low-latency inference at scale. The role requires deep technical expertise in inference runtime architecture, GPU performance engineering, and distributed systems. | Serve | 9 |
| Senior Research Scientist for Generative AI Senior Research Scientist at NVIDIA focusing on original research in generative AI, including image, video, 3D, and audio generation. The role involves implementing and training large-scale models, building research prototypes, and collaborating with product teams for technology transfer. | Post-trainPretrain | 9 |