NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| GPU Performance Engineer - Neural Reconstruction GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads, involving PyTorch, CUDA, and GPU profiling to improve training and rendering performance. | ServePost-train | 8 |
| Developer Technology Engineer - AI NVIDIA is seeking an AI Developer Technology Engineer to study and develop cutting-edge deep learning techniques, analyze and optimize performance on GPU architectures, and work with customers to provide AI solutions using GPUs. The role involves close collaboration with internal NVIDIA teams to influence future architectures and software platforms. | Serve |
| 8 |
| Systems Performance Engineer, Agentic AI Workloads – New College Grad 2026 This role focuses on modeling, simulating, and analyzing the system-level performance of agentic AI workloads in datacenter environments. The engineer will develop simulators, characterize LLM serving traffic, identify performance bottlenecks, and provide architectural recommendations for next-generation AI systems. The role requires strong programming skills in C++ and Python, a solid understanding of queueing theory, traffic modeling, and statistics, as well as fundamentals of deep learning and LLM inference serving. | ServeAgent | 8 |
| Software Engineering Manager, Robotics Neural Reconstruction and Real2Sim Applications NVIDIA is seeking an Engineering Manager to lead a team focused on robotics Neural Reconstruction & Real2Sim Applications, advancing technologies for creating digital twins and workflows at scale for physical AI. | ShipData | 8 |
| Senior Applied AI and AI Infrastructure Engineer - Chip Design and DFX Senior Engineer focused on Applied AI and AI Infrastructure for Chip Design and DFX at NVIDIA. The role involves building and managing deployment cycles for ML & Gen AI projects, establishing robust AI infrastructure, and applying AI methods to solve complex problems in Design For Test. Requires expertise in agents, multi-agentic ecosystems, SQL, ETL, data modeling, cloud platforms, and strong programming skills in Python/C++. | AgentServe | 8 |
| Applied AI Engineer - VLSI Design NVIDIA is seeking an Applied AI Engineer to develop and deploy AI agents leveraging LLMs to solve complex problems in VLSI design. The role involves designing and building infrastructure for LLM-powered engineering assistants and multi-turn dialogue systems, fine-tuning models, and integrating them with CAD flows. | Agent | 8 |
| Senior ASIC AI Engineer Develop AI powered methodologies and Agents to generate micro-architecture, RTL, and physical design starting with specification, using AI agents to process large data and existing codebase to generate skills that can be widely used. Evaluate latest Multi-agent collaboration frameworks and apply them to generate area/power/timing/functionally accurate designs for memory system units in the GPU. | Agent | 8 |
| Senior System Software Engineer, Robotics NVIDIA is seeking a Senior System Software Engineer for their Robotics Platform Team, focusing on humanoid robots and embodied intelligence. The role involves integrating robotics software stacks, enabling deployment of foundation models and RL policies, developing validation workflows, and optimizing system metrics. The engineer will work with AI, simulation, and hardware teams to bring up and harden robotic systems. | ShipAgent | 8 |
| Deep Learning Computer Architect - New College Grad 2026 NVIDIA is seeking a Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics. The role involves analyzing DL methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and deep learning kernels. | Serve | 8 |
| Senior Manager, Artificial Intelligence - Machine Learning Platform Senior Manager for AI/ML Platform at NVIDIA, leading the development and management of tools and services for the entire AI/ML project lifecycle, focusing on large-scale model training and deployment efficiency. Requires extensive experience in AI/ML infrastructure, team leadership, and strategic vision for AI platforms. | ServePost-train | 8 |
| Manager, Deep Learning Algorithms Manager to lead engineering activities for productizing Deep Learning models, focusing on implementing and optimizing state-of-the-art algorithms for GPU-accelerated platforms. The role involves leading a team, collaborating with internal partners on roadmap development, and deploying training and inference workloads. | ServeData | 8 |
| Engineering Manager, Inference Benchmarking — AI Perf Engineering Manager for NVIDIA's AIPerf platform, a standard for assessing LLM serving performance. The role involves leading a team to build and advance the platform, focusing on core infrastructure, accuracy of benchmark results, and advising on upstream engine integrations for various AI workloads (LLM, multimodal, diffusion, computer vision). Requires strong systems engineering, inference infrastructure, and open-source community experience. | Serve | 8 |
| AI Computing Development Engineer, TensorRT and TensorRT-LLM NVIDIA is seeking software engineers to develop and optimize AI inference software (TensorRT/TensorRT-LLM) for GPUs. The role involves performance analysis, tuning, integrating new advancements, and collaborating across teams to shape the future of machine learning inferencing. | Serve | 8 |
| Senior Software Engineer, Generative AI Systems Senior Software Engineer role focused on building and scaling Generative AI systems, including LLMs, agentic AI, and RAG pipelines. Responsibilities include designing and developing infrastructure for ML training and inference, creating evaluation frameworks, optimizing RAG pipelines, and building backend services and APIs. Requires strong software engineering fundamentals, experience with ML systems, distributed infrastructure, and GenAI workflows. | AgentServe | 8 |
| GPU Performance Engineer - Neural Reconstruction GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads. This role involves profiling, identifying bottlenecks, and improving performance in CUDA, PyTorch, and C++ for training and rendering, while ensuring reconstruction quality is maintained. It requires strong programming, GPU optimization, and performance analysis skills, with collaboration across research and engineering teams. | ServeData | 8 |
| Senior AI Tools Engineer, SRE Operations - GeForce NOW This role focuses on building and deploying AI/ML tools, specifically LLM- and Agent-based systems, to analyze production data for a global service (GeForce Now). The goal is to automate root cause analysis for incidents and predict future service trends, requiring strong data pipeline management and expertise in AI frameworks. | AgentData | 8 |
| Perception Engineer - Autonomous Driving NVIDIA is hiring a Perception Engineer for their Autonomous Driving team in China. The role involves research, design, and implementation of software features for autonomous driving perception, including DNN improvement, evaluation, and deployment. Requires strong C++/PyTorch, ML/DL techniques for Computer Vision, and experience with perception stacks. Familiarity with DNN development, network acceleration, and GPU computing is a plus. | ShipPost-train | 8 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to develop and optimize GPU-accelerated deep learning inference software, focusing on highly optimized kernels, performance analysis, and tuning. The role involves collaboration across various domains like automotive, image, and speech understanding, and requires strong C/C++ skills and GPU programming experience. | Serve | 8 |
| Principal Machine Learning Engineer, Accelerated Apache Spark This role focuses on applying ML/AI to optimize and accelerate Apache Spark workloads on NVIDIA GPUs, involving performance prediction, adaptive systems, and developing AI agents for system issue resolution and optimization. The role requires significant experience in ML/DL solution design, productionization, and large-scale data processing platforms like Spark, with a focus on LLM/GenAI, reinforcement learning, and adaptive ML systems. | AgentServe | 8 |
| Senior DGX Cloud AI Infrastructure Software Engineer NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to design, build, and maintain AI infrastructure for large-scale AI training and inferencing. The role involves optimizing efficiency and resiliency of AI workloads, developing scalable AI and Data infrastructure tools, and ensuring high availability of AI systems. | ServeData | 8 |
| Director, AI Enablement Director of AI Enablement at NVIDIA, responsible for developing and implementing an AI enablement roadmap to accelerate AI adoption and agentic developments across various internal workflows. The role focuses on building tools, services, and blueprints for NVIDIA AI developers, optimizing AI development, and transforming NVIDIA into an AI-native company. | Agent | 8 |
| AI Software Engineer, Kernel Libraries - New College Grad 2026 AI Software Engineer focused on developing inference systems software stack, including libraries, code generators, and GPU kernels for NVIDIA's hardware. The role involves innovating for efficient AI inference, optimizing kernels, designing abstractions for LLM serving engines, and building JIT compilers and runtimes. Collaboration with internal teams and contributions to open-source projects like FlashInfer, vLLM, and SGLang are expected. | Serve | 8 |
| Senior AI Infrastructure Software Engineer - DGX Cloud NVIDIA is seeking a Senior AI Infrastructure Software Engineer to design, build, and maintain AI platforms for large-scale AI training, inferencing, fine-tuning, and Agentic AI in production. The role involves developing platform and tools for AI/ML workload efficiency, resiliency, and observability, with a focus on distributed systems and Kubernetes. | Serve | 8 |
| Software Engineer - AI Research Clusters Software Engineer to build and maintain GPU clusters for internal AI researchers, focusing on reliability, performance, and self-service. The role involves applying AIOps and Agentic AI to reduce operational toil and support the training, fine-tuning, and deployment of advanced ML models. | Serve | 8 |
| Senior Engineer - AI Agents and Systems Senior Engineer role focused on deploying advanced AI agent frameworks and local runtimes to Windows and NVIDIA GeForce RTX GPUs, ensuring open-source AI agents run locally, safely, and efficiently on consumer PCs, and creating the foundation of the desktop AI operating system. | Agent | 8 |
| Senior Engineer - AI Agents and Systems Senior Engineer role focused on deploying advanced AI agent frameworks and local runtimes to Windows and NVIDIA GeForce RTX GPUs, ensuring open-source AI agents run locally, safely, and efficiently on consumer PCs. The role involves leading development for the foundation of the desktop AI operating system by combining local inference with robust privacy routers and sandboxed execution. | Agent | 8 |
| Manager, Test and Tools Development Engineering Manager for a test and tools development engineering team focused on building autonomous systems and AI-powered quality infrastructure for Omniverse. The role involves leading a team to design agentic test pipelines, multi-agent orchestration for test generation, failure triage, and establishing evaluation frameworks for AI-generated outputs. | Agent | 8 |
| Senior AI Engineer, Agents and Developer Workflows Senior AI Engineer role focused on developing and deploying AI agents and LLM-based solutions to automate software engineering workflows within NVIDIA. The role involves creating tools to improve developer efficiency, accelerate feedback loops, and enhance release reliability, with a focus on predictive modeling for risk identification and leveraging RAG and fine-tuning techniques. | Agent | 8 |
| Senior Performance Compiler Engineer - Triton Senior Performance Compiler Engineer to work on the open-source Triton compiler project, focusing on using compilers to improve AI performance on NVIDIA GPUs for large language models, agents, and other AI applications. The role involves investigating GPU hardware, designing and implementing compiler technology using MLIR to optimize kernel descriptions for efficient GPU code generation, and collaborating with internal teams. | Serve | 8 |
| Senior Systems Engineer, Neural Graphics Senior Systems Engineer role focused on integrating AI and traditional rendering techniques for real-time visual experiences. The role involves taking innovative techniques like AlpaDreams and driving them into production-ready, real-time pipelines, owning the end-to-end path from prototype to shipping product, and solving complex systems challenges related to latency, memory, and throughput. Requires deep expertise in graphics and AI, with a strong track record of shipping impactful products and experience with systems-level thinking. | ShipAgent | 8 |
| Senior Data and AI Solutions Engineer Senior Data and AI Solutions Engineer at NVIDIA to partner with engineering teams, transform data, BI, automation, and agentic AI into measurable engineering productivity gains. Develop and deploy production-grade AI and data solutions, including agentic systems and RAG pipelines, from concept to deployment. | AgentServe | 8 |
| Machine Learning Intern - AI Agents Conversational AI NVIDIA is seeking a Machine Learning Intern to support the development of AI agents for workflow automation and intelligent assistants. The role involves working on conversational AI, LLM applications, RAG systems, and speech AI workflows, building prototypes with NVIDIA AI technologies, and assisting with evaluation and deployment. The ideal candidate is pursuing a degree in a related field, has experience with machine learning, strong Python skills, and familiarity with Linux. | AgentServe | 8 |
| Machine Learning Intern - 2026 NVIDIA is seeking a Machine Learning Intern to assist with developing demonstrations using NVIDIA SDKs, algorithmic development, and AI software development. The role involves keeping up with the latest NVIDIA technology, building demos, and engaging the AI community through workshops. | Serve | 8 |
| Solutions Architect - AI for Drug Discovery NVIDIA seeks a Solutions Architect for their EMEA team to drive AI adoption in drug discovery within the biopharma industry. The role involves acting as a technical advisor to pharmaceutical companies, biotechs, and research organizations, leveraging NVIDIA's computing platform. Responsibilities include building proof-of-concept demonstrations, scaling AI deployments, and supporting business development by guiding customers on production-grade inference, model training, RL, and post-training algorithms. The role also involves exploring foundation models, agentic LLM applications, and physical AI in biopharma, providing feedback to internal teams, and documenting/teaching NVIDIA solutions. | ServePost-train | 8 |
| Senior GPU System Architect Seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out systems for AI and HPC datacenters. The role involves defining system architectures that integrate GPU compute, memory, and interconnects for optimal AI performance and scalability. Requires deep experience in system-level fabric/networking architecture and hardware-software co-design. | Serve | 8 |
| Senior System Software Engineer Senior Software System Engineer to customize and expand NVIDIA’s autonomous driving solutions, delivering and scaling a foundation model-based end-to-end stack for autonomous driving. Involves applied research and development of deep learning models, issue triage, performance maintenance, and collaboration to build a unified production system. | ShipPost-train | 8 |
| Solution Architect, Generative AI NVIDIA is seeking a Solution Architect to promote adoption and provide technical support for their GPU-accelerated computing solutions, focusing on generative AI, machine learning, and deep learning for enterprise clients in Japan. The role involves pre-sales activities, technical support for model training and deployment, and developing solutions for inference and agent-based systems. | ServeAgent | 8 |
| Applied AI Engineer - Silicon Co-Design Group NVIDIA is seeking an Applied AI Engineer to design, develop, and integrate AI/LLM-powered systems into their chip design and automation infrastructure. The role involves architecting and implementing solutions to enhance workflow efficiency, scalability, and intelligence, driving initiatives from concept to deployment. Requires hands-on experience building and deploying ML/AI systems or data-intensive backend services, with a focus on owning AI agents or LLM-powered workflows end-to-end. | Agent | 8 |
| AI Research Engineer - Applied Scientist Compilers AI Research Engineer/Applied Scientist focused on Compilers/Low-level optimization to develop AI compiler solutions for NVIDIA's software stack and GPU acceleration. Responsibilities include applying AI to compilation, implementing AI-based solutions for GPU programming, building training pipelines (fine-tuning, RL), defining model I/O, developing evaluation frameworks, prompt engineering, integrating learned policies, prototyping models, creating datasets, and applying RL for optimization. | Post-trainServe | 8 |
| Senior Research Engineer, Robotics Systems Senior/Principal Engineer in robotics systems, focusing on foundation models and full-stack technology for humanoid robots. Responsibilities include designing teleoperation software, optimizing control stacks, deploying neural network models on hardware, and collaborating on the MLOps lifecycle. Requires strong robotics and software engineering background, with experience in real-time control and deploying ML models on robotic hardware. | ShipData | 8 |
| Senior Perception Engineer - Autonomous Vehicles Senior Perception Engineer at NVIDIA focused on developing and productizing autonomous driving solutions using deep learning and multi-sensor fusion. The role involves applied research, algorithm development, and ensuring solutions meet production requirements for safety, latency, and robustness. | ShipPost-train | 8 |
| Senior Software Engineer, Agentic AI Senior Software Engineer to develop core libraries for Agentic Applications, focusing on building foundational technology, scalable capabilities, reusable blocks, and high-quality libraries to accelerate developer productivity and ensure agent quality. The role involves benchmarking, identifying bottlenecks, and optimizing performance, cost, and latency for agents. Collaboration with teams on data pipelines, RAG, vector databases, and GPU-optimized workflows is expected. | Agent | 8 |
| Software Engineering Intern, AI Tools - Fall 2026 NVIDIA is seeking a Software Engineering Intern to join its AI Tools and Infrastructure team, focusing on Agentic AI. The intern will work with LLMs and orchestration frameworks to design, build, and deploy intelligent agents and AI tools, gaining exposure to the full lifecycle of agent development from conceptualization to deployment. | Agent | 8 |
| Applied AI Engineer - DFT Methodology NVIDIA is seeking an Applied AI Engineer to explore and architect generative AI solutions, including LLMs, RAGs, and Agentic AI workflows, for Design-for-Test (DFT) and VLSI problems. The role involves deploying predictive ML models for silicon lifecycle management and collaborating with VLSI/DFX teams to integrate AI solutions. Experience in applied ML for chip design and deploying generative AI for engineering use cases is required. | Agent | 8 |
| Senior Deep Learning Performance Architect Senior Deep Learning Performance Architect at NVIDIA to design and evaluate hardware architectures for AI/HPC applications, focusing on LLM inference and training performance, and optimizing system bottlenecks. | ServePost-train | 8 |
| Senior Data Center Performance Engineer - Benchmarking and Optimization Senior Data Center Performance Engineer at NVIDIA focused on benchmarking and optimizing data center platforms for AI training, inference, and HPC workloads. Responsibilities include designing benchmarks, characterizing workloads, identifying bottlenecks, and driving performance improvements through system tuning and architectural recommendations. | Serve | 8 |
| Senior GenAI Engagement Lead, Partner Platforms This role focuses on driving the technical integration of Generative AI software with enterprise partners, involving hands-on design and deployment of RAG, LLM inference, and Multi-Agent workflows. The position requires deep technical expertise in AI/ML, partner engagement, and understanding of the GenAI lifecycle, with a focus on production deployments and influencing product roadmaps. | AgentServe | 8 |
| NCX Engineer, AI Accelerator This role focuses on engineering and deploying AI infrastructure and solutions for strategic customers, optimizing large-scale training and inference workloads on NVIDIA's AI platform. It involves MLOps, Kubernetes, GPU scheduling, and performance tuning, with a strong emphasis on customer-facing technical support and collaboration. | ServePost-train | 8 |
| Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026 NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to optimize neural network workloads on future NVIDIA platforms. The role involves building and maintaining high-performance runtime and compiler components, defining workload mappings, integrating with the SW ecosystem, benchmarking, profiling, and collaborating with hardware teams. It also includes prototyping new compilation techniques and publishing technical work. | Serve | 8 |
| Senior AI Solutions Architect NVIDIA is seeking an AI Solutions Architect with deep expertise in AI solutions and scalable data center infrastructure. The role involves embedding NVIDIA software into customer architectures, improving application performance, and establishing technical foundations for next-generation AI systems. Responsibilities include supporting business development, working directly with developers and customers, analyzing architectures for acceleration opportunities, and delivering trainings. | ServeAgent | 8 |