Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Deep Learning Senior Engineer, End-To-End Autonomous Driving NVIDIA is seeking a Deep Learning Senior Engineer to design, implement, and deploy end-to-end autonomous driving systems. The role focuses on AI 2.0, leveraging LLMs, VLMs, and VLAs for reasoning and planning in autonomous vehicles and robotics. Responsibilities include training large-scale models, building and fine-tuning LLM/VLM/VLA systems, exploring data generation strategies, and deploying models in production environments, integrating them with vehicle firmware. | Post-trainAgent | 9 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to analyze, model, and optimize deep learning system performance, particularly for LLM workloads, on state-of-the-art hardware architectures. This role influences future hardware and software design by collaborating with various internal teams. |
| Serve |
| 9 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architectures for edge devices, workstations, and data center GPUs. The role involves benchmarking, performance modeling, bottleneck identification, and exploring new hardware/software capabilities, with a focus on LLMs and generative AI. Experience with AI agents for engineering workflows is also mentioned. | ServePost-train | 9 |
| Deep Learning Performance Software Engineer Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks. | Serve | 9 |
| AI Computing Architect NVIDIA is seeking an AI Computing Architect to develop innovative architectures for deep learning performance and efficiency, analyze trade-offs using models and simulators, and prototype algorithms. The role requires strong programming skills, computer architecture background, and a foundation in machine learning. | ServePost-train | 9 |
| Machine Learning Engineer - Humanoid Robotics Machine Learning Engineer focused on humanoid robotics, developing and advancing foundation models (GR00T, Cosmos) for loco-manipulation, and implementing algorithms for real-world robot deployment. The role involves robot learning, synthetic data generation, and sim-to-real transfer. | AgentData | 9 |
| LLM Reinforcement Learning Framework Engineer NVIDIA is seeking an LLM Reinforcement Learning Framework Engineer to develop and deploy RL algorithms for LLM post-training, focusing on improving reasoning and alignment. The role involves integrating RL components into NVIDIA's LLM stack, crafting experiments, and ensuring production readiness. Requires strong Python, PyTorch, and practical RL experience with LLMs, along with familiarity in async/distributed orchestration. | Post-trainAgent | 9 |
| Deep Learning Solution Architect NVIDIA is seeking a Deep Learning Solution Architect to drive the research, development, and optimization of Reinforcement Learning algorithms and infrastructure for LLMs and multimodal models. The role involves collaborating with internal teams, improving customer engagements with NVIDIA RL technologies, and developing toolchains and documentation. Requires MS/PhD, 5+ years of experience in RL, LLM training, or multimodal learning, proficiency in PyTorch, and strong engineering skills in distributed training or orchestration. | Post-trainAgent | 9 |
| Deep Learning Solution Architect NVIDIA is seeking a Deep Learning Solution Architect to design and optimize production-grade generative AI solutions for enterprise customers, focusing on LLM training, RAG, and agentic inference using NVIDIA's ecosystem. | ServeAgent | 9 |
| Deep Learning Performance Software Engineer Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks. | Serve | 9 |
| Senior Capability Development Engineer NVIDIA is seeking a Senior Capability Development Engineer to develop and enhance internal RAG and Agent platforms for Ops Engineering productivity. The role involves developing, training, fine-tuning, and deploying multimodal LLMs, building LLM-based applications (RAG, TEXT2SQL, Agents), applying advanced tuning techniques, measuring performance, analyzing accuracy/bias, and driving dataset development. Requires strong Python skills, familiarity with ML/DL frameworks and LLMs, and practical experience with LLM training frameworks. | AgentPost-train | 9 |
| Senior LLM Train Framework Engineer NVIDIA is seeking a Senior LLM Train Framework Engineer to contribute to the Megatron Core team, focusing on building and developing open-source frameworks for LLM and Multimodal foundation model pretraining and post-training. The role involves addressing AI training and inference challenges across the model lifecycle, enhancing distributed training strategies, and optimizing performance on NVIDIA GPUs. | PretrainPost-train | 9 |
| Senior AI Training Performance Engineer NVIDIA is seeking a Senior AI Training Performance Engineer to optimize AI training workloads on state-of-the-art hardware and software platforms. The role involves analyzing, profiling, and optimizing performance across the hardware/software stack, implementing production-quality software, and building automation tools. Requires a strong background in deep learning training, computer architecture (especially GPU), performance tuning, and programming in C++, Python, and CUDA. | Data | 9 |
| AI Computing Software Development Engineer, LLM Inference Software Development Engineer focused on LLM inference software (TensorRT LLM and TensorRT Edge LLM) at NVIDIA, involving crafting, scaling, performance analysis, optimization, and tuning of inferencing software for GPUs. The role requires strong C/C++ skills, experience with deep learning frameworks, and collaboration across teams. | Serve | 8 |
| AI Computing Software Development Engineer, TensorRT NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust, scalable inferencing software for GPUs. The role involves performance analysis, optimization, tuning, and collaborating with various teams to guide the direction of machine learning inferencing. Requires a Masters or higher degree, 2+ years of software development experience, strong C/C++ skills, and familiarity with deep learning frameworks. | Serve | 8 |
| AI Computing Development Engineer, TensorRT and TensorRT-LLM AIGV NVIDIA is seeking software engineers to develop and optimize inferencing software (TensorRT/TensorRT-LLM) for AI computing. The role involves performance analysis, tuning, integrating AI advancements, and collaborating across teams to shape machine learning inferencing on NVIDIA platforms. Requires strong programming skills, experience with deep learning frameworks, and a proactive approach. | Serve | 8 |
| Developer Technology Engineer - AI NVIDIA is seeking an AI Developer Technology Engineer to study and develop cutting-edge deep learning techniques, analyze and optimize performance on GPU architectures, and work with customers to provide AI solutions using GPUs. The role involves close collaboration with internal NVIDIA teams to influence future architectures and software platforms. | Serve | 8 |
| Senior System Software Engineer, Robotics NVIDIA is seeking a Senior System Software Engineer for their Robotics Platform Team, focusing on humanoid robots and embodied intelligence. The role involves integrating robotics software stacks, enabling deployment of foundation models and RL policies, developing validation workflows, and optimizing system metrics. The engineer will work with AI, simulation, and hardware teams to bring up and harden robotic systems. | ShipAgent | 8 |
| AI Computing Development Engineer, TensorRT and TensorRT-LLM NVIDIA is seeking software engineers to develop and optimize AI inference software (TensorRT/TensorRT-LLM) for GPUs. The role involves performance analysis, tuning, integrating new advancements, and collaborating across teams to shape the future of machine learning inferencing. | Serve | 8 |
| Perception Engineer - Autonomous Driving NVIDIA is hiring a Perception Engineer for their Autonomous Driving team in China. The role involves research, design, and implementation of software features for autonomous driving perception, including DNN improvement, evaluation, and deployment. Requires strong C++/PyTorch, ML/DL techniques for Computer Vision, and experience with perception stacks. Familiarity with DNN development, network acceleration, and GPU computing is a plus. | ShipPost-train | 8 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to develop and optimize GPU-accelerated deep learning inference software, focusing on highly optimized kernels, performance analysis, and tuning. The role involves collaboration across various domains like automotive, image, and speech understanding, and requires strong C/C++ skills and GPU programming experience. | Serve | 8 |
| Senior DGX Cloud AI Infrastructure Software Engineer NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to design, build, and maintain AI infrastructure for large-scale AI training and inferencing. The role involves optimizing efficiency and resiliency of AI workloads, developing scalable AI and Data infrastructure tools, and ensuring high availability of AI systems. | ServeData | 8 |
| Senior AI Engineer, Agents and Developer Workflows Senior AI Engineer role focused on developing and deploying AI agents and LLM-based solutions to automate software engineering workflows within NVIDIA. The role involves creating tools to improve developer efficiency, accelerate feedback loops, and enhance release reliability, with a focus on predictive modeling for risk identification and leveraging RAG and fine-tuning techniques. | Agent | 8 |
| Senior System Software Engineer Senior Software System Engineer to customize and expand NVIDIA’s autonomous driving solutions, delivering and scaling a foundation model-based end-to-end stack for autonomous driving. Involves applied research and development of deep learning models, issue triage, performance maintenance, and collaboration to build a unified production system. | ShipPost-train | 8 |
| Applied AI Engineer - Silicon Co-Design Group NVIDIA is seeking an Applied AI Engineer to design, develop, and integrate AI/LLM-powered systems into their chip design and automation infrastructure. The role involves architecting and implementing solutions to enhance workflow efficiency, scalability, and intelligence, driving initiatives from concept to deployment. Requires hands-on experience building and deploying ML/AI systems or data-intensive backend services, with a focus on owning AI agents or LLM-powered workflows end-to-end. | Agent | 8 |
| SOC AI Application Engineer — AI Services, Agents and Knowledge Systems NVIDIA is seeking an AI Engineer to build and operate AI application-layer services for SOC design automation, including assistants, retrieval, Q&A, workflow automation, and AI agents. The role involves designing LLM-backed services, building RAG and knowledge systems, applying agent and orchestration patterns, improving developer experience with AI-assisted coding, and owning reliability and evaluation. Requires strong Python, experience shipping services, and hands-on use of LLM frameworks and RAG. | AgentServe | 8 |
| AI Computing Software Development Engineer, TensorRT NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust inferencing software for GPUs, focusing on performance analysis, optimization, and tuning. The role involves collaborating with various teams to guide machine learning inferencing direction and potentially publishing key results. | Serve | 8 |
| Senior Autonomous Driving Software Engineer, L4 Planning Senior engineer to build the main stack for autonomous driving, focusing on prediction, decision-making, planning, and control architecture. This includes crafting system-level safety, implementing end-to-end data-driven AV pipelines, large-scale model inference, and integrating classical and end-to-end hybrid systems for scalability from research to production. Requires 6+ years of experience in production autonomous driving systems, AI foundation models, large-scale ML systems, end-to-end driving models, and robotics/embodied AI system architecture. | AgentServe | 8 |
| AI Computing Development Engineer, TensorRT-LLM NVIDIA is seeking software engineers to develop and optimize inferencing software for AI models, specifically focusing on TensorRT-LLM. This role involves performance analysis, tuning, and collaboration across teams to advance machine learning inferencing capabilities. | Serve | 8 |
| Software Engineering Intern, Automation Infra - 2026 NVIDIA is seeking an intern to build and maintain AI agent skills, MCP tools, and agentic workflows to automate operations. The role involves developing cloud-native services, data automation pipelines, and interactive dashboards. Candidates should have strong Python skills, experience with AI-powered applications like AI agents and RAG pipelines, and familiarity with cloud-native technologies. | Agent | 8 |
| Senior Software Engineer, 3D, 4D Reconstruction Senior Software Engineer role focused on applying deep learning and computer vision to 3D/4D world modeling for autonomous driving products, involving building reconstruction systems, developing evaluation methods, and optimizing neural network performance. | AgentEval Gate | 8 |
| Robotics and Agent Solution Architecture Intern - 2026 Internship role focused on building innovative tools and applications in robotics, agentic modeling, and AI model inference, utilizing NVIDIA SDKs and frameworks. Involves AI engineering, optimization, and exploring new trends in AI and computing acceleration. | AgentServe | 8 |
| Senior Infrastructure and Methodology Engineer for SoC-Clocks NVIDIA is seeking a Senior Infrastructure and Methodology Engineer to optimize chip design workflows by developing AI and agentic applications. The role involves integrating LLM capabilities into various engineering tools, designing agent applications, and establishing evaluation benchmarks. Requires full-stack web development and AI application development experience, with a preference for ASIC knowledge. | Agent | 8 |
| Developer Technology Engineer - AI NVIDIA Developer Technology Engineer focused on optimizing AI workloads, particularly large language models (LLMs), on NVIDIA's GPU platform. The role involves deep dives into application performance, GPU kernel optimization, distributed training and inference, and collaboration with various internal teams and external developers. It requires strong software engineering skills, parallel programming expertise, and a focus on performance analysis and tuning. | ServePost-train | 8 |
| Senior Solutions Architect, CSP System Senior Solutions Architect focused on building and optimizing Kubernetes infrastructure for Agentic AI and Agentic RL workloads, working with Cloud Service Providers in China. | AgentServe | 8 |
| Senior Solutions Architect - KV Cache and AI Storage Senior Solutions Architect focused on building LLM inference platforms using NVIDIA GPUs, KV cache, and tiered memory solutions. The role involves technical exploration with customers, performance analysis, and translating customer needs into product roadmaps. | Serve | 8 |
| Solutions Architect - Top AI Labs Solutions Architect role at NVIDIA focusing on optimizing LLM inference and training acceleration, contributing to open-source frameworks like SGLang and vLLM, and developing KV cache offloading. Requires strong programming, systems fundamentals, and experience in performance analysis. | ServePretrain | 8 |
| Solutions Architect, Generative AI - CSP NVIDIA is seeking an AI-focused Solutions Architect with expertise in LLMs, generative AI, agentic AI, or recommender systems. The role involves providing technical expertise to customers, assisting with GPU infrastructure for AI, optimizing training and inference pipelines, and gathering customer feedback for product development. This position requires 3+ years of experience in AI for large models and proficiency with AI tools. | ServePost-train | 8 |
| Senior Deep Learning Solution Architect Senior Deep Learning Solution Architect at NVIDIA, focusing on LLM inference and training acceleration, performance optimization, and contributing to open-source frameworks like SGLang and vLLM. The role involves developing and optimizing inference frameworks, KV cache offloading, and exploring distributed training performance. | ServePost-train | 8 |
| Developer Technology Engineer, AI NVIDIA Developer Technology Engineer focused on optimizing core parallel algorithms and data structures for GPUs, specifically working with LLM training frameworks and performance optimization. Collaborates with application developers and internal NVIDIA teams to improve performance and developer efficiency. | Data | 8 |
| Solutions Architect - CPU and LPU NVIDIA Solutions Architect focused on optimizing AI inference workloads across CPU, GPU, and LPU platforms for customers. The role involves technical expertise, proof-of-concept development, and optimizing AI efficiency in heterogeneous environments. | ServeAgent | 8 |
| Senior Software Engineer, 3D/4D Reconstruction Senior Software Engineer at NVIDIA focused on 3D/4D reconstruction for autonomous driving products. The role involves building and optimizing systems using deep learning, computer vision, and generative AI techniques, including large geometry models, Gaussian splatting, and diffusion models. Responsibilities include developing reconstruction systems, inventing evaluation methods, creating visualization tools, building automated workflows, and optimizing neural network performance for training and deployment. The goal is to improve reconstruction fidelity and simulation realism for end-to-end driving models. | AgentEval Gate | 8 |
| NIM Solutions Architect This role focuses on deploying and optimizing large models using NVIDIA's Inference Microservice (NIM) and related tools. The Solutions Architect will package optimized models (LLM, VLM, etc.) into containers for deployment, refine NIM tools for the community, and design/implement agentic AI solutions for customer scenarios. The role requires strong programming skills, experience with inference engines, and MLOps practices, with a focus on performance engineering and model optimization. | ServeAgent | 8 |
| Solution Architecture Intern, AI in Industry - 2026 NVIDIA is seeking an AI in Industry Solution Architecture Intern to help optimize large models, develop AI workflows, and deliver advanced AI solutions. The intern will provide technical support, design and implement optimizations for AI models, and set up model training or inference to identify and resolve bottlenecks. This role involves working with various AI models and inference frameworks, conducting research, and collaborating with global teams. | ServePost-train | 8 |
| Performance Engineer Intern, Deep Learning and HPC - 2026 NVIDIA is seeking a Performance Engineer Intern to support performance testing of datacenter products and applications, focusing on AI workloads like LLM training and inference, as well as HPC. The role involves benchmarking, profiling, analyzing performance, developing automation scripts, and collaborating with internal teams. The intern will aggregate and report testing data for sales, marketing, and engineering teams, and assist in developing tools and processes for automated testing. | ServePost-train | 8 |
| Senior Software Engineer, Robotics - Isaac Lab NVIDIA is seeking a Senior Software Engineer for their Isaac Lab team to develop features for a robot learning platform, focusing on reinforcement learning, multi-agent learning, and sim-to-real deployment. The role involves automating workflows, scaling in the cloud, and collaborating with research teams on next-generation robots. | AgentData | 8 |
| Software Engineering Manager, Robotics NVIDIA is seeking a Robotics Software Engineering Manager to lead a team focused on sim-first development, real-world deployment, and continuous learning for physical AI robots, such as Humanoid Robots. The role involves hands-on development, implementation, and deployment of real-time software stacks, fostering innovation, and collaborating with cross-functional teams. | ShipAgent | 8 |
| Deeplearning Software Engineer -- Neural 3D reconstruction Software Engineer role focused on deep learning for neural 3D reconstruction, involving research, design, implementation, optimization, and deployment of DNN models. The role requires C++, PyTorch, and ML/DL techniques, with a preference for experience in DNN development and network acceleration. | ServePost-train | 8 |
| Senior AI Infrastructure Software Engineer Senior AI Infrastructure Software Engineer at NVIDIA, focusing on building and scaling infrastructure for AI agents and applications in chip design. The role involves designing, developing, and improving scalable infrastructure, driving performance and reliability improvements, and collaborating with research and hardware teams. Requires expertise in Python, distributed systems, microservices, and integrating LLMs/agent frameworks. | AgentServe | 8 |
| Senior Manager, Deep Learning Performance Architecture NVIDIA is seeking an Engineering Manager to lead a Deep Learning Performance Architect Team. This role involves managing a team focused on analyzing deep learning networks and advancing deep learning computing systems through hardware/software co-design. Responsibilities include establishing team objectives, collaborating with software framework and hardware architecture teams, characterizing deep learning workloads, performance tuning, optimizing software stacks, and driving the evolution of next-generation hardware and software architectures. | Serve | 8 |