Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Autonomous Driving Software Engineer, L4 Planning Senior engineer to build the main stack for autonomous driving, focusing on prediction, decision-making, planning, and control architecture. This includes crafting system-level safety, implementing end-to-end data-driven AV pipelines, large-scale model inference, and integrating classical and end-to-end hybrid systems for scalability from research to production. Requires 6+ years of experience in production autonomous driving systems, AI foundation models, large-scale ML systems, end-to-end driving models, and robotics/embodied AI system architecture. | AgentServe | 8 |
| Deep Learning Architect, LLM Inference - New College Grad 2026 The role focuses on optimizing LLM inference server performance, workload characterization, and benchmarking for NVIDIA's GPUs. It involves collaborating with AI startups, developing performance tools, contributing to deep learning software projects, and guiding inference serving direction. |
| Serve |
| 8 |
| Senior Deep Learning Scientist, Speech Synthesis NVIDIA is seeking a Senior Deep Learning Scientist to work on their Speech AI product, Riva. The role involves training speech synthesis models (mel-spectrogram and vocoder), measuring and analyzing model performance, maintaining the TTS evaluation system, and improving speech data processing and training set preparation. The ideal candidate has a Master's or PhD, 5+ years of ML/AI experience, strong Python and PyTorch skills, and hands-on experience training speech synthesis models. | DataPost-train | 8 |
| Senior Software Engineer - Robotics Senior Software Engineer role at NVIDIA focused on building Physical AI systems for humanoid robots. The role involves defining technical direction for generative AI workflows in robotics, spanning simulation, real-world deployment, and continuous learning. Responsibilities include building the humanoid reference platform, integrating NVIDIA products, and providing technical mentorship. Requires significant robotics software engineering experience, with a focus on AI-powered robots and robot learning. | ShipData | 8 |
| Senior AI-Native Systems Software Engineer, TensorRT Senior engineer to architect and build an AI-native framework using AI agents for software development, focusing on scaling, performance optimization, and integrating SOTA models for inference. | AgentServe | 8 |
| Senior Performance Engineer - LLM Inference Frameworks NVIDIA is seeking a Senior Performance Engineer to optimize LLM inference infrastructure on GPUs, focusing on throughput, memory efficiency, and scalability. The role involves designing and implementing high-performance pipelines, profiling, tuning model execution, and innovating techniques like Speculative Decoding and quantization. Experience with deep learning frameworks and performance debugging is required. | Serve | 8 |
| Applied AI Engineer - DFT Methodology NVIDIA is seeking an Applied AI Engineer to explore and architect generative AI solutions, including LLMs, RAGs, and Agentic AI workflows, for Design-for-Test (DFT) and VLSI problems. The role involves deploying predictive ML models for silicon lifecycle management and collaborating with VLSI/DFX teams to integrate AI solutions. Experience in applied ML for chip design and deploying generative AI for engineering use cases is required. | Agent | 8 |
| OEM Solutions Architect - AI Full Stack Public Sector NVIDIA is seeking a Solutions Architect to be the lead technical authority for Federal partnerships, focusing on deploying Generative AI at scale for U.S. Government agencies. The role involves architecting and optimizing the 'AI Factory,' leading POCs for NVIDIA's AI software stack, and navigating complex Federal security frameworks. The ideal candidate has extensive experience in full-stack data center architecture, the AI lifecycle (data curation, fine-tuning, inference orchestration), and strategic communication with both technical and leadership audiences within the public sector. | ServePost-train | 8 |
| Solutions Architect – OEM AI Solutions Architect role focused on integrating NVIDIA's software and tools with OEM partners' offerings, specifically for AI security and accelerated compute solutions. The role involves designing GPU-accelerated pipelines, developing proof-of-concept technologies, guiding Agentic AI workflows (RAG, GNN), and translating AI/cybersecurity research into deployable architectures for enterprise and government customers. | AgentData | 8 |
| Senior Staff Software Engineer - Agentic Automation Senior Staff Software Engineer to own engineering efforts for NVIDIA enterprise systems, transforming support into AI-infused automated resolution systems using LLM-based agents, tool calling, RAG, and orchestration frameworks. Requires full-stack experience, strong systems thinking, and incident management skills. | Agent | 8 |
| AI Computing Development Engineer, TensorRT-LLM NVIDIA is seeking software engineers to develop and optimize inferencing software for AI models, specifically focusing on TensorRT-LLM. This role involves performance analysis, tuning, and collaboration across teams to advance machine learning inferencing capabilities. | Serve | 8 |
| Senior Software Engineer, JAX Senior Software Engineer to develop NVIDIA's AI platform, focusing on performance optimizations in deep learning frameworks using JAX. The role involves designing and implementing JAX core components, driving performance on NVIDIA products, and building tools to increase efficiency for AI-based systems. | Serve | 8 |
| Software Engineering Intern, Automation Infra - 2026 NVIDIA is seeking an intern to build and maintain AI agent skills, MCP tools, and agentic workflows to automate operations. The role involves developing cloud-native services, data automation pipelines, and interactive dashboards. Candidates should have strong Python skills, experience with AI-powered applications like AI agents and RAG pipelines, and familiarity with cloud-native technologies. | Agent | 8 |
| SOC AI Application Engineer — AI Services, Agents and Knowledge Systems NVIDIA is seeking an AI Engineer to build AI application-layer services for SOC design automation, including assistants, retrieval, Q&A, workflow automation, and AI agents. The role involves designing, implementing, and operating LLM-backed services, building RAG and knowledge systems, applying agent and orchestration patterns, improving developer experience with AI-assisted coding, and owning reliability and evaluation. | AgentServe | 8 |
| Senior Architect - Server Performance NVIDIA is seeking architects to drive architectural performance for its next-generation AI server systems. This position demands a unique capability to bridge deep architectural knowledge, workload analysis, and hands-on silicon investigations. Candidates should be adept at working directly with silicon, high-level models, and simulators. Responsibilities include conducting performance investigations on both NVIDIA and competitive platforms, and developing targeted microbenchmarks to examine specific architectural aspects. The role does not heavily involve modeling tasks (functional or performance), though occasional focused assignments may arise. | Serve | 8 |
| Solutions Architect, Inference Deployments This role focuses on building and deploying AI inference solutions at scale using NVIDIA's GPU technology and Kubernetes. The Solutions Architect will collaborate with engineering, DevOps, and customers to optimize and serve generative AI models, ensuring low-latency inference in enterprise environments. | Serve | 8 |
| Solutions Architect, Agentic AI NVIDIA is seeking Solutions Architects to build and deploy agentic AI applications at scale for enterprises, focusing on integrating enterprise data, developing multi-modal dialogue systems, and task-specific agents. The role involves working with agentic frameworks, providing feedback to improve software products, and educating vertical teams. | Agent | 8 |
| Senior Solutions Architect, Generative AI Senior Solutions Architect role focused on customer engagements, improving AI workload performance, and developing proof-of-concepts for Generative AI solutions (LLMs, recommenders) using NVIDIA software and technologies. Requires strong coding, GPU optimization, and communication skills. | ServeAgent | 8 |
| Senior Software Engineer, 3D, 4D Reconstruction Senior Software Engineer role focused on applying deep learning and computer vision to 3D/4D world modeling for autonomous driving products, involving building reconstruction systems, developing evaluation methods, and optimizing neural network performance. | AgentEval Gate | 8 |
| Principal Deep Learning Communication Architect NVIDIA is seeking a Principal Deep Learning Communication Architect to lead the technical roadmap for communication libraries across next-generation platforms, ensuring seamless scaling of models to massive clusters. The role involves designing and optimizing communication primitives for heterogeneous interconnects, co-designing with application developers and silicon architects, and developing analytical models for system behavior. Expertise in parallel computing, HPC/distributed deep learning, inference engines, and GPU architecture is required. | ServeAgent | 8 |
| Robotics and Agent Solution Architecture Intern - 2026 Internship role focused on building innovative tools and applications in robotics, agentic modeling, and AI model inference, utilizing NVIDIA SDKs and frameworks. Involves AI engineering, optimization, and exploring new trends in AI and computing acceleration. | AgentServe | 8 |
| Senior Infrastructure and Methodology Engineer for SoC-Clocks NVIDIA is seeking a Senior Infrastructure and Methodology Engineer to optimize chip design workflows by developing AI and agentic applications. The role involves integrating LLM capabilities into various engineering tools, designing agent applications, and establishing evaluation benchmarks. Requires full-stack web development and AI application development experience, with a preference for ASIC knowledge. | Agent | 8 |
| Technical Lead, GenAI - Autonomous Vehicles This role is a Technical Lead focused on Generative AI within Autonomous Vehicles, engaging with developer ecosystems and partners to promote NVIDIA's AI platforms. The candidate will act as a technical advisor, develop expertise in NVIDIA's platforms, create enablement resources, and represent partner needs internally. Requires a strong technical background in AI, AV systems, and GenAI model development, with experience in production code, DevOps, and DL/RL frameworks. | Agent | 8 |
| Senior Software Engineer, Computer Vision - Autonomous Vehicles Senior Software Engineer at NVIDIA for Autonomous Vehicles, focusing on Computer Vision and Machine Learning for offline perception tasks. Responsibilities include advancing DL components for training and inference, developing tools for large datasets, and integrating DL algorithms into large-scale pipelines. | DataServe | 8 |
| Developer Technology Engineer - AI NVIDIA Developer Technology Engineer focused on optimizing AI workloads, particularly large language models (LLMs), on NVIDIA's GPU platform. The role involves deep dives into application performance, GPU kernel optimization, distributed training and inference, and collaboration with various internal teams and external developers. It requires strong software engineering skills, parallel programming expertise, and a focus on performance analysis and tuning. | ServePost-train | 8 |
| Senior Solutions Architect, CSP System Senior Solutions Architect focused on building and optimizing Kubernetes infrastructure for Agentic AI and Agentic RL workloads, working with Cloud Service Providers in China. | AgentServe | 8 |
| Senior Architect NVIDIA is seeking a Senior Architect to lead the development of software infrastructure for AI-driven scientific discovery in chemistry and materials science. The role involves shaping NVIDIA ALCHEMI and its ecosystem, translating AI research (ML interatomic potentials, generative modeling) into product direction, and engaging with internal/external stakeholders. The ideal candidate has a PhD or equivalent experience, 8+ years of AI/ML software development for chemistry/materials, strong GPU computing and ML framework experience, and expertise in scientific software architecture. | Ship | 8 |
| AI for Design Engineer Develop and deploy AI agents and frameworks for hardware verification tasks, processing codebases and optimizing retrieval/generation algorithms for enterprise data. | Agent | 8 |
| Engineering Manager, Prediction and Planning - Autonomous Vehicles Engineering Manager for NVIDIA's Autonomous Vehicles division, leading teams to build and scale AI-native autonomous driving systems, integrating classical safety stacks with foundation models and large-scale AI systems from research to production. | ShipAgent | 8 |
| Senior Integration Engineer - Autonomous Vehicles NVIDIA is seeking a Senior Integration Engineer to work on their end-to-end autonomous driving application, focusing on integrating modular software components and optimizing performance on heterogeneous hardware architectures. The role involves defining software architecture for L2/L3/L4 autonomous driving solutions, performing in-vehicle and simulation testing, and developing efficient C++ code using CUDA. | Agent | 8 |
| Senior Integration Engineer - Autonomous Vehicles Senior Integration Engineer for NVIDIA's end-to-end autonomous driving application, focusing on integrating software components, optimizing performance, and developing efficient C++ code on heterogeneous hardware architectures (including GPUs) for L2/L3/L4 autonomous driving solutions. | AgentServe | 8 |
| Senior AI Software Development Engineer, TensorRT-LLM NVIDIA is seeking a Senior AI Software Development Engineer for its TensorRT-LLM team. The role involves crafting and developing robust, scalable inference software for LLMs, focusing on performance analysis, optimization, and tuning. The engineer will write high-quality C++/Python code for the core backend software and collaborate with various teams to guide deep learning inference direction. A strong background in software development, LLM inference techniques, and deep learning frameworks is required. | Serve | 8 |
| Senior Product Manager, AI Frameworks Product Manager for AI Frameworks at NVIDIA, focusing on Recommender Systems and Generative Recommendation Models. The role involves building products for frontier RecSys and Generative Recommendation Models on Nvidia systems, enabling researchers and operators, and pushing the boundaries of what is possible in research-to-production. Responsibilities include creating and optimizing pre-training/inference and post-training frameworks, developing product strategy, roadmaps, and go-to-market plans, and collaborating with internal and external customers. Requires experience with training/inference post-training and optimization software, GenAI/ML concepts, large-scale distributed systems, and technical product management. | Post-trainServe | 8 |
| Senior Product Manager, AI Inference - Dynamo Product Manager for NVIDIA Dynamo, a distributed inference framework for LLMs and Generative AI. Focuses on defining the roadmap for high-scale serving, optimizing hardware-software co-design, and developing agentic inference capabilities. Collaborates with engineering, open-source communities, and customers to integrate model evaluation into workflows. | ServeAgent | 8 |
| AI and FSI Developer Technology Engineer - New College Grad 2026 NVIDIA is seeking an AI and FSI Developer Technology Engineer to optimize AI and HPC workloads on NVIDIA GPUs and CPUs, focusing on performance tuning and eliminating bottlenecks for financial markets. The role involves research, development, analysis, and collaboration with experts to improve performance across the stack, from algorithms to kernels. The engineer will also publish and present their work and influence future hardware/software designs. | Serve | 8 |
| Senior Software Engineer, RAG and Agentic AI Senior Software Engineer role focused on building and deploying production-grade RAG solutions and AI agents. The role involves designing and implementing scalable RAG architectures, developing AI agents with reasoning and multi-step execution capabilities, and orchestrating complex microservices deployments. Emphasis on optimizing RAG pipelines for accuracy, relevance, and performance, and driving continuous improvement through rigorous evaluation and collaboration. | AgentServe | 8 |
| Senior Software Engineer, Platform Engineering Senior Software Engineer to build next-generation AI platforms and products, focusing on agentic AI systems, RAG, and scalable infrastructure for enterprise workflows. | Agent | 8 |
| Solutions Architect, Physical AI and Robotics NVIDIA is looking for a Solutions Architect to guide partners in building enterprise Physical AI systems using Omniverse, Cosmos, synthetic data, and coding-agent-assisted digital twins workflows. The role involves technical advising on simulation, digital twins, robotics, industrial autonomy, and auto, focusing on architecture, compute, testing, and rollout strategies. Key responsibilities include guiding partners on synthetic data generation, evaluation methods, using coding agents for development acceleration, defining benchmarks, advising on compute infrastructure for simulation and inference, and building reference architectures. | AgentData | 8 |
| Senior Solutions Architect - KV Cache and AI Storage Senior Solutions Architect focused on building LLM inference platforms using NVIDIA GPUs, KV cache, and tiered memory solutions. The role involves technical exploration with customers, performance analysis, and translating customer needs into product roadmaps. | Serve | 8 |
| Solutions Architect - Top AI Labs Solutions Architect role at NVIDIA focusing on optimizing LLM inference and training acceleration, contributing to open-source frameworks like SGLang and vLLM, and developing KV cache offloading. Requires strong programming, systems fundamentals, and experience in performance analysis. | ServePretrain | 8 |
| Senior Systems Software Engineer, E-commerce AI Platform - GeForce NOW Senior Systems Software Engineer to architect and deploy production-grade AI agents for NVIDIA's e-commerce platform, focusing on personalization, logistics, and customer experience. Requires expertise in Python, Java, GoLang, distributed systems, and AI frameworks like LangChain/LangGraph. | Agent | 8 |
| Senior Applied Machine Learning Scientist Senior Applied ML Scientist at NVIDIA to develop ML and data-science solutions for predictive-maintenance, root-cause analysis, and AIOPS, driving projects from ideation to production within the Applied Networking AI group. | Ship | 8 |
| Solutions Architect, Generative AI - CSP NVIDIA is seeking an AI-focused Solutions Architect with expertise in LLMs, generative AI, agentic AI, or recommender systems. The role involves providing technical expertise to customers, assisting with GPU infrastructure for AI, optimizing training and inference pipelines, and gathering customer feedback for product development. This position requires 3+ years of experience in AI for large models and proficiency with AI tools. | ServePost-train | 8 |
| Senior Deep Learning Solution Architect Senior Deep Learning Solution Architect at NVIDIA, focusing on LLM inference and training acceleration, performance optimization, and contributing to open-source frameworks like SGLang and vLLM. The role involves developing and optimizing inference frameworks, KV cache offloading, and exploring distributed training performance. | ServePost-train | 8 |
| Senior SOC Product Architect Physical AI Platforms This role focuses on architecting physical AI platforms for automotive and robotics, specifically defining the SoC architecture for embedded computer vision and AI systems. The individual will analyze use cases, map requirements to hardware/software features, define system requirements, and drive recommendations into product roadmaps. The role involves deep benchmarking, customer interaction, technical leadership, and mentorship, with a strong emphasis on functional safety (ISO 26262, SOTIF). | Serve | 8 |
| Senior Technical Program Manager - Agentic System Senior Technical Program Manager to drive and coordinate cross-functional teams for large-scale technical projects in agentic AI, connecting foundation models with real-world applications for edge deployment and AI workflows. | Agent | 8 |
| Deep Learning Algorithms Engineer - ACOT NVIDIA is looking for an AI Acceleration & Optimization Engineer to optimize the performance, scalability, and efficiency of AI models (LLMs, VLMs, diffusion, multimodal) on NVIDIA GPU platforms. The role involves profiling, identifying bottlenecks, and applying optimization techniques like quantization and kernel fusion, using tools such as CUDA, TensorRT, and Nsight. Collaboration with various teams (algorithms, systems, hardware, research, CUDA, compiler, frameworks) is key to bringing models from research to production. | ServePost-train | 8 |
| Principal Software Engineer - Enterprise AI Platform Principal Software Engineer to lead security foundations for autonomous, self-evolving agents in an enterprise setting. This role involves defining security requirements, designing scalable architectures with guardrails, implementing isolation and access controls, building secure data access pathways, establishing observability and auditing, and operating a continuous evaluation framework for agent behavior. The goal is to enable developer velocity while ensuring robust safety and security for agents that generate and execute code and access data. | Agent | 8 |
| Senior Machine Learning Applications and Compiler Engineer, LPX NVIDIA is seeking a Senior Machine Learning Applications and Compiler Engineer to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to map neural network workloads onto future NVIDIA platforms. | Serve | 8 |
| Senior Power Analysis and Optimization Engineer Senior Engineer to apply AI/ML and LLMs to power analysis and optimization for NVIDIA's GPUs and SoCs. Focus on developing and productionizing ML/RL models and custom LLMs to improve energy efficiency, interpret power data, and recommend optimizations. Involves RTL analysis, Verilog prototyping, and automation. | ServeData | 8 |