Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Deep Learning Framework Communications Engineer Senior Deep Learning Framework Communications Engineer at NVIDIA, focusing on integrating and optimizing communication libraries (NCCL, NVSHMEM) within AI frameworks (PyTorch, TRT-LLM, vLLM, JAX) to enhance performance for large-scale AI training and inference. The role involves deep analysis of AI workloads, compiler improvements, and kernel authoring for multi-GPU systems. | Serve | 8 |
| Senior Scientific Machine Learning Engineer – Earth-2 Develops and enhances machine learning frameworks (NVIDIA PhysicsNeMo, NVIDIA Earth2Studio) for scientific ML technology in weather, climate, and earth system modeling. Focuses on implementing new deep learning techniques and enhancing Earth-2 technologies. | Post-train |
| 8 |
| Senior Solutions Architect, Generative AI Data Processing Senior Solutions Architect role focused on assisting customers in deploying Generative AI solutions, particularly for data processing and agentic workflows, using NVIDIA's AI technology stack. The role involves technical advisory, system design, and implementation at scale, with a strong emphasis on Deep Learning, LLMs, and GPU technologies. | AgentServe | 8 |
| Director, System Software Engineering - Metropolis Accelerated and Inferencing Software NVIDIA is seeking a Director of System Software Engineering to lead teams responsible for the full lifecycle of Vision AI strategy, from model onboarding to production deployment. The role focuses on transforming foundation models into real-time, GPU-accelerated video intelligence systems, scaling multimodal reasoning, and enabling agentic development workflows. Key responsibilities include architecting and operationalizing inference acceleration, driving implementations of frameworks like TensorRT and VLLM, collaborating with partners on custom models, and ensuring performance benchmarking. The ideal candidate has extensive experience in deep learning, GPU optimization, and leading engineering teams in embedded and enterprise platforms. | ServeAgent | 8 |
| Director, Isaac for Healthcare Engineering Director of Engineering for NVIDIA's Isaac for Healthcare initiative, focusing on building a platform for healthcare robotics companies to develop, simulate, train, and deploy physical AI systems. The role involves platform leadership, team building, partner enablement, technical strategy, and cross-functional collaboration, with a strong emphasis on shipping sophisticated software platforms at scale. | ShipData | 8 |
| Manager, Solutions Architecture - Global Partner Team Manager of Solutions Architecture for NVIDIA's Global Partner Team, focusing on leading technical engagements with GSIs and AI consulting firms. The role involves building and scaling Agentic AI services, providing architectural oversight for complex AI workflows, and collaborating with product and engineering teams. Requires deep technical expertise in Generative/Agentic AI, RAG, LLM orchestration, and AI infrastructure. | Agent | 8 |
| Senior Software Architect - Deep Learning and HPC Communications Senior Software Architect role at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software for AI and HPC workloads. The role involves investigating performance bottlenecks, developing new communication technologies, exploring hardware/software co-design, and building proofs-of-concept to drive innovation in large-scale GPU clusters. | Serve | 8 |
| Senior Solutions Architect - Deep Learning Senior Solutions Architect focused on Deep Learning and Agentic AI tools, collaborating with customers to build solutions using NVIDIA technology. Responsibilities include technical sales support, integrating NVIDIA tech into HPC, championing Deep Learning internally, and developing demo solutions. | ServeAgent | 8 |
| SOC AI Application Engineer — AI Services, Agents and Knowledge Systems NVIDIA is seeking an AI Engineer to build and operate AI application-layer services for SOC design automation, including assistants, retrieval, Q&A, workflow automation, and AI agents. The role involves designing LLM-backed services, building RAG and knowledge systems, applying agent and orchestration patterns, improving developer experience with AI-assisted coding, and owning reliability and evaluation. Requires strong Python, experience shipping services, and hands-on use of LLM frameworks and RAG. | AgentServe | 8 |
| Director, Product Platform Retail and CPG Industries NVIDIA is seeking a Director to define and build the Retail & CPG Industries product platform. This role involves architecting and developing a platform leveraging NVIDIA's full stack, including Agentic AI and accelerated computing, to reshape digital commerce, supply chains, and intelligent stores. The platform will utilize NVIDIA Nemo microservices and Nemotron models, with a focus on Agentic AI for various business functions. The ideal candidate will have hands-on development experience with AI agents, LLMs, RAG, and distributed systems, and will collaborate with engineering teams to deliver a production-ready, scalable platform. | AgentServe | 8 |
| Senior Solutions Architect - AI Factory Deployment Senior Solutions Architect focused on deploying and validating AI factories, specifically running and debugging AI/LLM workloads on GPU clusters. Responsibilities include setting up environments, executing benchmarks, resolving performance issues, building observability, and recommending optimizations. | Serve | 8 |
| Senior Software Engineer, Deep Learning Inference Senior Software Engineer focused on optimizing deep learning inference for LLMs and omnimodal architectures on NVIDIA hardware, including GPU kernel tuning, distributed inference, and contributing to open-source libraries. | Serve | 8 |
| Senior Hardware Architect, Deep Learning GPU and System Senior Hardware Architect role focused on designing next-generation GPUs and systems to advance the state of AI, analyzing deep learning workloads, and proposing new features for acceleration. Requires 8+ years of experience in performance, hardware architecture, and deep learning analysis. | Serve | 8 |
| Senior Software Engineer - VLM Microservices for Neural Reconstruction Senior Software Engineer to design, build, and optimize containerized inference execution for 3D Vision Language Models (VLMs) for neural reconstruction, turning research into production-grade software (NIMs). The role involves developing benchmarks, releasing and maintaining models, contributing to open-source projects like vLLM, and collaborating with research and product teams. Requires experience with AI distributed systems, inference platforms, Python/C++, and software engineering fundamentals. | ServePost-train | 8 |
| AI Computing Software Development Engineer, TensorRT NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust inferencing software for GPUs, focusing on performance analysis, optimization, and tuning. The role involves collaborating with various teams to guide machine learning inferencing direction and potentially publishing key results. | Serve | 8 |
| Senior Applied Machine Learning Engineer - VLSI Design NVIDIA is seeking a Senior Applied Machine Learning Engineer to build AI-driven software systems for circuit design, combining automation algorithms, DL models, and agentic workflows. The role involves working on pre-silicon and post-silicon hardware design data, circuit optimization, and AI systems for EDA/design automation, translating requirements into AI/ML and agentic system problems, and testing/releasing models and AI systems. | AgentData | 8 |
| Applied Machine Learning Engineer, Circuit Design - New College Grad 2026 NVIDIA is seeking an Applied Machine Learning Engineer for their Circuit Design team, focusing on building AI-driven software systems that combine automation algorithms, DL models, and agentic workflows to accelerate end-to-end circuit design. The role involves working with hardware design data, circuit optimization, and developing AI/ML solutions for EDA, with a focus on agent-driven design exploration and optimization. | Agent | 8 |
| Senior Solutions Architect, Generative AI Senior Solutions Architect role focused on customer engagements for NVIDIA's generative AI technologies, involving AI model training and deployment optimization, particularly for LLMs and recommenders in the consumer internet industry. Requires strong coding, GPU optimization, and communication skills. | ServeData | 8 |
| Principal AI and ML Infra Software Engineer, GPU Clusters This role focuses on enhancing the efficiency of AI and ML research on GPU clusters by collaborating with researchers to identify and address infrastructure deficiencies. The engineer will optimize performance, monitor resource utilization, and contribute to the AI/ML infrastructure ecosystem, keeping up-to-date with the latest AI/ML technologies. | Serve | 8 |
| Principal Cloud Services Software Engineer NVIDIA DGX Cloud Team is seeking a Principal Cloud Services Software Engineer to develop and optimize AI infrastructure services for large-scale AI training workflows. The role involves designing and implementing resilient, efficient services orchestrated by Kubernetes, with a focus on backend development, distributed systems, and high-performance computing. | ServeAgent | 8 |
| Senior Deep Learning Software Engineer - Autonomous Vehicles Senior Deep Learning Software Engineer focused on developing and productizing deep learning solutions for autonomous vehicles. The role involves training, fine-tuning, optimizing perception DNNs, applying quantization, improving DNN architectures, and enhancing inference speed and power consumption. It requires strong programming skills, experience with deep learning frameworks, computer vision tasks, and familiarity with CNNs and Transformer architectures. Experience with low precision inference, quantization, and NVIDIA software libraries is a plus. | ServePost-train | 8 |
| Compiler Engineer - AI Inference NVIDIA is seeking an AI Compiler Engineer to optimize kernel generation and computational graph optimizations for AI inference and training workloads on next-generation GPUs. The role involves hands-on development, collaboration on hardware/software co-design, and scaling AI deployments in datacenters. | ServePost-train | 8 |
| Senior Software Engineer, Metropolis Vision AI Senior Software Engineer to develop and optimize high-performance Vision AI pipelines and large-scale distributed services for processing video, image, and 3D data. The role involves crafting real-time systems, developing multi-modal perception, using simulation/synthetic data, and profiling/tuning GPU-accelerated inference pipelines. Collaboration with research and platform teams is key, with an emphasis on bringing research into production at scale. | ServePost-train | 8 |
| Senior Software Engineer, AI Networking Senior Software Engineer role focused on building and productizing ML tools for optimizing AI workloads (LLM training/inference) across GPU/CPU clusters, with a focus on networking and system resource utilization. Involves distributed deep learning, ML-based optimization techniques, and performance analysis. | ServeAgent | 8 |
| Machine Learning Intern - 2026 NVIDIA is seeking a Machine Learning Intern to assist with AI technology development and demonstrations. The intern will work with NVIDIA SDKs, engage with the AI community, and contribute to machine learning projects and AI software development. | Serve | 8 |
| Senior Autonomous Driving Software Engineer, L4 Planning Senior engineer to build the main stack for autonomous driving, focusing on prediction, decision-making, planning, and control architecture. This includes crafting system-level safety, implementing end-to-end data-driven AV pipelines, large-scale model inference, and integrating classical and end-to-end hybrid systems for scalability from research to production. Requires 6+ years of experience in production autonomous driving systems, AI foundation models, large-scale ML systems, end-to-end driving models, and robotics/embodied AI system architecture. | AgentServe | 8 |
| Deep Learning Architect, LLM Inference - New College Grad 2026 The role focuses on optimizing LLM inference server performance, workload characterization, and benchmarking for NVIDIA's GPUs. It involves collaborating with AI startups, developing performance tools, contributing to deep learning software projects, and guiding inference serving direction. | Serve | 8 |
| Senior Deep Learning Scientist, Speech Synthesis NVIDIA is seeking a Senior Deep Learning Scientist to work on their Speech AI product, Riva. The role involves training speech synthesis models (mel-spectrogram and vocoder), measuring and analyzing model performance, maintaining the TTS evaluation system, and improving speech data processing and training set preparation. The ideal candidate has a Master's or PhD, 5+ years of ML/AI experience, strong Python and PyTorch skills, and hands-on experience training speech synthesis models. | DataPost-train | 8 |
| Senior Software Engineer - Robotics Senior Software Engineer role at NVIDIA focused on building Physical AI systems for humanoid robots. The role involves defining technical direction for generative AI workflows in robotics, spanning simulation, real-world deployment, and continuous learning. Responsibilities include building the humanoid reference platform, integrating NVIDIA products, and providing technical mentorship. Requires significant robotics software engineering experience, with a focus on AI-powered robots and robot learning. | ShipData | 8 |
| Senior AI-Native Systems Software Engineer, TensorRT Senior engineer to architect and build an AI-native framework using AI agents for software development, focusing on scaling, performance optimization, and integrating SOTA models for inference. | AgentServe | 8 |
| Senior Performance Engineer - LLM Inference Frameworks NVIDIA is seeking a Senior Performance Engineer to optimize LLM inference infrastructure on GPUs, focusing on throughput, memory efficiency, and scalability. The role involves designing and implementing high-performance pipelines, profiling, tuning model execution, and innovating techniques like Speculative Decoding and quantization. Experience with deep learning frameworks and performance debugging is required. | Serve | 8 |
| Applied AI Engineer - DFT Methodology NVIDIA is seeking an Applied AI Engineer to explore and architect generative AI solutions, including LLMs, RAGs, and Agentic AI workflows, for Design-for-Test (DFT) and VLSI problems. The role involves deploying predictive ML models for silicon lifecycle management and collaborating with VLSI/DFX teams to integrate AI solutions. Experience in applied ML for chip design and deploying generative AI for engineering use cases is required. | Agent | 8 |
| OEM Solutions Architect - AI Full Stack Public Sector NVIDIA is seeking a Solutions Architect to be the lead technical authority for Federal partnerships, focusing on deploying Generative AI at scale for U.S. Government agencies. The role involves architecting and optimizing the 'AI Factory,' leading POCs for NVIDIA's AI software stack, and navigating complex Federal security frameworks. The ideal candidate has extensive experience in full-stack data center architecture, the AI lifecycle (data curation, fine-tuning, inference orchestration), and strategic communication with both technical and leadership audiences within the public sector. | ServePost-train | 8 |
| Solutions Architect – OEM AI Solutions Architect role focused on integrating NVIDIA's software and tools with OEM partners' offerings, specifically for AI security and accelerated compute solutions. The role involves designing GPU-accelerated pipelines, developing proof-of-concept technologies, guiding Agentic AI workflows (RAG, GNN), and translating AI/cybersecurity research into deployable architectures for enterprise and government customers. | AgentData | 8 |
| Senior Staff Software Engineer - Agentic Automation Senior Staff Software Engineer to own engineering efforts for NVIDIA enterprise systems, transforming support into AI-infused automated resolution systems using LLM-based agents, tool calling, RAG, and orchestration frameworks. Requires full-stack experience, strong systems thinking, and incident management skills. | Agent | 8 |
| AI Computing Development Engineer, TensorRT-LLM NVIDIA is seeking software engineers to develop and optimize inferencing software for AI models, specifically focusing on TensorRT-LLM. This role involves performance analysis, tuning, and collaboration across teams to advance machine learning inferencing capabilities. | Serve | 8 |
| Senior Software Engineer, JAX Senior Software Engineer to develop NVIDIA's AI platform, focusing on performance optimizations in deep learning frameworks using JAX. The role involves designing and implementing JAX core components, driving performance on NVIDIA products, and building tools to increase efficiency for AI-based systems. | Serve | 8 |
| Software Engineering Intern, Automation Infra - 2026 NVIDIA is seeking an intern to build and maintain AI agent skills, MCP tools, and agentic workflows to automate operations. The role involves developing cloud-native services, data automation pipelines, and interactive dashboards. Candidates should have strong Python skills, experience with AI-powered applications like AI agents and RAG pipelines, and familiarity with cloud-native technologies. | Agent | 8 |
| SOC AI Application Engineer — AI Services, Agents and Knowledge Systems NVIDIA is seeking an AI Engineer to build AI application-layer services for SOC design automation, including assistants, retrieval, Q&A, workflow automation, and AI agents. The role involves designing, implementing, and operating LLM-backed services, building RAG and knowledge systems, applying agent and orchestration patterns, improving developer experience with AI-assisted coding, and owning reliability and evaluation. | AgentServe | 8 |
| Senior Architect - Server Performance NVIDIA is seeking architects to drive architectural performance for its next-generation AI server systems. This position demands a unique capability to bridge deep architectural knowledge, workload analysis, and hands-on silicon investigations. Candidates should be adept at working directly with silicon, high-level models, and simulators. Responsibilities include conducting performance investigations on both NVIDIA and competitive platforms, and developing targeted microbenchmarks to examine specific architectural aspects. The role does not heavily involve modeling tasks (functional or performance), though occasional focused assignments may arise. | Serve | 8 |
| Solutions Architect, Inference Deployments This role focuses on building and deploying AI inference solutions at scale using NVIDIA's GPU technology and Kubernetes. The Solutions Architect will collaborate with engineering, DevOps, and customers to optimize and serve generative AI models, ensuring low-latency inference in enterprise environments. | Serve | 8 |
| Solutions Architect, Agentic AI NVIDIA is seeking Solutions Architects to build and deploy agentic AI applications at scale for enterprises, focusing on integrating enterprise data, developing multi-modal dialogue systems, and task-specific agents. The role involves working with agentic frameworks, providing feedback to improve software products, and educating vertical teams. | Agent | 8 |
| Senior Solutions Architect, Generative AI Senior Solutions Architect role focused on customer engagements, improving AI workload performance, and developing proof-of-concepts for Generative AI solutions (LLMs, recommenders) using NVIDIA software and technologies. Requires strong coding, GPU optimization, and communication skills. | ServeAgent | 8 |
| Senior Software Engineer, 3D, 4D Reconstruction Senior Software Engineer role focused on applying deep learning and computer vision to 3D/4D world modeling for autonomous driving products, involving building reconstruction systems, developing evaluation methods, and optimizing neural network performance. | AgentEval Gate | 8 |
| Principal Deep Learning Communication Architect NVIDIA is seeking a Principal Deep Learning Communication Architect to lead the technical roadmap for communication libraries across next-generation platforms, ensuring seamless scaling of models to massive clusters. The role involves designing and optimizing communication primitives for heterogeneous interconnects, co-designing with application developers and silicon architects, and developing analytical models for system behavior. Expertise in parallel computing, HPC/distributed deep learning, inference engines, and GPU architecture is required. | ServeAgent | 8 |
| Robotics and Agent Solution Architecture Intern - 2026 Internship role focused on building innovative tools and applications in robotics, agentic modeling, and AI model inference, utilizing NVIDIA SDKs and frameworks. Involves AI engineering, optimization, and exploring new trends in AI and computing acceleration. | AgentServe | 8 |
| Senior Infrastructure and Methodology Engineer for SoC-Clocks NVIDIA is seeking a Senior Infrastructure and Methodology Engineer to optimize chip design workflows by developing AI and agentic applications. The role involves integrating LLM capabilities into various engineering tools, designing agent applications, and establishing evaluation benchmarks. Requires full-stack web development and AI application development experience, with a preference for ASIC knowledge. | Agent | 8 |
| Technical Lead, GenAI - Autonomous Vehicles This role is a Technical Lead focused on Generative AI within Autonomous Vehicles, engaging with developer ecosystems and partners to promote NVIDIA's AI platforms. The candidate will act as a technical advisor, develop expertise in NVIDIA's platforms, create enablement resources, and represent partner needs internally. Requires a strong technical background in AI, AV systems, and GenAI model development, with experience in production code, DevOps, and DL/RL frameworks. | Agent | 8 |
| Senior Software Engineer, Computer Vision - Autonomous Vehicles Senior Software Engineer at NVIDIA for Autonomous Vehicles, focusing on Computer Vision and Machine Learning for offline perception tasks. Responsibilities include advancing DL components for training and inference, developing tools for large datasets, and integrating DL algorithms into large-scale pipelines. | DataServe | 8 |
| Developer Technology Engineer - AI NVIDIA Developer Technology Engineer focused on optimizing AI workloads, particularly large language models (LLMs), on NVIDIA's GPU platform. The role involves deep dives into application performance, GPU kernel optimization, distributed training and inference, and collaboration with various internal teams and external developers. It requires strong software engineering skills, parallel programming expertise, and a focus on performance analysis and tuning. | ServePost-train | 8 |