Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Solutions Architect, CSP System Senior Solutions Architect focused on building and optimizing Kubernetes infrastructure for Agentic AI and Agentic RL workloads, working with Cloud Service Providers in China. | AgentServe | 8 |
| AI for Design Engineer Develop and deploy AI agents and frameworks for hardware verification tasks, processing codebases and optimizing retrieval/generation algorithms for enterprise data. | Agent | 8 |
| Engineering Manager, Prediction and Planning - Autonomous Vehicles Engineering Manager for NVIDIA's Autonomous Vehicles division, leading teams to build and scale AI-native autonomous driving systems, integrating classical safety stacks with foundation models and large-scale AI systems from research to production. |
| ShipAgent |
| 8 |
| Senior Integration Engineer - Autonomous Vehicles NVIDIA is seeking a Senior Integration Engineer to work on their end-to-end autonomous driving application, focusing on integrating modular software components and optimizing performance on heterogeneous hardware architectures. The role involves defining software architecture for L2/L3/L4 autonomous driving solutions, performing in-vehicle and simulation testing, and developing efficient C++ code using CUDA. | Agent | 8 |
| Senior Integration Engineer - Autonomous Vehicles Senior Integration Engineer for NVIDIA's end-to-end autonomous driving application, focusing on integrating software components, optimizing performance, and developing efficient C++ code on heterogeneous hardware architectures (including GPUs) for L2/L3/L4 autonomous driving solutions. | AgentServe | 8 |
| Senior AI Software Development Engineer, TensorRT-LLM NVIDIA is seeking a Senior AI Software Development Engineer for its TensorRT-LLM team. The role involves crafting and developing robust, scalable inference software for LLMs, focusing on performance analysis, optimization, and tuning. The engineer will write high-quality C++/Python code for the core backend software and collaborate with various teams to guide deep learning inference direction. A strong background in software development, LLM inference techniques, and deep learning frameworks is required. | Serve | 8 |
| AI and FSI Developer Technology Engineer - New College Grad 2026 NVIDIA is seeking an AI and FSI Developer Technology Engineer to optimize AI and HPC workloads on NVIDIA GPUs and CPUs, focusing on performance tuning and eliminating bottlenecks for financial markets. The role involves research, development, analysis, and collaboration with experts to improve performance across the stack, from algorithms to kernels. The engineer will also publish and present their work and influence future hardware/software designs. | Serve | 8 |
| Senior Software Engineer, RAG and Agentic AI Senior Software Engineer role focused on building and deploying production-grade RAG solutions and AI agents. The role involves designing and implementing scalable RAG architectures, developing AI agents with reasoning and multi-step execution capabilities, and orchestrating complex microservices deployments. Emphasis on optimizing RAG pipelines for accuracy, relevance, and performance, and driving continuous improvement through rigorous evaluation and collaboration. | AgentServe | 8 |
| Senior Software Engineer, Platform Engineering Senior Software Engineer to build next-generation AI platforms and products, focusing on agentic AI systems, RAG, and scalable infrastructure for enterprise workflows. | Agent | 8 |
| Solutions Architect, Physical AI and Robotics NVIDIA is looking for a Solutions Architect to guide partners in building enterprise Physical AI systems using Omniverse, Cosmos, synthetic data, and coding-agent-assisted digital twins workflows. The role involves technical advising on simulation, digital twins, robotics, industrial autonomy, and auto, focusing on architecture, compute, testing, and rollout strategies. Key responsibilities include guiding partners on synthetic data generation, evaluation methods, using coding agents for development acceleration, defining benchmarks, advising on compute infrastructure for simulation and inference, and building reference architectures. | AgentData | 8 |
| Senior Solutions Architect - KV Cache and AI Storage Senior Solutions Architect focused on building LLM inference platforms using NVIDIA GPUs, KV cache, and tiered memory solutions. The role involves technical exploration with customers, performance analysis, and translating customer needs into product roadmaps. | Serve | 8 |
| Solutions Architect - Top AI Labs Solutions Architect role at NVIDIA focusing on optimizing LLM inference and training acceleration, contributing to open-source frameworks like SGLang and vLLM, and developing KV cache offloading. Requires strong programming, systems fundamentals, and experience in performance analysis. | ServePretrain | 8 |
| Senior Systems Software Engineer, E-commerce AI Platform - GeForce NOW Senior Systems Software Engineer to architect and deploy production-grade AI agents for NVIDIA's e-commerce platform, focusing on personalization, logistics, and customer experience. Requires expertise in Python, Java, GoLang, distributed systems, and AI frameworks like LangChain/LangGraph. | Agent | 8 |
| Senior Applied Machine Learning Scientist Senior Applied ML Scientist at NVIDIA to develop ML and data-science solutions for predictive-maintenance, root-cause analysis, and AIOPS, driving projects from ideation to production within the Applied Networking AI group. | Ship | 8 |
| Solutions Architect, Generative AI - CSP NVIDIA is seeking an AI-focused Solutions Architect with expertise in LLMs, generative AI, agentic AI, or recommender systems. The role involves providing technical expertise to customers, assisting with GPU infrastructure for AI, optimizing training and inference pipelines, and gathering customer feedback for product development. This position requires 3+ years of experience in AI for large models and proficiency with AI tools. | ServePost-train | 8 |
| Senior Deep Learning Solution Architect Senior Deep Learning Solution Architect at NVIDIA, focusing on LLM inference and training acceleration, performance optimization, and contributing to open-source frameworks like SGLang and vLLM. The role involves developing and optimizing inference frameworks, KV cache offloading, and exploring distributed training performance. | ServePost-train | 8 |
| Senior SOC Product Architect Physical AI Platforms This role focuses on architecting physical AI platforms for automotive and robotics, specifically defining the SoC architecture for embedded computer vision and AI systems. The individual will analyze use cases, map requirements to hardware/software features, define system requirements, and drive recommendations into product roadmaps. The role involves deep benchmarking, customer interaction, technical leadership, and mentorship, with a strong emphasis on functional safety (ISO 26262, SOTIF). | Serve | 8 |
| Senior Technical Program Manager - Agentic System Senior Technical Program Manager to drive and coordinate cross-functional teams for large-scale technical projects in agentic AI, connecting foundation models with real-world applications for edge deployment and AI workflows. | Agent | 8 |
| Deep Learning Algorithms Engineer - ACOT NVIDIA is looking for an AI Acceleration & Optimization Engineer to optimize the performance, scalability, and efficiency of AI models (LLMs, VLMs, diffusion, multimodal) on NVIDIA GPU platforms. The role involves profiling, identifying bottlenecks, and applying optimization techniques like quantization and kernel fusion, using tools such as CUDA, TensorRT, and Nsight. Collaboration with various teams (algorithms, systems, hardware, research, CUDA, compiler, frameworks) is key to bringing models from research to production. | ServePost-train | 8 |
| Principal Software Engineer - Enterprise AI Platform Principal Software Engineer to lead security foundations for autonomous, self-evolving agents in an enterprise setting. This role involves defining security requirements, designing scalable architectures with guardrails, implementing isolation and access controls, building secure data access pathways, establishing observability and auditing, and operating a continuous evaluation framework for agent behavior. The goal is to enable developer velocity while ensuring robust safety and security for agents that generate and execute code and access data. | Agent | 8 |
| Senior Machine Learning Applications and Compiler Engineer, LPX NVIDIA is seeking a Senior Machine Learning Applications and Compiler Engineer to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to map neural network workloads onto future NVIDIA platforms. | Serve | 8 |
| Senior Power Analysis and Optimization Engineer Senior Engineer to apply AI/ML and LLMs to power analysis and optimization for NVIDIA's GPUs and SoCs. Focus on developing and productionizing ML/RL models and custom LLMs to improve energy efficiency, interpret power data, and recommend optimizations. Involves RTL analysis, Verilog prototyping, and automation. | ServeData | 8 |
| Senior System Software Engineer – Embedded AI Inference Senior Software Engineer to develop production automotive software for AI inference and agent orchestration in C++ for embedded platforms. Focus on building next-generation automotive software applications, including in-car agentic AI and inference of LLM/VLM/VLA models on NVIDIA GPUs. | ServeAgent | 8 |
| Senior Machine Learning Applications and Compiler Engineer, LPX Develops algorithms and optimizations for NVIDIA's LPX inference and compiler stack, focusing on mapping neural network workloads onto future NVIDIA platforms and optimizing end-to-end inference performance. Requires strong software engineering, compiler/runtime development, and deep learning framework experience. | Serve | 8 |
| DL System Software Engineer - AI Platform NVIDIA is seeking a DL System Software Engineer to develop an AI Platform for efficient inference and training of large-scale models on GPU clusters. The role involves designing and building solutions for scheduling workloads, resource management, and performance optimization, working with various NVIDIA AI technologies. | ServePost-train | 8 |
| Senior Software Engineer, TensorRT-LLM NVIDIA is seeking a Senior Software Engineer for its TensorRT-LLM team to develop and scale inferencing software for LLMs and Generative AI. The role involves crafting robust inferencing software, performing benchmarking and profiling for GPU applications, writing high-quality Python code for LLM inference, and improving the TensorRT-LLM library. Collaboration with software, research, and product teams is key. | Serve | 8 |
| Senior Software Engineer – TensorRT Edge-LLM Senior Software Engineer to develop and optimize a state-of-the-art inference framework for Large Language, Vision-Language, and Multimodal models on edge and embedded platforms, focusing on real-time performance and constrained environments. | Serve | 8 |
| Senior Performance Engineer - Deep Learning Senior Performance Engineer at NVIDIA focused on optimizing Deep Learning models and frameworks (PyTorch, JAX) for NVIDIA GPUs. The role involves building and supporting Transformer Engine, collaborating on systems research for performance improvements, implementing and benchmarking new DL models, contributing to MLPerf, and engaging with the open-source community and enterprise customers. It also involves influencing future hardware and software design. | ServePost-train | 8 |
| Senior System Software Engineer, 3D Computer Vision Senior System Software Engineer focused on 3D Computer Vision at NVIDIA, involving the development and deployment of advanced neural reconstruction models for generating 3D scenes. The role requires strong programming skills in Python and C/C++, a background in computer vision and deep learning, and experience with production-grade software development. | Post-trainServe | 8 |
| Senior Software Engineer, Quantized Inference Senior Software Engineer focused on optimizing quantized inference for LLMs by implementing recipes, developing kernels, and collaborating on inference engines like vLLM and TRT-LLM. The role involves model export pipelines, benchmarking, and data analysis tooling. | Serve | 8 |
| Senior Compiler Engineer, AI Inference Performance NVIDIA is seeking a Senior Compiler Engineer to optimize AI inference performance for their Deep Learning & AI Compiler (DLC) team. The role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and architecture teams to accelerate next-generation deep learning software for various AI applications. | Serve | 8 |
| Senior Compiler Engineer, AI Inference Platforms NVIDIA is seeking a Senior Compiler Engineer to join its Deep Learning & AI Compiler (DLC) team. The role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and architecture teams to accelerate AI inference performance on NVIDIA GPUs. The compiler is critical for data centers, personal devices, automotive, and robotics, focusing on inference performance, build time, memory footprints, and ease of use. | Serve | 8 |
| Principal GenAI Engagement Lead, Partner Platforms This role focuses on driving the technical integration of NVIDIA's Generative AI software with enterprise partners, including ISVs and CSPs. The Principal GenAI Engagement Lead will build trusted relationships, accelerate adoption, and influence product direction by designing and shipping methodologies, code, and reference architectures for RAG, LLM inference, and Multi-Agent workflows. The role requires a strong background in AI/ML, deep learning, and enterprise-grade GenAI systems, with experience in various LLM application stages and MLOps. The individual will act as the key technical lead, ensuring the deployment of robust, scalable GenAI solutions. | AgentServe | 8 |
| Developer Technology Engineer, AI NVIDIA Developer Technology Engineer focused on optimizing core parallel algorithms and data structures for GPUs, specifically working with LLM training frameworks and performance optimization. Collaborates with application developers and internal NVIDIA teams to improve performance and developer efficiency. | Data | 8 |
| Solutions Architect - CPU and LPU NVIDIA Solutions Architect focused on optimizing AI inference workloads across CPU, GPU, and LPU platforms for customers. The role involves technical expertise, proof-of-concept development, and optimizing AI efficiency in heterogeneous environments. | ServeAgent | 8 |
| Principal AI Developer Technology Engineer This role focuses on researching and developing techniques to accelerate AI workloads (deep learning, machine learning) on advanced computer architectures, specifically GPUs. The engineer will perform in-depth analysis and optimization of complex AI and HPC algorithms, publish findings, and influence future hardware/software design. Requires deep C/C++ programming, parallel programming (CUDA, etc.), low-level performance optimization, and CPU/GPU architecture expertise. | Serve | 8 |
| Principal AI Developer Technology Engineer Seeking a Principal Developer Technology Engineer to research and develop techniques for GPU acceleration of AI workloads, focusing on performance optimization of deep learning and HPC algorithms on modern CPU and GPU architectures. This role involves collaborating with internal teams and the developer community, influencing hardware/software design, and publishing findings. | Serve | 8 |
| Solution Architect, Financial Services Solutions Architect for Financial Services at NVIDIA, focusing on guiding customers in leveraging NVIDIA's AI technologies, particularly in areas like model distillation, domain adaptation, reinforcement learning, and post-training algorithms. The role involves technical advocacy, collaborative innovation, and knowledge sharing within the financial services sector, requiring expertise in AI frameworks, Python, distributed computing, and the AI model lifecycle. | Post-trainPretrain | 8 |
| AI Chip Design Engineer - New College Grad 2026 NVIDIA is seeking an AI Chip Design Engineer to develop and integrate AI capabilities into verification tasks. The role involves creating AI agents to enhance productivity, building production infrastructure for these agents, and optimizing algorithms for enterprise data. Requires strong proficiency in LLM libraries, GPU/CPU architectures, and HW verification methodologies. | Agent | 8 |
| Senior Solutions Architect – Simulation Solutions 3D Reconstruction This role focuses on developing and scaling AI platforms for simulation and 3D reconstruction, particularly within the Omniverse ecosystem. The Senior Solutions Architect will act as a technical advisor, prototype solutions, implement intricate technical systems, provide technical enablement, and advocate for partner needs. The role requires expertise in AI, systems knowledge, autonomous systems, simulation, generative AI, Python, C++, DL/RL frameworks, computer vision, and 3D reconstruction. | AgentServe | 8 |
| AI Chip Design Engineer - New College Grad 2026 NVIDIA is seeking an AI Chip Design Engineer to develop and integrate AI capabilities into verification tasks, focusing on building and maintaining infrastructure for AI agents that process large codebases and optimize verification flows. The role involves developing retrieval and generation algorithms, integrating AI optimizations, and working with HW engineering teams. | Agent | 8 |
| Solution Architect, Energy NVIDIA is seeking a Solution Architect with deep expertise in AI solutions to drive the efficient use of compute platforms in the Energy Industry. The role involves being a trusted technical advisor to developers and customers, embedding NVIDIA software, improving application performance, and establishing the foundation for next-generation AI systems. Responsibilities include supporting business development, working directly with customers, assisting in the adoption of NVIDIA software, analyzing architectures for acceleration opportunities, providing feedback to engineering teams, and delivering trainings and demonstrations. Requires an MS/PhD in a technical field, 5+ years of experience in AI/ML/DL/NLP/Generative AI, and 5+ years of industrial experience in power grid software and advanced ML for grid operations. Familiarity with accelerated computing, GPU systems, Python/C/C++, major AI frameworks, containers, and version control is essential. Experience designing and building complex AI/ML solutions, and reasoning across various system components is also required. | Serve | 8 |
| Senior Tools Development Engineer NVIDIA is seeking a Senior Tools Development Engineer to build agentic infrastructure for test automation and quality engineering on the Omniverse platform. The role involves designing and deploying multi-agent systems, orchestration frameworks, and evaluation systems to improve software quality and reliability. | Agent | 8 |
| Senior Software Engineer, 3D/4D Reconstruction Senior Software Engineer at NVIDIA focused on 3D/4D reconstruction for autonomous driving products. The role involves building and optimizing systems using deep learning, computer vision, and generative AI techniques, including large geometry models, Gaussian splatting, and diffusion models. Responsibilities include developing reconstruction systems, inventing evaluation methods, creating visualization tools, building automated workflows, and optimizing neural network performance for training and deployment. The goal is to improve reconstruction fidelity and simulation realism for end-to-end driving models. | AgentEval Gate | 8 |
| Developer Relations Manager – AI Natives NVIDIA is seeking a Developer Relations Manager to engage with AI-native companies, helping them design, optimize, and scale their AI platforms on NVIDIA technologies. The role involves advising founders and engineering teams on building agentic systems, AI copilots, and multimodal applications, with a focus on accelerating training, optimizing inference, and delivering AI experiences. The ideal candidate has deep technical expertise in AI systems, developer platforms, and large-scale inference infrastructure. | ServeAgent | 8 |
| Senior AI Performance and Efficiency Engineer Senior AI/ML Performance and Efficiency Engineer focused on optimizing GPU cluster performance for AI/ML researchers by addressing infrastructure and application bottlenecks. This role involves building tools, analyzing efficiency, and collaborating across teams to improve hardware, software, and infrastructure usage for various ML workloads like Robotics, Autonomous vehicles, LLMs, and Videos. | Serve | 8 |
| Senior AI Developer Technology Engineer Senior Developer Technology Engineer focused on researching and developing techniques to GPU accelerate AI workloads, optimizing performance on modern CPU and GPU architectures, and collaborating with the developer community and internal teams to influence next-generation hardware and software design. | Serve | 8 |
| Senior AI Formal Verification Engineer Senior AI Formal Verification Engineer to enhance in-house formal tools with AI, leveraging LLMs and ML to automate intent-to-proof workflows and debug complex chips. Role involves architecting methodologies, developing AI agents, and creating AI-based debug assistants. | AgentServe | 8 |
| Engineering Manager, AI Developer Technology Engineering Manager for NVIDIA's AI Developer Technology team, focused on leading a team to optimize and develop algorithms for Deep Learning and Machine Learning applications, influencing next-generation hardware/software, and collaborating with customers and internal teams. The role involves optimizing training and inference performance on NVIDIA hardware. | ServePost-train | 8 |
| Senior Developer Technology Engineer - AI Senior Developer Technology Engineer focused on researching and optimizing AI/ML workloads for GPU acceleration, involving deep analysis, performance tuning, and collaboration with the developer community and internal teams to influence next-generation hardware and software design. | Serve | 8 |