NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles, a 50% decrease versus the prior 30 days (218 → 110).
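The 30-day comparison is a plain percent-change calculation. Below is a minimal Python sketch, assuming the two posting counts are the only inputs; the function name `pct_change` is hypothetical and not part of any tool the index is known to use.

```python
def pct_change(previous: int, current: int) -> float:
    """Return the percent change from `previous` to `current`."""
    if previous == 0:
        raise ValueError("percent change is undefined when the previous count is 0")
    return (current - previous) / previous * 100.0


# Counts quoted in the text: 218 roles in the prior window, 110 in the latest.
change = pct_change(218, 110)
print(f"{change:.1f}%")  # about -49.5%, which rounds to the reported 50% drop
```

The unrounded figure is slightly under 50%, so the published value appears to be rounded to the nearest whole percent.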
| Title | Stage | AI score |
|---|---|---|
| Senior AI Infrastructure Software Engineer Senior AI Infrastructure Software Engineer at NVIDIA, focusing on building and scaling infrastructure for AI agents and applications in chip design. The role involves designing, developing, and improving scalable infrastructure, driving performance and reliability improvements, and collaborating with research and hardware teams. Requires expertise in Python, distributed systems, microservices, and integrating LLMs/agent frameworks. | Agent, Serve | 8 |
| Senior Manager, Deep Learning Performance Architecture NVIDIA is seeking an Engineering Manager to lead a Deep Learning Performance Architect Team. This role involves managing a team focused on analyzing deep learning networks and advancing deep learning computing systems through hardware/software co-design. Responsibilities include establishing team objectives, collaborating with software framework and hardware architecture teams, characterizing deep learning workloads, performance tuning, optimizing software stacks, and driving the evolution of next-generation hardware and software architectures. | Serve | 8 |
| Deep Learning Performance Architect NVIDIA is seeking Software Engineers to join their Deep Learning Inference team, focusing on developing and optimizing GPU-accelerated deep learning kernels for inference. The role involves performance analysis, tuning, and collaboration with cross-functional teams on innovative solutions. | Serve | 8 |
| Senior System Software Architect, HPC and AI Networking NVIDIA is seeking a Senior System Software Architect to design and prototype scalable software systems for distributed AI training and inference, focusing on optimizing throughput, latency, and memory efficiency. The role involves developing and evaluating communication libraries, collaborating with AI framework teams, co-designing hardware features for AI acceleration, and contributing to runtime systems and protocol layers. | Serve, Post-train | 8 |
| Software Engineer, LLM Inference Software Engineer focused on developing and optimizing LLM inference software and frameworks, working with GPU-accelerated libraries and deep learning frameworks. | Serve | 8 |
| Compute Architecture Software Engineer NVIDIA is seeking an LLM Inference Software Engineer to accelerate LLM inference using GPU technology on the TRTLLM project. The role involves developing and optimizing software solutions, implementing GPU-based algorithms, and improving performance across diverse computing environments. | Serve | 8 |
| Software Engineer, cuDNN - Deep Learning Software Engineer role focused on developing and optimizing cuDNN, a GPU-accelerated library for deep neural networks, including LLM support. The role involves performance analysis, tuning, and collaboration with cross-functional teams to innovate across various AI applications. | Serve | 8 |
| Deep Learning Performance Architect, CUTLASS DSL NVIDIA is seeking an engineer to develop and optimize CUTLASS DSL, a Python-native language for GPU kernel development, and its associated MLIR dialects and lowering passes. The role involves accelerating kernel compilation for NVIDIA's next-generation AI platforms, aiming for performance comparable to CUTLASS C++. | Serve | 7 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architecture, analyze performance of deep learning algorithms on different architectures, identify bottlenecks, and explore new features and hardware capabilities. Requires a strong background in computer architecture and experience with deep learning platforms and frameworks. | Serve | 7 |
| Infrastructure Software Engineer, Deep Learning Libraries NVIDIA is seeking an Infrastructure Software Engineer to enable next-generation deep learning libraries by designing and developing scalable automation for build, test, integration, and release processes. The role involves developing and deploying AI agents to automate the software development cycle and configuring industry-standard tools, with a focus on open-source products like CUTLASS. | Agent | 7 |
| Deep Learning Compiler Engineer - CUDA NVIDIA is seeking a Deep Learning Compiler Engineer to design and implement DSLs and compiler cores for emerging GPU architectures, focusing on optimizing performance for AI/LLM workloads and integrating with AI/ML frameworks. | Serve | 7 |
| Developer Technology Engineer, AI NVIDIA Developer Technology Engineer focused on optimizing AI and deep learning applications on GPU architectures, working with customers to provide AI solutions, and collaborating with internal teams to influence future hardware and software design. | Serve | 7 |
| Senior Software Engineer, Driving Behavior and Multi-Vehicle Adaptation – Autonomous Vehicles NVIDIA is seeking a Senior Software Engineer for their China Autonomous Driving Team to own the driving behavior of their autonomous driving stack across multiple production programs. This role involves deep root-cause analysis, adapting planning and control algorithms to diverse vehicle platforms, and building automation tools. The engineer will also perform on-vehicle testing, tune real-world performance, and collaborate with OEM partners to deliver safe, comfortable, and production-ready autonomous driving behavior. | Agent, Serve | 7 |
| Senior Software Engineer, Context Fusion and Multi-Vehicle Adaptation - Autonomous Vehicles Senior Software Engineer role at NVIDIA focusing on Context Fusion and Multi-Vehicle Adaptation for Autonomous Vehicles. The role involves analyzing and resolving fusion issues, adapting fusion logic across different vehicle platforms and environments, building debugging and validation workflows, and collaborating with global teams and OEM partners. Requires strong system-level debugging, C/C++ skills, and experience in autonomous driving or robotics. | Agent | 7 |
| Software Manager, Planning and Control - Autonomous Vehicles Software Manager for Planning and Control in Autonomous Vehicles at NVIDIA, leading a team to productize and deliver ADAS and autonomy functions. Responsibilities include setting algorithmic direction, designing software architecture, building testing infrastructure, and managing a team of developers. Requires significant software product and management experience, C++/C proficiency, and Agile/Linux environment familiarity. Experience shipping ADAS/Autonomy functions, building from scratch, robust testing infrastructure, algorithm development for physical systems, and automotive systems are highly desirable. | Ship | 7 |
| Senior System Software Engineer - AI Performance and Efficiency Tools NVIDIA is seeking a Senior System Software Engineer to develop tools for AI researchers and SW/HW teams running AI workloads on GPU clusters. The role involves building internal profiling, analysis, debugging, benchmarking, and simulation tools to improve the performance and efficiency of AI workloads and systems. This includes partnering with HW architects and understanding deep learning frameworks, distributed training/inference, and GPU cluster technologies. | Serve, Data | 7 |
| Applied AI Engineer, Product Convergence and Closure NVIDIA is seeking an Applied AI Engineer to rebuild their silicon toolchain using AI. The role involves building infrastructure to transform raw simulation data into firmware tuning, product specs, and manufacturing limits, automating analysis and validation using LLMs and agents, and developing observability systems. The engineer will work with various teams to translate hardware requirements into production workflows. Requires 4+ years of Python production experience and hands-on LLM application in engineering problems, with a focus on data quality and distinguishing useful AI tools from hype. | Agent, Serve | 7 |
| Senior Developer Relations Manager NVIDIA is seeking a Senior Developer Relations Manager to engage with the China industrial and research community, focusing on integrating GPU-accelerated computing solutions, particularly in Generative AI, Agentic AI, and AI Storage. The role involves understanding community requirements, promoting NVIDIA tools, architecting solutions, and driving adoption of new products within the AI storage ecosystem. | Serve, Agent | 7 |
| Senior System Software Engineer, Robotics Senior System Software Engineer role focused on building the Physical AI platform for NVIDIA's robotics projects. Responsibilities include robot bring-up, developing auto-verification pipelines, and supporting R&D. The role requires strong robotics software engineering skills, experience with various robot embodiments, and familiarity with AI/ML algorithms for robotics. The engineer will deploy and test software on physical robots and digital twins, and use agentic AI for software development. | Ship, Agent | 7 |
| AI Developer Technology Engineer NVIDIA is seeking an AI Developer Technology Engineer to enhance embodied AI responsibilities, develop on products like IsaacSim and Isaac Lab, profile and optimize GPU-based physics simulator performance, and collaborate with various teams to invent next-generation architectures and software platforms. Requires experience with C++, CUDA, Python, Linux, physics simulators, and prior involvement with embodied AI or humanoid robotics firms. | Ship, Data | 7 |
| Developer Technology Engineer – AI NVIDIA Developer Technology Engineer focused on optimizing deep learning and machine learning workloads on NVIDIA's accelerated computing platform (GPU, CPU, DPU) for key customers. Requires strong C/C++ and CUDA experience, with an MS/PhD in CS or related field. | Serve | 7 |
| Senior Computer Vision and Deep Learning Hardware Architect NVIDIA is seeking an Autonomous Vehicle Performance Architecture Engineer to design, model, and verify state-of-the-art programmable vision accelerators (PVA) for automotive and robotics. The role involves optimizing software for autonomous driving solutions, analyzing and prototyping applications, building performance models for future architectures, and collaborating with teams to enhance PVA architecture. Requires a Masters/PhD, 3+ years of relevant experience, strong C/C++ and computer architecture skills, and performance modeling/optimization expertise. Experience in DSP programming, autonomous vehicle software, deep learning, computer vision, and self-driving cars is a plus. | Serve, Post-train | 7 |
| Senior Software Engineer, NCCL Senior Software Engineer role focused on designing, implementing, and maintaining highly-optimized communication runtimes for Deep Learning frameworks and HPC programming interfaces on GPU clusters. This involves system software development, parallel programming interface contributions, and proof-of-concept creation for new designs and hardware features. | Serve | 7 |
| Solution Architect – Accelerated Computing Libraries NVIDIA is seeking a Solution Architect to drive the adoption of their AI and accelerated computing libraries across industries. The role involves understanding customer workloads, designing solutions using NVIDIA libraries for LLM inference and training acceleration, and collaborating with product teams to improve features and performance. The candidate will also build technical assets and analyze industry trends. | Serve | 7 |
| Senior Deep Learning Test Development Engineer, SDET Senior Deep Learning Test Development Engineer (SDET) at NVIDIA's AI SWQA team, responsible for validating the robustness and performance of NVIDIA's AI software and GPU Infrastructure across various AI scenarios. The role involves test planning, design, execution, automation, and bug management, with a focus on improving workflow processes and efficiency. Experience with LLM inference frameworks and AI development tools is required. | Serve | 7 |
| Validation Data Scientist, Verification and Validation - Autonomous Vehicles The Validation Data Scientist will build tooling, perform large-scale analysis, and drive data-driven evaluation of vehicle-level behavior and Operational Design Domain (ODD) coverage during scaled testing for autonomous vehicles. This role involves building and improving evaluation frameworks, data pipelines, and data curation strategies, defining core metrics, and automating scalable workflows using cloud platforms and AI. The goal is to influence product development, technical reviews, and software releases by providing quantitative analyses and clear reporting. | Eval Gate, Data | 7 |
| Validation Data Engineer, Verification and Validation - Autonomous Vehicles NVIDIA is seeking a Validation Data Engineer for its Autonomous Vehicles team. The role involves building tooling, performing large-scale analysis, and driving data-driven evaluation of vehicle-level behavior and ODD coverage using real-world and virtual AV driving logs. Responsibilities include implementing evaluation frameworks, data pipelines, and data curation strategies, defining core metrics, and contributing to scalable workflows using cloud platforms and AI. The ideal candidate has 5+ years of experience in data engineering or analytics, strong Python skills, and experience with autonomous vehicle behavior analysis. | Eval Gate | 7 |
| Senior Software Test Development Engineer - Deep Learning NVIDIA is seeking a Senior Software Test Development Engineer for its AI SWQA team. This role involves defining, developing, and executing tests to validate the robustness and performance of NVIDIA's AI software and GPU infrastructure across various AI applications like autonomous driving, healthcare, and NLP. The engineer will collaborate with AI product teams, develop complex test plans, manage bug lifecycles, and automate test cases for CI/CD pipelines. The position requires a Master's degree, 5+ years of QA/test automation experience, strong Python skills, and direct experience with AI tools/products or using AI for major features. Experience with AI for QA automation and deep learning frameworks is a plus. | Serve | 7 |
| Senior Solutions Architect, GPU System NVIDIA is seeking a Senior Solutions Architect with expertise in GPU server platforms and AI infrastructure to help customers design, deploy, and optimize NVIDIA-based AI factories. The role involves leading presales and architecture engagements, designing end-to-end AI data center solutions, and supporting the deployment of NVIDIA platforms for LLM training and inference workloads. | Serve, Agent | 7 |
| Solution Architect - Top AI Labs Solution Architect role focused on designing AI computing platform architectures and supporting top AI Labs and model builders in integrating NVIDIA technologies for Deep Learning, HPC, Robotics, and Signal Processing applications. Requires experience with ML, data analytics, computer vision, and parallel programming on cloud/HPC architectures. | Serve | 7 |
| Senior Robotics DevTech Engineer NVIDIA is seeking a Senior Robotics DevTech Engineer in China to act as a technical bridge between the robotics community and the global NVIDIA Omniverse platform team. The role involves supporting local robotics partners, building expertise in Isaac workflows, enabling partner iteration, and contributing to the product roadmap through market intelligence. Responsibilities include local ecosystem triage and support, developing expertise in robotics simulation tools, translating partner needs to engineering, and generating regional insights. | Agent, Data | 7 |
| Software Engineer, Robotics - Isaac Lab Software Engineer role focused on building and maintaining CI/CD pipelines, automation, and performance optimization for a large-scale robotics simulation and learning platform (Isaac Lab). The role involves infrastructure for ML and simulation systems, benchmarking, profiling, and supporting issue triage. | Data, Agent | 7 |
| Software Engineer, Robotics - Isaac Lab Software Engineer for NVIDIA's Isaac Lab team, focusing on developing and extending physics simulation APIs for robot learning. The role involves debugging simulation issues, translating research into APIs, and engaging with the robotics community. Requires extensive Python and deep learning stack experience, with a strong background in physics simulation or robotics control, and experience in reinforcement learning and imitation learning. | Data, Agent | 7 |
| Devtech Compute Engineer NVIDIA is seeking a Devtech Compute Engineer to develop performance-critical code for deep learning applications, focusing on accelerating model training and inference on GPUs, particularly for recommender systems. The role involves optimizing CUDA kernels, integrating solutions into open-source libraries, and collaborating with hardware teams to define future solutions across various domains like LLM, Recsys, Robotics, and Assisted Driving. | Serve, Data | 7 |
| Senior System Software Architect, AI and GPU Networking This role focuses on architecting and enhancing NVIDIA's GPU Networking offerings to accelerate AI workloads, including distributed AI, deep learning, inference, and model serving. It involves co-designing hardware features and leading the architecture and design of new technologies for AI data centers. | Serve, Post-train | 7 |
| Senior Developer Technology Engineer This role focuses on optimizing GPU-accelerated code for training and inference performance of large-scale recommender systems. It involves designing and implementing high-performance C++/CUDA components, developing tests, and optimizing data flows between GPUs, NICs, and SSDs. The ideal candidate has experience with C++, CUDA, Python, GPU performance profiling, and ideally, building or optimizing recommender systems or production ML workloads on GPUs. | Serve, Ship | 7 |
| HPC and AI Cluster Engineer NVIDIA is seeking an HPC and AI Cluster Engineer to manage and maintain large-scale HPC/AI clusters, including Linux job scheduling, CI/CD pipelines, and troubleshooting from bare metal to application level. The role involves supporting R&D activities and POCs, working with cutting-edge hardware and software, and collaborating with researchers and customers to develop solutions. | Serve | 7 |
| GPU Computing Engineer - Autonomous Driving NVIDIA is seeking a GPU Computing Engineer in Shanghai to analyze Deep Learning models and investigate TensorRT stability and performance issues. The role involves working with a global team on CUDA and TensorRT development, extracting feature requirements, and generating documentation. Requires strong C/C++/Python skills, knowledge of inference networks, and experience with deep learning frameworks like PyTorch. | Serve | 7 |
| Deep Learning Performance Software Engineer NVIDIA is seeking a Deep Learning Performance Software Engineer to develop GPU-accelerated deep learning software, focusing on optimizing deep learning kernels and end-to-end performance through tile-based GPU programming. The role requires strong C/C++ skills, GPU programming experience (CUDA or OpenCL), and performance modeling/optimization knowledge. | Serve | 7 |
| System Software Engineer - Autonomous Vehicles System Software Engineer specializing in self-driving vehicle technology, responsible for integrating and maintaining end-to-end software for autonomous driving systems, including Sensing/Perception/Localization/Planning/Control, and optimizing vertical stack performance. | Ship | 7 |
| Senior Prediction Software Engineer - Autonomous Vehicles Senior Software Engineer role focused on developing and productizing autonomous driving features, specifically in prediction and planning. Requires C++ experience, understanding of physics and control systems, and experience with deep learning-based systems for autonomous vehicles. | Ship | 7 |
| Senior DGX Cloud AI Infrastructure Software Engineer Senior Software Engineer role focused on building and integrating AI infrastructure for DGX Cloud, enabling developers to access GPU-optimized virtual machines. Responsibilities include crafting IaaS API integrations, developing a two-sided marketplace, and improving testing and observability for scalable, fault-tolerant solutions. | Serve | 7 |
| Senior Software Engineer, Humanoid Robotics Senior Robotics Software Engineer to build the platform for Physical AI robots, enabling sim-first development, real-world deployment, and continuous learning. Role involves integrating NVIDIA robotics products and spearheading efforts in Shanghai. | Ship, Agent | 7 |