Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a 50% decrease versus the prior 30 days (218 → 110).
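The 30-day comparison above is a simple period-over-period percentage change. A minimal sketch of that arithmetic follows, assuming the two counts quoted in the text (218 and 110); the function name `pct_change` is hypothetical and is not taken from the tracker itself.

```python
def pct_change(previous: int, current: int) -> float:
    """Return the percentage change from `previous` to `current`."""
    if previous == 0:
        # A change relative to zero is undefined, so fail loudly.
        raise ValueError("previous period count must be non-zero")
    return (current - previous) / previous * 100.0


# Counts quoted in the text: 218 roles in the prior 30 days, 110 in the latest.
change = pct_change(218, 110)
print(f"{change:.1f}%")  # roughly -49.5%, which rounds to the quoted -50%
```

The unrounded figure is slightly under 50%, so the headline number is a rounded value.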
| Title | Stage | AI score |
|---|---|---|
| Compute Architecture Software Engineer NVIDIA is seeking an LLM Inference Software Engineer to accelerate LLM inference using GPU technology on the TRTLLM project. The role involves developing and optimizing software solutions, implementing GPU-based algorithms, and improving performance across diverse computing environments. | Serve | 8 |
| Software Engineer, cuDNN - Deep Learning Software Engineer role focused on developing and optimizing cuDNN, a GPU-accelerated library for deep neural networks, including LLM support. The role involves performance analysis, tuning, and collaboration with cross-functional teams to innovate across various AI applications. | Serve | 8 |
| Senior AI Storage Software Architect NVIDIA is seeking a Senior AI Storage Software Architect to define and design the next generation of storage solutions for AI workloads, including training, inferencing, KV cache, and RAG. The role involves researching AI storage workloads, optimizing them, designing the storage software stack and APIs, leading POCs, and driving hardware features for DPUs and NICs. Requires 5+ years of storage experience and familiarity with AI applications and technologies. | Serve, Data | 8 |
| Senior Manager, Engineering - AI Developer Tools Senior Engineering Manager to lead a team building and evolving AI developer tools and technology for local and cloud GPUs, focusing on the developer experience for AI workflows and managing AI workloads on accelerated infrastructure. | Serve, Agent | 7 |
| Senior Software Engineer, AI Developer Tools Senior Software Engineer to craft intuitive AI developer tools that make advanced AI workflows accessible and scalable across diverse accelerated infrastructure. | Agent | 7 |
| Senior DL Compiler Engineer -CUDA Tile NVIDIA is hiring a Senior DL Compiler Engineer for the CUDA Tile team. This role involves designing and implementing compiler transformations, developing MLIR-based dialects and lowering passes, and optimizing performance for tile-based kernels on NVIDIA GPUs. The CUDA Tile programming model is a new addition to CUDA, shipped with CUDA 13.1. | Serve | 7 |
| Senior Software Engineer - Storage Software Engineer role focused on designing, building, and operating exascale infrastructure for AI research and development at NVIDIA. The role involves managing distributed systems, large-scale storage, compute orchestration, and automation to support AI workloads across thousands of GPUs and petabytes of storage. | Serve | 7 |
| Principal Developer, AI Networking This role focuses on optimizing AI workloads, specifically LLM training and inference, on large-scale GPU and CPU clusters. The core responsibility is to profile, analyze, and optimize the performance of distributed systems with a strong emphasis on high-performance networking and communication libraries. The engineer will develop tools for performance analysis and collaborate across hardware and software teams to identify and resolve bottlenecks. | Serve, Pretrain | 7 |
| AI Automation Engineer, Security NVIDIA is seeking an AI Automation Engineer to build an AI-native, agent-enabled security organization. The role involves developing AI agents for security programs, building and maintaining infrastructure for agent workflows, translating business needs into agent solutions, architecting integrations for agents to interact with data systems, owning ETL and agentic data pipelines, and ensuring data security and governance. The engineer will also monitor and optimize data infrastructure, pipelines, and agents, and mentor other engineers. | Agent, Data | 7 |
| Software R&D Engineer, RTL Optimization Tools Software R&D Engineer at NVIDIA focused on developing internal EDA tools for RTL optimization. The role involves fusing parallel computing, machine learning, and novel algorithms to improve hardware design productivity. It explores the use of LLMs, GNNs, GANs, and Reinforcement Learning for optimization tasks, and requires strong C++ development skills with a focus on graph-based algorithms and optimization. | Serve | 7 |
| Senior Software Engineer, AI Speed Infrastructure Senior Software Engineer to build AI speed infrastructure for Tegra, focusing on a fast build, test, and validation system. The role involves designing AI-native, self-healing CI workflows, integrating reasoning agents for failure triage and automation, and optimizing the entire code-to-merge pipeline for C/C++ codebases, with a strong emphasis on performance engineering and developer experience. | Agent | 7 |
| Senior Systems Software Engineer, Kubernetes Scale - DGX Cloud Senior Systems Software Engineer focused on scaling NVIDIA DGX Cloud's AI infrastructure, specifically optimizing Kubernetes and distributed inference serving for performance, cost, and reliability. The role involves end-to-end performance characterization, developing automated tests for AI workloads, debugging complex distributed systems, and contributing to open-source communities. | Serve, Agent | 7 |
| Senior Software Engineer, Mapping - Autonomous Vehicles NVIDIA is seeking a Senior Software Engineer for their Autonomous Vehicles mapping team. The role involves designing and developing algorithms for map-based driving products, including architecture design, efficient C++ development, and integrating algorithmic solutions. Key responsibilities include researching and developing transformer-based models for graphs, implementing evaluation frameworks for LLMs, fine-tuning pre-trained models, and building automated map content analysis and map-building workflows. The role requires a background in computer vision, 3D geometry, and machine learning, with heavy AI tool usage for development, and strong prompt-crafting skills. The position is focused on building AI-powered solutions for self-driving cars, with a primary focus on agentic systems for navigation and map content, and secondary involvement in model fine-tuning. | Agent, Post-train | 7 |
| GPU Architect - New College Grad 2026 NVIDIA is seeking new college graduates for its GPU Architecture Group to design and validate GPU profiling and performance telemetry features. The role involves hardware modeling, test development, and infrastructure, with a focus on the world's leading AI platform. Responsibilities include building and maintaining hardware models, writing and executing test plans, contributing to development infrastructure, and collaborating with cross-functional teams. | Serve | 7 |
| GPU System Architect NVIDIA is seeking a GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves defining system architectures that tightly couple GPU compute, memory, and interconnects for optimal AI performance, scalability, and resilience. Responsibilities include architecting system topologies, defining high-speed interconnects, collaborating on RDMA hardware, using system models for analysis, and enabling hardware-software co-design. | Serve | 7 |
| NBU Manufacturing Test Engineer NVIDIA is seeking a Manufacturing Test Engineer to design tools for product definition, data collection, test case execution, and results analysis. The role involves driving NBU diagnosis, analyzing test issues, qualifying equipment, and automating NBU product testing using AI and machine learning techniques. The engineer will also provide feedback on debug tools and support NBU product test setup. | Serve | 7 |
| Senior ML Platform Engineer Senior ML Platform Engineer at NVIDIA responsible for architecting, building, and scaling high-performance ML infrastructure using Infrastructure-as-Code (IaC) practices. The role focuses on creating reliable, automated platforms for training and deploying advanced ML models on GPU systems, applying SRE principles, and developing internal automation for ML workflows. Requires strong software engineering skills in Python/Go, experience with Kubernetes/Docker, and a solid understanding of ML workflows. | Serve | 7 |
| Senior Software Engineer - Developer Tools for Deep Learning Senior Software Engineer to enhance NVIDIA's developer tools for deep learning, focusing on neural network design and performance efficiency. The role involves partnering with management and architects, staying updated on research, and working with SOTA computer vision and LLMs. | Serve, Post-train | 7 |
| Senior Deep Learning Hardware Modeling Architect - LPU NVIDIA is seeking a Senior Deep Learning Hardware Modeling Architect to optimize AI inference speed and efficiency. The role involves driving architectural specifications, developing written specifications for component-level and system-level designs, and embodying these specifications in an executable model. The candidate will ensure high performance using C++ software practices, solid algorithms, and parallelism, and resolve performance and correctness issues across chip and hardware subsystems. | Serve | 7 |
| Senior AI Infrastructure Engineer - DGX Cloud Senior AI Infrastructure Engineer responsible for designing, building, and maintaining large-scale production systems for NVIDIA's DGX Cloud, focusing on AI training and inferencing platforms. This role involves infrastructure automation, distributed systems, performance characterization, and ensuring reliability and availability of GPU cloud services. | Serve | 7 |
| Senior Compiler Engineer - DL NVIDIA is seeking a Senior Compiler Engineer for its Deep Learning Compiler (DLC) team. This role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and hardware teams to accelerate deep learning inference performance. The compiler is critical for data centers, personal devices, automotive, and robotics, aiming for leading inference performance, fast build times, and reduced memory footprints. | Serve | 7 |
| Deep Learning Performance Architect, CUTLASS DSL NVIDIA is seeking an engineer to develop and optimize CUTLASS DSL, a Python-native language for GPU kernel development, and its associated MLIR dialects and lowering passes. The role involves accelerating kernel compilation for NVIDIA's next-generation AI platforms, aiming for performance comparable to CUTLASS C++. | Serve | 7 |
| Senior Software Architect, GPU Networking Research NVIDIA is seeking a Senior Software Architect to focus on GPU Networking Research for accelerating AI workloads and building AI data centers. The role involves leading vision, architecture, design, and proof-of-concept development for future GPU Networking offerings, identifying new technologies, and working with the community. Requires M.Sc./Ph.D. or equivalent experience, 8+ years in systems architecture, and experience in virtualization, networking, storage, and OS drivers. Experience in performance profiling, optimization, and HW offloads is crucial. A research track record and knowledge of Deep Learning frameworks are desirable. | Serve | 7 |
| Principal Simulation Engineer, Industrial Physics and Robotics NVIDIA is seeking a Principal Simulation Engineer to lead the development of advanced physically based simulation systems for robotics and industrial digital twins. This role requires deep expertise in multibody dynamics, contact, friction, and flexible bodies, with a focus on integrating simulation with robotics workflows and applying modern AI-assisted and agentic development. The ideal candidate has a track record of building production-level simulation software and experience validating simulators against physical systems. | Agent | 7 |
| Senior Software Engineer, CUTLASS Kernels Senior Software Engineer to develop and optimize high-performance deep learning kernels (e.g., GEMM, attention, convolution) using CUTLASS CUDA C++ and Python DSL for NVIDIA GPUs and future architectures. The role involves optimizing kernels for peak throughput, collaborating with various NVIDIA teams (architecture, compiler, libraries, DL frameworks), and requires strong C++ and CUDA experience, understanding of computer architecture, and experience with parallel programming languages targeting accelerators. | Serve | 7 |
| Senior Software Engineer, CUTLASS Performance Senior Software Engineer role focused on optimizing the performance of CUTLASS, a high-performance linear algebra and Tensor Core primitive ecosystem for NVIDIA GPUs. The role involves benchmarking deep learning models, identifying performance gaps, developing tooling for optimization, and acting as a performance representative across NVIDIA teams. | Serve | 7 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architecture, analyze performance of deep learning algorithms on different architectures, identify bottlenecks, and explore new features and hardware capabilities. Requires a strong background in computer architecture and experience with deep learning platforms and frameworks. | Serve | 7 |
| Principal Architect, System Software - Orbital Data Center NVIDIA is seeking a Principal Architect to lead the system software architecture for their Orbital Data Center (ODC) modules, specifically Space-1. This role involves designing and implementing a resilient, production-ready inference platform for the harsh environment of low-Earth orbit, covering the full stack from firmware to AI workloads. The architect will collaborate with hardware teams, drive customer use cases, and ensure the platform operates reliably for 5-year missions, enabling AI adoption in space. | Serve | 7 |
| Software Engineer, TensorRT Specialized Platforms - New College Grad 2025 Software Engineer role focused on developing and optimizing high-performance deep learning inference software (TensorRT) for specialized platforms. Requires strong C++ skills, familiarity with deep learning frameworks, and interest in performance optimization and systems programming. | Serve | 7 |
| Senior Software Engineer, Agentic Systems Senior Software Engineer to build NeMo Platform, focusing on NeMo Evaluator for developing, evaluating, deploying, and operating AI systems at scale. The role involves designing and implementing Python APIs, SDK workflows, and plugin interfaces for building, measuring, and improving agents, with a strong emphasis on agentic development and automated improvement. | Eval Gate, Agent | 7 |
| Senior Datacenter Performance Model Engineer Develops datacenter-scale performance modeling and prediction tools for AI researchers running AI workloads on GPU clusters. Involves building production tools, automating workflows, and partnering with architects. | Serve | 7 |
| Infrastructure Software Engineer, Deep Learning Libraries NVIDIA is seeking an Infrastructure Software Engineer to enable next-generation deep learning libraries by designing and developing scalable automation for build, test, integration, and release processes. The role involves developing and deploying AI agents to automate the software development cycle and configuring industry-standard tools, with a focus on open-source products like CUTLASS. | Agent | 7 |
| Deep Learning Compiler Engineer - CUDA NVIDIA is seeking a Deep Learning Compiler Engineer to design and implement DSLs and compiler cores for emerging GPU architectures, focusing on optimizing performance for AI/LLM workloads and integrating with AI/ML frameworks. | Serve | 7 |
| Developer Technology Engineer, AI NVIDIA Developer Technology Engineer focused on optimizing AI and deep learning applications on GPU architectures, working with customers to provide AI solutions, and collaborating with internal teams to influence future hardware and software design. | Serve | 7 |
| Senior Software Engineer, Test - Autonomous Vehicles NVIDIA is seeking a Senior Software Engineer, Test for their Autonomous Vehicles team. The role focuses on developing and productizing autonomous vehicle solutions, with a strong emphasis on building and scaling simulation environments for testing and evaluation. The engineer will lead efforts in test architecture, tool development, and automation, working with C++/Python and collaborating across teams. Experience with AI/ML systems is a plus. | Agent | 7 |
| Lead Algorithm Engineer, Map-Perception Fusion Lead engineer for autonomous vehicle systems, focusing on fusing map and perception data to create real-time 3D world models for navigation. This role involves architecting scalable systems, advancing mapless driving capabilities, and improving scenario understanding through techniques like static obstacle modeling and occupancy grids. | Agent | 7 |
| Senior Systems Software Engineer - Omniverse Senior Systems Software Engineer to build next-generation AI-driven developer and robotics workflows at NVIDIA. Role involves technical leadership, software architecture, and implementation of scalable libraries, features, and services. Focus on AI-assisted development, code generation, CI/CD, and integration with robotics platforms like Isaac or ROS. | Agent | 7 |
| Senior HPC AI Cluster Engineer NVIDIA is seeking an experienced HPC-AI Engineer to join their Networking Clusters Solutions Infrastructure team. The role involves designing, implementing, and maintaining large-scale HPC/AI clusters, managing job schedulers, developing CI/CD pipelines, and automating infrastructure deployment and monitoring. The engineer will work with cutting-edge hardware and software, support R&D, and engage in POCs for future improvements. | Serve | 7 |
| Senior Power Analysis and Optimization Engineer This role focuses on applying AI, ML, and LLMs to optimize power efficiency in NVIDIA's GPUs and SoCs. The engineer will develop and productionize ML/RL-based models for power analysis and optimization, design and train custom LLMs for interpreting power data and recommending improvements, and apply AI to tune power-efficient configurations. The role involves analyzing power data, partnering with cross-functional teams, and automating flows. | Serve, Data | 7 |
| Senior Software Engineer, Driving Behavior and Multi-Vehicle Adaptation – Autonomous Vehicles NVIDIA is seeking a Senior Software Engineer for their China Autonomous Driving Team to own the driving behavior of their autonomous driving stack across multiple production programs. This role involves deep root-cause analysis, adapting planning and control algorithms to diverse vehicle platforms, and building automation tools. The engineer will also perform on-vehicle testing, tune real-world performance, and collaborate with OEM partners to deliver safe, comfortable, and production-ready autonomous driving behavior. | Agent, Serve | 7 |
| Senior Software Engineer, Context Fusion and Multi-Vehicle Adaptation - Autonomous Vehicles Senior Software Engineer role at NVIDIA focusing on Context Fusion and Multi-Vehicle Adaptation for Autonomous Vehicles. The role involves analyzing and resolving fusion issues, adapting fusion logic across different vehicle platforms and environments, building debugging and validation workflows, and collaborating with global teams and OEM partners. Requires strong system-level debugging, C/C++ skills, and experience in autonomous driving or robotics. | Agent | 7 |
| Software Manager, Planning and Control - Autonomous Vehicles Software Manager for Planning and Control in Autonomous Vehicles at NVIDIA, leading a team to productize and deliver ADAS and autonomy functions. Responsibilities include setting algorithmic direction, designing software architecture, building testing infrastructure, and managing a team of developers. Requires significant software product and management experience, C++/C proficiency, and Agile/Linux environment familiarity. Experience shipping ADAS/Autonomy functions, building from scratch, robust testing infrastructure, algorithm development for physical systems, and automotive systems are highly desirable. | Ship | 7 |
| Senior Software Engineer — cuEquivariance Senior Software Engineer to join the cuEquivariance team, which builds and ships production GPU kernels and software interfaces for equivariant deep learning. The role involves CUDA kernel engineering, Python library development (PyTorch/JAX), and collaboration with research teams and external framework developers to accelerate geometric neural networks on NVIDIA GPUs. | Serve | 7 |
| Senior Staff Engineer, Enterprise SaaS Platform and Automation Senior Staff Engineer role focused on building agentic systems and AI-powered automation for enterprise SaaS platforms, including onboarding, support, and vendor integration. The role involves designing and implementing solutions using Python, APIs, and LLMs to reduce manual processes and improve employee productivity. | Agent | 7 |
| Senior Software Engineer - Simulation Senior Software Engineer role focused on building scalable 3D simulation software for Digital Twin and Synthetic Data Generation applications, collaborating on backend services and AI Agents for end-to-end SDG solutions. Requires strong C/C++/Python skills, 3D simulation experience, and proficiency in physics game engines and containerization tools. | Agent | 7 |
| Senior Software Engineer - Simulation Senior Software Engineer role focused on building scalable 3D simulation software for Digital Twin and Synthetic Data Generation applications, collaborating with teams to build backend services and AI Agents for end-to-end SDG solutions. Requires strong programming skills in C/C++, Python, and experience with 3D simulation and physics engines. | Agent | 7 |
| Senior AI Security Architect NVIDIA is seeking a Senior AI Security Architect to define and evolve secure SDLC methodologies for AI products using AI-assisted and agentic development environments. The role involves designing frameworks for secure adoption of AI coding assistants and autonomous agents, architecting secure agentic development environments, and integrating AI-native workflows into the SDLC. The candidate will also define security guardrails and validation mechanisms for AI-assisted software development processes. | Agent | 7 |
| Senior System Software Engineer - AI Performance and Efficiency Tools NVIDIA is seeking a Senior System Software Engineer to develop tools for AI researchers and SW/HW teams running AI workloads on GPU clusters. The role involves building internal profiling, analysis, debugging, benchmarking, and simulation tools to improve the performance and efficiency of AI workloads and systems. This includes partnering with HW architects and understanding deep learning frameworks, distributed training/inference, and GPU cluster technologies. | Serve, Data | 7 |
| Software Solutions Engineer NVIDIA is seeking a Software Solutions Engineer to support NVIDIA AI Enterprise customers. This role involves end-to-end customer issue resolution and building software features, automation, and deployment tooling to enhance product readiness and scalability in cloud and datacenter environments. The engineer will work with compute, cloud-native technologies, and GPU-accelerated AI frameworks, requiring strong debugging, communication, and ownership skills. | Serve, Agent | 7 |
| Senior Systems Software Engineer - GPU Performance at Scale Senior Systems Software Engineer focused on GPU performance at scale for AI workloads. This role involves leading performance practices, aligning AI workloads with hardware, developing insights into AI workload performance, debugging complex issues, and collaborating with various software and firmware teams to optimize AI workload performance on NVIDIA GPUs. | Serve | 7 |