Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Deep Learning Compiler Engineer NVIDIA is seeking a Senior Deep Learning Compiler Engineer to develop compiler optimization algorithms for deep learning networks. This role involves collaborating with deep learning software framework and hardware architecture teams to accelerate next-generation deep learning software, focusing on public APIs, performance, and compiler infrastructure for neural networks. | Serve | 7 |
| Senior Software Engineer - Deep Learning Compiler CI Infrastructure Senior Software Engineer to own and evolve CI/CD infrastructure for NVIDIA's deep learning compiler stacks. Responsibilities include designing and operating scalable CI systems for ML workloads, delivering performance signals, and applying AI/agent-based workflows to improve developer efficiency and triage. | Serve |
| 7 |
| SoC Product Architect, Telecom AI RAN NVIDIA is seeking a Lead SoC Product Architect for their Telecom AI RAN platform, focusing on defining the architecture and roadmap for radio and distributed unit products. The role involves analyzing workloads, driving competitive analysis, synthesizing customer requirements, and collaborating with engineering teams to ensure efficient implementation of AI-native RAN applications. The ideal candidate will have extensive experience in wireless RAN/baseband architecture or SoC product definition, with a strong understanding of 3GPP RAN standards and L1/PHY algorithms. | Serve | 7 |
| Senior System Software Engineer - Neural Graphics Performance Senior System Software Engineer focused on optimizing neural graphics performance, specifically Gaussian Splatting and neural reconstruction algorithms, for applications in robotics, healthcare, and AV development. The role involves implementing and optimizing reconstruction/rendering algorithms using CUDA and Slang, optimizing data processing pipelines, and influencing software architecture for performance. | ServeData | 7 |
| Senior System Software Engineer - Dynamo-Triton Inference Server Senior System Software Engineer to work on Dynamo-Triton Inference Server, a GPU-accelerated AI inference serving platform. The role involves developing high-performance inference software, contributing to feature development, driving customer adoption, and optimizing throughput and latency for both LLM and non-LLM workloads. | Serve | 7 |
| Senior Software Performance Engineer - AV Platform Senior Software Performance Engineer for Autonomous Vehicles platform, focusing on optimizing latency and throughput of L2/L3/L4 autonomous driving solutions on NVIDIA's heterogeneous hardware architectures. Requires strong C++ skills, parallel programming, performance analysis, and experience with GPGPU/CUDA. | ServeAgent | 7 |
| Senior AI and ML HPC Cluster Engineer This role focuses on designing, implementing, and managing large-scale GPU compute clusters for AI/ML and HPC workloads. It involves infrastructure engineering, automation, and supporting researchers with performance analysis and optimization. The role requires expertise in cluster management, Linux administration, container technologies, scripting, and MPI workflows. | Serve | 7 |
| Manager, Software Architecture Manager for a systems and networking engineering team focused on building distributed AI communication systems (libraries, frameworks, system integrations) for GPUs, nodes, and storage. The role involves setting technical direction, leading execution, and fostering technical excellence within the team, with a focus on AI infrastructure problems. | Serve | 7 |
| Senior Software Architect, AI Systems and Networking This role focuses on building and optimizing systems-level software for high-performance communication and memory management libraries essential for distributed AI workloads. It involves hardware-software co-optimization, profiling data movement, and integrating networking capabilities into AI serving stacks, bridging applied research and production engineering. | Serve | 7 |
| Deep Learning Kernel Software Performance Architect - New College Grad 2026 NVIDIA is seeking a Deep Learning Kernel Software Performance Architect to develop and analyze processor and system architectures that accelerate machine learning and data analytics applications. The role involves debugging deep learning software, developing analysis tools, and collaborating with various NVIDIA teams to optimize performance. | Serve | 7 |
| Senior Manager, Site Reliability Engineering Senior Manager of Site Reliability Engineering to lead and reshape IT operations at scale, building AI-powered systems for reliability, speed, and employee experience. Focuses on transforming Incident, Problem, and Change Management using observability, AI insights, and orchestration to move towards predictive and autonomous operations. | Serve | 7 |
| Senior Software Engineer, Machine Learning Inference Senior Software Engineer role focused on designing and implementing inference software optimizations for NVIDIA TensorRT and TensorRT-LLM to accelerate AI applications on NVIDIA GPUs. Involves C++, Python, and CUDA development, collaboration with AI experts, and optimization of deep learning frameworks and compilers. | Serve | 7 |
| Senior Math Libraries Engineer - Sparsity in AI Software engineer to design and develop C++ libraries and tools for unstructured sparsity in Deep Learning (DL) and High-Performance Computing (HPC) on NVIDIA GPUs. This involves DSL specifications, on-demand code generation, and enabling the system in Python/PyTorch. The role focuses on performance evaluation, library quality, and collaboration with product management. | Serve | 7 |
| Senior Software Engineer, JAX Senior Software Engineer focused on performance optimizations for JAX, a deep learning framework, to build a scalable platform for data, training, and analysis. The role involves developing core JAX components, working with AI researchers, and building tools to improve AI system development efficiency. | Serve | 7 |
| Senior AI and FSI Developer Technology Engineer Senior AI and FSI Developer Technology Engineer at NVIDIA focused on optimizing AI and HPC workloads on NVIDIA CPUs and GPUs for the financial services industry. The role involves researching, designing, and developing techniques to accelerate these workloads, profiling and eliminating performance bottlenecks, and collaborating with internal and external experts to influence future hardware and software designs. The engineer will also publish and present their work. | Serve | 7 |
| Senior Software Engineer - NIM Factory Container and Cloud Infrastructure Senior Software Engineer role focused on container and cloud infrastructure for NVIDIA Inference Microservices (NIMs) and hosted services. The role involves designing and implementing container strategies, building enterprise-grade software for container build, packaging, and deployment, and improving reliability, performance, and scale across thousands of GPUs, with a focus on disaggregated LLM inference. | Serve | 7 |
| Senior Math Libraries Engineer – AI and HPC Senior engineer to join NVIDIA's Math Libraries team, focusing on kernel generation for AI and HPC, specifically matrix operations, JITing, and fusions. The role involves designing and implementing high-performance numerical dense linear algebra software on GPUs, providing technical leadership, and collaborating with product management. | Serve | 7 |
| Senior Manager, GPU Cloud Infrastructure - GeForce NOW Senior Manager to lead the design, scaling, and operations of high-performance networking for GPU-based cloud infrastructure, critical for cloud gaming, AI/ML training, and inference platforms. | Serve | 7 |
| Senior Staff AI Platform Engineer Senior Staff AI Platform Engineer at NVIDIA responsible for building, supporting, and maintaining AI-native infrastructure for enterprise products. This role involves architecting and scaling LLM/ML infrastructure, designing observability for AI models, developing automation, and troubleshooting complex distributed systems. The engineer will also drive AI-assisted engineering practices and partner with product teams to deliver scalable AI solutions. | ServeAgent | 7 |
| Senior Software Engineer, Deep Learning Inference - Automotive Safety Senior Software Engineer focused on developing high-performance deep learning inference software for safety-critical automotive applications using C++. The role involves integrating hardware functionalities into TensorRT, optimizing performance, and ensuring rigorous safety validation and documentation. | Serve | 7 |
| Senior Software Engineer, Deep Learning Inference - TensorRT NVIDIA is seeking a Senior Software Engineer to develop and scale a state-of-the-art inference framework for accelerating Deep Learning models, particularly LLMs, on NVIDIA GPUs using TensorRT. The role involves crafting inferencing software, developing components of TensorRT, and optimizing the deployment of trained models using C++ and Python. | Serve | 7 |
| Senior MLOps Engineer, GenAI Framework This role focuses on building and maintaining CI/CD pipelines and release processes for NVIDIA's GenAI frameworks (Megatron-LM, NeMo). It involves implementing scalable DevOps solutions, managing infrastructure (Kubernetes, Docker, Slurm), automating tasks for research and development cycles, and developing quality control measures. The goal is to enable efficient work for GenAI software engineers, DL algorithm engineers, and research scientists, optimizing performance and ensuring high-quality software delivery. | Serve | 7 |
| Senior Systems Performance Engineer Senior Systems Performance Engineer at NVIDIA focused on validating and optimizing GPU accelerated computing products, specifically for Deep Learning/AI applications. The role involves system architecture, performance modeling, and developing stress/performance testing strategies for ML/LLM workloads. | Serve | 7 |
| Senior Software Engineer - NIM Platform SDK and Framework Senior Software Engineer to own and evolve the core NIM Platform SDK and microservice framework, powering NVIDIA Inferencing Microservices (NIM). Focus on high-performance systems programming, multi-cloud abstractions, and API framework development for production-ready AI inference at scale. | Serve | 7 |
| Senior Software R&D Engineer, Digital Logic Synthesis NVIDIA is seeking an EDA Software R&D Engineer to develop internal EDA tools by fusing advances in parallel computing, machine learning, and novel algorithms in C++. The role involves inventing and developing new algorithms for RTL synthesis, digital logic optimization, and physical-aware synthesis techniques, with a focus on prototyping and evaluating ML methods to guide optimization decisions and integrating successful approaches into production. | Serve | 7 |
| Senior ASIC Methodology Engineer - LPU Division This role focuses on inventing and pioneering AI-driven hardware development methodologies for ASICs, aiming to improve predictability, convergence, and turnaround time in the ASIC development lifecycle. The engineer will leverage data to enable AI models and analytics, establish metrics for improvement, share best practices, and track advances in AI, EDA, and hardware design research. | Serve | 7 |
| Senior AI Developer Technology Engineer, Financial Sector Senior AI Developer Technology Engineer focused on optimizing AI and HPC workloads for financial markets on NVIDIA's computing platforms. This role involves research, development, performance analysis, and collaboration with the developer community and internal teams to influence hardware and software design. | Serve | 7 |
| Neural Graphics Engineer NVIDIA is seeking a Neural Graphics Engineer to work on technologies at the intersection of AI and real-time rendering. The role involves implementing and optimizing neural graphics techniques, prototyping neural rendering and generative 3D approaches, and contributing to the graphics software stack. Experience with C++, Python, computer graphics, and machine learning is required, with a preference for hands-on experience in neural rendering or generative AI for 3D content. | ServeData | 7 |
| Platform Architecture Engineer, GeForce NOW This role focuses on architecting and optimizing cloud infrastructure for AI workloads, specifically for the GeForce NOW service. The engineer will perform deep performance and power analysis of GPU/CPU microarchitecture for AI inference, deploy and optimize AI/gaming kernels, and build models to guide platform decisions balancing performance, power, and cost. The role requires strong programming skills and experience with AI models and performance analysis methodologies. | Serve | 7 |
| Senior Deep Learning Kernel Software Performance Architect Senior Kernel Performance Architect for Deep Learning Software at NVIDIA, focusing on crafting and prototyping GPU-accelerated system architectures to optimize deep learning and data analytics workloads. Requires expertise in kernel performance, math libraries, GPU computing, and parallel programming. | Serve | 7 |