Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Solutions Architect, Generative AI Deployment and AIOps Senior Solutions Architect focused on deploying Generative AI and LLMs, optimizing inference performance on Kubernetes, and collaborating with customers and internal teams on NVIDIA's AI platforms. | Serve | 7 |
| Senior Deep Learning Test Development Engineer, SDET Senior Deep Learning Test Development Engineer (SDET) at NVIDIA's AI SWQA team, responsible for validating the robustness and performance of NVIDIA's AI software and GPU Infrastructure across various AI scenarios. The role involves test planning, design, execution, automation, and bug management, with a focus on improving workflow processes and efficiency. Experience with LLM inference frameworks and AI development tools is required. | Serve |
| Senior Staff AI Platform Engineer Senior Staff AI Platform Engineer at NVIDIA responsible for building, supporting, and maintaining AI-native infrastructure for enterprise products. This role involves architecting and scaling LLM/ML infrastructure, designing observability for AI models, developing automation, and troubleshooting complex distributed systems. The engineer will also drive AI-assisted engineering practices and partner with product teams to deliver scalable AI solutions. | ServeAgent | 7 |
| Senior Engineer - Deep Learning Compiler Verification and Infrastructure Senior Engineer focused on Deep Learning Compiler Verification and Infrastructure at NVIDIA. The role involves implementing compiler verification software and related infrastructure to accelerate deep learning workloads, working closely with compiler developers on functional and performance testing, and applying deep learning techniques to verification solutions. | Serve | 7 |
| Senior Software Engineer, Deep Learning Inference - Automotive Safety Senior Software Engineer focused on developing high-performance deep learning inference software for safety-critical automotive applications using C++. The role involves integrating hardware functionalities into TensorRT, optimizing performance, and ensuring rigorous safety validation and documentation. | Serve | 7 |
| Senior Software Engineer, Deep Learning Inference - TensorRT NVIDIA is seeking a Senior Software Engineer to develop and scale a state-of-the-art inference framework for accelerating Deep Learning models, particularly LLMs, on NVIDIA GPUs using TensorRT. The role involves crafting inferencing software, developing components of TensorRT, and optimizing the deployment of trained models using C++ and Python. | Serve | 7 |
| Senior Software Test Development Engineer - Deep Learning NVIDIA is seeking a Senior Software Test Development Engineer for its AI SWQA team. This role involves defining, developing, and executing tests to validate the robustness and performance of NVIDIA's AI software and GPU infrastructure across various AI applications like autonomous driving, healthcare, and NLP. The engineer will collaborate with AI product teams, develop complex test plans, manage bug lifecycles, and automate test cases for CI/CD pipelines. The position requires a Master's degree, 5+ years of QA/test automation experience, strong Python skills, and direct experience with AI tools/products or using AI for major features. Experience with AI for QA automation and deep learning frameworks is a plus. | Serve | 7 |
| Senior Software Test Development Engineer - Deep Learning NVIDIA is seeking a Senior Software Test Development Engineer for their Deep Learning SWQA team. This role involves defining, developing, and executing tests to validate the robustness and performance of NVIDIA's Deep Learning software and GPU infrastructure across various AI applications. Responsibilities include collaborating with AI product teams, developing complex test plans, automating test cases, and managing the bug lifecycle. The ideal candidate has 6+ years of QA/test automation experience, scripting skills, C/C++ development, and understanding of Deep Learning frameworks and models, particularly in end-to-end customer scenarios. | Serve | 7 |
| Senior AI Inference Compiler Engineer NVIDIA is seeking a Senior AI Inference Compiler Engineer to develop compiler IR, programming models, and optimizations for future GPU architectures, focusing on delivering leading inference performance for deep learning models. The role involves collaborating with deep learning software framework and hardware architecture teams to accelerate next-generation AI software, defining APIs, optimizing performance, and generating kernels for neural networks. | Serve | 7 |
| Senior MLOps Engineer, GenAI Framework This role focuses on building and maintaining CI/CD pipelines and release processes for NVIDIA's GenAI frameworks (Megatron-LM, NeMo). It involves implementing scalable DevOps solutions, managing infrastructure (Kubernetes, Docker, Slurm), automating tasks for research and development cycles, and developing quality control measures. The goal is to enable efficient work for GenAI software engineers, DL algorithm engineers, and research scientists, optimizing performance and ensuring high-quality software delivery. | Serve | 7 |
| System Software Engineer, Python and C/C++ - Deep Learning System Software Engineer role at NVIDIA focused on deep learning, data analytics, and machine learning. The role involves researching, prototyping, developing, and optimizing solutions, tools, and libraries. It also includes analyzing and improving deep learning libraries and frameworks, defining APIs, and performance tuning. The position requires strong Python and C/C++ programming skills, experience in complex system design, and knowledge of algorithms and data structures. The role is primarily focused on the engineering and optimization of AI infrastructure and tools. | Serve | 7 |
| Architecture Energy Modeling Engineer - New College Grad 2026 NVIDIA is seeking an Architecture Energy Modeling Engineer to develop and deploy energy-efficient methodologies for their GPUs. This role involves building Machine Learning based power models, integrating them into simulators, and analyzing energy consumption of AI workloads to influence architectural improvements. | Serve | 7 |
| Senior Solutions Architect, GPU System NVIDIA is seeking a Senior Solutions Architect with expertise in GPU server platforms and AI infrastructure to help customers design, deploy, and optimize NVIDIA-based AI factories. The role involves leading presales and architecture engagements, designing end-to-end AI data center solutions, and supporting the deployment of NVIDIA platforms for LLM training and inference workloads. | ServeAgent | 7 |
| Solution Architect - Top AI Labs Solution Architect role focused on designing AI computing platform architectures and supporting top AI Labs and model builders in integrating NVIDIA technologies for Deep Learning, HPC, Robotics, and Signal Processing applications. Requires experience with ML, data analytics, computer vision, and parallel programming on cloud/HPC architectures. | Serve | 7 |
| Senior Systems Performance Engineer Senior Systems Performance Engineer at NVIDIA focused on validating and optimizing GPU accelerated computing products, specifically for Deep Learning/AI applications. The role involves system architecture, performance modeling, and developing stress/performance testing strategies for ML/LLM workloads. | Serve | 7 |
| Senior Software Engineer - NIM Platform SDK and Framework Senior Software Engineer to own and evolve the core NIM Platform SDK and microservice framework, powering NVIDIA Inferencing Microservices (NIM). Focus on high-performance systems programming, multi-cloud abstractions, and API framework development for production-ready AI inference at scale. | Serve | 7 |
| Solutions Architect - DevOps NVIDIA is seeking a Senior Cloud Infrastructure and DevOps Solutions Architect to manage and optimize large-scale AI/HPC infrastructure, focusing on Kubernetes, automation, monitoring, and customer engagement for AI operational projects. | Serve | 7 |
| Solutions Architect, Financial Services - Data Center and Infrastructure NVIDIA is seeking a Solutions Architect with expertise in AI and data center infrastructure for the financial services sector. The role involves designing and deploying AI solutions, optimizing data center architectures, and collaborating with customers and internal teams to address complex technical challenges. Requires strong experience in accelerated computing, pre-sales, and large-scale systems management. | Serve | 7 |
| Senior Networking Solution Test Engineer Senior Networking Solution Test Engineer at NVIDIA focusing on Ethernet-based AI clusters. Responsibilities include designing test requirements, building testbeds, owning end-to-end cluster troubleshooting, debugging networking components (NCCL, RoCE/RDMA), defining tests for automation, running regression/performance/functional/scale testing, and profiling deep learning workloads. Requires 5+ years of Linux networking/system-level testing, strong debugging skills, expertise in NIC validation, and knowledge of AI networking libraries and protocols. | Serve | 7 |
| Senior Software R&D Engineer, Digital Logic Synthesis NVIDIA is seeking an EDA Software R&D Engineer to develop internal EDA tools by fusing advances in parallel computing, machine learning, and novel algorithms in C++. The role involves inventing and developing new algorithms for RTL synthesis, digital logic optimization, and physical-aware synthesis techniques, with a focus on prototyping and evaluating ML methods to guide optimization decisions and integrating successful approaches into production. | Serve | 7 |
| Senior Software Engineer, AI Frameworks Senior Software Engineer to integrate NVIDIA Grove project into AI frameworks like Dynamo, Ray, and PyTorch, focusing on production-grade software for adoption, scaling, and operation. Responsibilities include building adapters, optimizing performance for distributed training/inference, and improving observability. | Serve | 7 |
| Senior System Software Architect, AI and GPU Networking This role focuses on architecting and optimizing NVIDIA's GPU Networking offerings for AI workloads, including distributed AI, deep learning, inference, and model serving. It involves co-designing hardware features and leading the architecture and development of new technologies and runtime systems for AI data centers. | ServePost-train | 7 |
| Senior ASIC Methodology Engineer - LPU Division This role focuses on inventing and pioneering AI-driven hardware development methodologies for ASICs, aiming to improve predictability, convergence, and turnaround time in the ASIC development lifecycle. The engineer will leverage data to enable AI models and analytics, establish metrics for improvement, share best practices, and track advances in AI, EDA, and hardware design research. | Serve | 7 |
| Senior ASIC Methodology Engineer - LPU Division This role focuses on inventing and pioneering AI-driven and sophisticated automation techniques to transform the way ASICs are conceived, explored, and brought to closure, improving predictability, convergence, and turnaround time in the ASIC development lifecycle. The role involves identifying and leveraging data for AI models, establishing metrics, sharing best practices, and tracking advances in AI and hardware design research. | Serve | 7 |
| ASIC Methodology Engineer - New College Grad 2026 This role focuses on inventing and pioneering AI-driven automation techniques to transform ASIC development methodology, improving predictability, convergence, and turnaround time. The engineer will identify bottlenecks, curate data for AI models, establish metrics, share best practices, and track advances in AI and hardware design research. | Serve | 7 |
| Senior Solution Architect, AI Compute Engineer - NVIS Senior Solution Architect, AI Compute Engineer at NVIDIA, focusing on deploying, managing, and maintaining AI/HPC infrastructure in Linux environments for customers. The role involves customer interaction, system design, automation, and providing feedback to internal teams. Requires strong Linux system administration, scripting, and cluster management skills, with a preference for experience in distributed computing, high-speed networking, automation tools, and Kubernetes for AI/ML workloads. | Serve | 7 |
| Senior AI Developer Technology Engineer, Financial Sector Senior AI Developer Technology Engineer focused on optimizing AI and HPC workloads for financial markets on NVIDIA's computing platforms. This role involves research, development, performance analysis, and collaboration with the developer community and internal teams to influence hardware and software design. | Serve | 7 |
| Solutions Architect, AI and ML This role focuses on assisting customers in adopting NVIDIA's GPU hardware and software for building and deploying AI/ML and data analytics solutions on cloud platforms. The Solutions Architect acts as a technical expert, engaging with developers, researchers, and data scientists, and partnering with sales teams to drive end-to-end technology solutions. | Serve | 7 |
| Senior Networking Solution Test Engineer – AI Cluster Debugging Senior Networking Solution Test Engineer focused on debugging large-scale AI clusters, NVLink, Ethernet, and InfiniBand. The role involves designing tests, building testbeds, end-to-end troubleshooting, collaborating with development teams on networking components, and profiling deep learning workloads. | Serve | 7 |
| Devtech Compute Engineer NVIDIA is seeking a Devtech Compute Engineer to develop performance-critical code for deep learning applications, focusing on accelerating model training and inference on GPUs, particularly for recommender systems. The role involves optimizing CUDA kernels, integrating solutions into open-source libraries, and collaborating with hardware teams to define future solutions across various domains like LLM, Recsys, Robotics, and Assisted Driving. | ServeData | 7 |
| Senior System Software Architect, AI and GPU Networking This role focuses on architecting and enhancing NVIDIA's GPU Networking offerings to accelerate AI workloads, including distributed AI, deep learning, inference, and model serving. It involves co-designing hardware features and leading the architecture and design of new technologies for AI data centers. | ServePost-train | 7 |
| Senior Developer Technology Engineer This role focuses on optimizing GPU-accelerated code for training and inference performance of large-scale recommender systems. It involves designing and implementing high-performance C++/CUDA components, developing tests, and optimizing data flows between GPUs, NICs, and SSDs. The ideal candidate has experience with C++, CUDA, Python, GPU performance profiling, and ideally, building or optimizing recommender systems or production ML workloads on GPUs. | ServeShip | 7 |
| Senior Compiler Engineer - DL NVIDIA is seeking a Senior Compiler Engineer for its Deep Learning Compiler (DLC) team. This role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and hardware teams to accelerate deep learning inference. The compiler is critical for delivering leading inference performance, fast build times, and reduced memory footprints across various platforms. | Serve | 7 |
| Senior Solutions Architect, HPC and AI Senior Solutions Architect focused on deploying, debugging, and optimizing large-scale AI training and inference workloads on GPU clusters. The role involves collaborating with internal teams and external customers to solve complex HPC and AI challenges, focusing on performance, stability, and scaling of AI workloads. | ServeData | 7 |
| Neural Graphics Engineer NVIDIA is seeking a Neural Graphics Engineer to work on technologies at the intersection of AI and real-time rendering. The role involves implementing and optimizing neural graphics techniques, prototyping neural rendering and generative 3D approaches, and contributing to the graphics software stack. Experience with C++, Python, computer graphics, and machine learning is required, with a preference for hands-on experience in neural rendering or generative AI for 3D content. | ServeData | 7 |
| AI Benchmarking and Telemetry Engineer - NVIS NVIDIA is seeking an AI Benchmarking and Telemetry Engineer to develop and execute benchmarking approaches for large-scale HPC and AI clusters, and build telemetry frameworks to capture system performance data from host level through network and data center infrastructure. The role involves collaborating with engineering teams, customers, and partners to ensure platform performance and reliability, and maintaining knowledge of industry-standard benchmarks. | Serve | 7 |
| Senior Site Reliability Engineer - Datacenter Automation NVIDIA is seeking an experienced Senior Site Reliability Engineer to scale its AI Infrastructure, focusing on production systems for large GPU clusters used in AI workloads. The role involves implementing monitoring, health management, and automation for GPU asset provisioning, configuration, and lifecycle management across cloud providers, ensuring reliability, availability, and scalability. The engineer will collaborate with teams to maintain reliable and performant AI clusters, evaluate system failures, and improve services. | Serve | 7 |
| HPC and AI Cluster Engineer NVIDIA is seeking an HPC and AI Cluster Engineer to manage and maintain large-scale HPC/AI clusters, including Linux job scheduling, CI/CD pipelines, and troubleshooting from bare metal to application level. The role involves supporting R&D activities and POCs, working with cutting-edge hardware and software, and collaborating with researchers and customers to develop solutions. | Serve | 7 |
| Platform Architecture Engineer, GeForce NOW This role focuses on architecting and optimizing cloud infrastructure for AI workloads, specifically for the GeForce NOW service. The engineer will perform deep performance and power analysis of GPU/CPU microarchitecture for AI inference, deploy and optimize AI/gaming kernels, and build models to guide platform decisions balancing performance, power, and cost. The role requires strong programming skills and experience with AI models and performance analysis methodologies. | Serve | 7 |
| GPU Computing Engineer - Autonomous Driving NVIDIA is seeking a GPU Computing Engineer in Shanghai to analyze Deep Learning models and investigate TensorRT stability and performance issues. The role involves working with a global team on CUDA and TensorRT development, extracting feature requirements, and generating documentation. Requires strong C/C++/Python skills, knowledge of inference networks, and experience with deep learning frameworks like PyTorch. | Serve | 7 |
| Senior Software Engineer, AI Resiliency Senior Software Engineer to lead the development of AI software resiliency for large-scale AI supercomputers (100,000+ GPUs), focusing on features like fast checkpoint-recovery, error detection/isolation, and straggler/hang detection to minimize cluster downtime. The role involves hands-on C++ and Python coding, optimization for AI workloads, fault tolerance, debugging, and collaboration with AI researchers and hardware/software teams to integrate resiliency into AI frameworks. | Serve | 7 |
| Senior System Software Engineer - Video Senior System Software Engineer role focused on building and optimizing system software for NVIDIA's video subsystem, involving AI/ML and computer vision algorithms for video compression and multimedia processing on Tegra Application Processors and GPUs. Requires strong C/C++ and Python skills, experience with video compression standards, and a track record in pre/post-processing algorithms. | Serve | 7 |
| Senior AI Networking System Architect NVIDIA is seeking a Senior AI Networking System Architect to define and develop the architecture for next-generation NVL systems that power large-scale high-performance computing clusters for AI research and various industries. The role involves end-to-end system architecture, research across algorithms, software, firmware, and hardware, and developing simulation models for performance testing. | Serve | 7 |
| Senior Deep Learning Kernel Software Performance Architect Senior Kernel Performance Architect for Deep Learning Software at NVIDIA, focusing on crafting and prototyping GPU-accelerated system architectures to optimize deep learning and data analytics workloads. Requires expertise in kernel performance, math libraries, GPU computing, and parallel programming. | Serve | 7 |
| Senior Libraries Engineer – AI and HPC Senior Libraries Engineer at NVIDIA focused on building and optimizing GPU/CPU accelerated data processing software libraries for AI, data analytics, computer vision, and scientific simulations. The role involves developing scalable library software, performance tuning, optimization, and providing technical leadership. | Serve | 7 |
| Senior System Software Engineer - AI Performance and Efficiency Tools Senior System Software Engineer role focused on developing and improving tools for AI workload performance and efficiency on GPU clusters, supporting AI researchers and SW/HW teams. Involves building profiling, debugging, and benchmarking tools, and partnering with hardware architects. | ServeData | 7 |
| Deep Learning Performance Software Engineer NVIDIA is seeking a Deep Learning Performance Software Engineer to develop GPU-accelerated deep learning software, focusing on optimizing deep learning kernels and end-to-end performance through tile-based GPU programming. The role requires strong C/C++ skills, GPU programming experience (CUDA or OpenCL), and performance modeling/optimization knowledge. | Serve | 7 |
| Senior VLSI CAD and AI Automation Engineer Senior Engineer to develop and integrate AI/ML solutions for VLSI design automation, focusing on improving workflows, deploying algorithms, and maintaining automation infrastructure. Requires strong Python, AI/ML framework experience, and knowledge of VLSI physical design and EDA tools. | Serve | 7 |
| Senior System Software Engineer - Computer Vision Algorithms and SDK Senior System Software Engineer focused on developing and optimizing computer vision, signal processing, and machine learning algorithms for specialized DSP hardware (PVA engine) and enhancing the associated SDK. The role involves working with internal and external customers to enable efficient algorithm development and optimization on the hardware. | Serve | 7 |
| Senior System Software Engineer - AI Data Platform - Inference Factory Optimization Senior Software Engineer focused on building and optimizing infrastructure for automating the deployment and performance tuning of NVIDIA's AI software offerings, impacting inference applications across various hardware. | Serve | 7 |