Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, LLM observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles, a 50% decrease versus the prior 30 days (218 → 110).
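The 30-day change can be reproduced from the two counts quoted above. The following is a minimal sketch, not the index's actual method: the function name `percent_change` is hypothetical, and it assumes the figure is a plain relative change rounded to the nearest whole percent.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the relative change from `previous` to `current`, in percent."""
    if previous == 0:
        # A relative change is undefined when the baseline is zero.
        raise ValueError("previous count must be non-zero")
    return (current - previous) / previous * 100.0


# Counts taken from the text: 218 roles in the prior window, 110 in the latest.
change = percent_change(218, 110)
print(f"{change:.1f}%")     # -49.5%
print(f"{round(change)}%")  # -50%
```

Under that assumption the unrounded change is about -49.5%, which rounds to the 50% decrease reported.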
| Title | Stage | AI score |
|---|---|---|
| Senior Machine Learning Applications and Compiler Engineer, LPX NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to map neural network workloads onto future NVIDIA platforms. Responsibilities include building and maintaining high-performance runtime and compiler components for end-to-end inference optimization, defining workload mappings, extending the SW ecosystem, benchmarking, profiling, and collaborating with hardware architects. The role involves prototyping new compilation and runtime techniques and publishing technical work. | Serve | 7 |
| Senior Math Libraries Engineer – AI and HPC Senior engineer to join NVIDIA's Math Libraries team, focusing on kernel generation for AI and HPC, specifically matrix operations, JITing, and fusions. The role involves designing and implementing high-performance numerical dense linear algebra software on GPUs, providing technical leadership, and collaborating with product management. | Serve | 7 |
| Senior Manager, GPU Cloud Infrastructure - GeForce NOW Senior Manager to lead the design, scaling, and operations of high-performance networking for GPU-based cloud infrastructure, critical for cloud gaming, AI/ML training, and inference platforms. | Serve | 7 |
| Senior Deep Learning Test Development Engineer, SDET Senior Deep Learning Test Development Engineer (SDET) at NVIDIA's AI SWQA team, responsible for validating the robustness and performance of NVIDIA's AI software and GPU Infrastructure across various AI scenarios. The role involves test planning, design, execution, automation, and bug management, with a focus on improving workflow processes and efficiency. Experience with LLM inference frameworks and AI development tools is required. | Serve | 7 |
| Senior Staff AI Platform Engineer Senior Staff AI Platform Engineer at NVIDIA responsible for building, supporting, and maintaining AI-native infrastructure for enterprise products. This role involves architecting and scaling LLM/ML infrastructure, designing observability for AI models, developing automation, and troubleshooting complex distributed systems. The engineer will also drive AI-assisted engineering practices and partner with product teams to deliver scalable AI solutions. | Serve, Agent | 7 |
| Senior Software Engineer, Deep Learning Inference - Automotive Safety Senior Software Engineer focused on developing high-performance deep learning inference software for safety-critical automotive applications using C++. The role involves integrating hardware functionalities into TensorRT, optimizing performance, and ensuring rigorous safety validation and documentation. | Serve | 7 |
| Senior Software Engineer, Deep Learning Inference - TensorRT NVIDIA is seeking a Senior Software Engineer to develop and scale a state-of-the-art inference framework for accelerating Deep Learning models, particularly LLMs, on NVIDIA GPUs using TensorRT. The role involves crafting inferencing software, developing components of TensorRT, and optimizing the deployment of trained models using C++ and Python. | Serve | 7 |
| Senior MLOps Engineer, GenAI Framework This role focuses on building and maintaining CI/CD pipelines and release processes for NVIDIA's GenAI frameworks (Megatron-LM, NeMo). It involves implementing scalable DevOps solutions, managing infrastructure (Kubernetes, Docker, Slurm), automating tasks for research and development cycles, and developing quality control measures. The goal is to enable efficient work for GenAI software engineers, DL algorithm engineers, and research scientists, optimizing performance and ensuring high-quality software delivery. | Serve | 7 |
| System Software Engineer, Python and C/C++ - Deep Learning System Software Engineer role at NVIDIA focused on deep learning, data analytics, and machine learning. The role involves researching, prototyping, developing, and optimizing solutions, tools, and libraries. It also includes analyzing and improving deep learning libraries and frameworks, defining APIs, and performance tuning. The position requires strong Python and C/C++ programming skills, experience in complex system design, and knowledge of algorithms and data structures. The role is primarily focused on the engineering and optimization of AI infrastructure and tools. | Serve | 7 |
| Senior Systems Performance Engineer Senior Systems Performance Engineer at NVIDIA focused on validating and optimizing GPU accelerated computing products, specifically for Deep Learning/AI applications. The role involves system architecture, performance modeling, and developing stress/performance testing strategies for ML/LLM workloads. | Serve | 7 |
| Senior Software Engineer - NIM Platform SDK and Framework Senior Software Engineer to own and evolve the core NIM Platform SDK and microservice framework, powering NVIDIA Inferencing Microservices (NIM). Focus on high-performance systems programming, multi-cloud abstractions, and API framework development for production-ready AI inference at scale. | Serve | 7 |
| Senior Networking Solution Test Engineer Senior Networking Solution Test Engineer at NVIDIA focusing on Ethernet-based AI clusters. Responsibilities include designing test requirements, building testbeds, owning end-to-end cluster troubleshooting, debugging networking components (NCCL, RoCE/RDMA), defining tests for automation, running regression/performance/functional/scale testing, and profiling deep learning workloads. Requires 5+ years of Linux networking/system-level testing, strong debugging skills, expertise in NIC validation, and knowledge of AI networking libraries and protocols. | Serve | 7 |
| Senior Software R&D Engineer, Digital Logic Synthesis NVIDIA is seeking an EDA Software R&D Engineer to develop internal EDA tools by fusing advances in parallel computing, machine learning, and novel algorithms in C++. The role involves inventing and developing new algorithms for RTL synthesis, digital logic optimization, and physical-aware synthesis techniques, with a focus on prototyping and evaluating ML methods to guide optimization decisions and integrating successful approaches into production. | Serve | 7 |
| Senior System Software Architect, AI and GPU Networking This role focuses on architecting and optimizing NVIDIA's GPU Networking offerings for AI workloads, including distributed AI, deep learning, inference, and model serving. It involves co-designing hardware features and leading the architecture and development of new technologies and runtime systems for AI data centers. | Serve, Post-train | 7 |
| Senior ASIC Methodology Engineer - LPU Division This role focuses on inventing and pioneering AI-driven hardware development methodologies for ASICs, aiming to improve predictability, convergence, and turnaround time in the ASIC development lifecycle. The engineer will leverage data to enable AI models and analytics, establish metrics for improvement, share best practices, and track advances in AI, EDA, and hardware design research. | Serve | 7 |
| Senior ASIC Methodology Engineer - LPU Division This role focuses on inventing and pioneering AI-driven and sophisticated automation techniques to transform the way ASICs are conceived, explored, and brought to closure, improving predictability, convergence, and turnaround time in the ASIC development lifecycle. The role involves identifying and leveraging data for AI models, establishing metrics, sharing best practices, and tracking advances in AI and hardware design research. | Serve | 7 |
| Senior AI Developer Technology Engineer, Financial Sector Senior AI Developer Technology Engineer focused on optimizing AI and HPC workloads for financial markets on NVIDIA's computing platforms. This role involves research, development, performance analysis, and collaboration with the developer community and internal teams to influence hardware and software design. | Serve | 7 |
| Senior Networking Solution Test Engineer – AI Cluster Debugging Senior Networking Solution Test Engineer focused on debugging large-scale AI clusters, NVLink, Ethernet, and InfiniBand. The role involves designing tests, building testbeds, end-to-end troubleshooting, collaborating with development teams on networking components, and profiling deep learning workloads. | Serve | 7 |
| Senior System Software Architect, AI and GPU Networking This role focuses on architecting and enhancing NVIDIA's GPU Networking offerings to accelerate AI workloads, including distributed AI, deep learning, inference, and model serving. It involves co-designing hardware features and leading the architecture and design of new technologies for AI data centers. | Serve, Post-train | 7 |
| Senior Developer Technology Engineer This role focuses on optimizing GPU-accelerated code for training and inference performance of large-scale recommender systems. It involves designing and implementing high-performance C++/CUDA components, developing tests, and optimizing data flows between GPUs, NICs, and SSDs. The ideal candidate has experience with C++, CUDA, Python, GPU performance profiling, and ideally, building or optimizing recommender systems or production ML workloads on GPUs. | Serve, Ship | 7 |
| Neural Graphics Engineer NVIDIA is seeking a Neural Graphics Engineer to work on technologies at the intersection of AI and real-time rendering. The role involves implementing and optimizing neural graphics techniques, prototyping neural rendering and generative 3D approaches, and contributing to the graphics software stack. Experience with C++, Python, computer graphics, and machine learning is required, with a preference for hands-on experience in neural rendering or generative AI for 3D content. | Serve, Data | 7 |
| HPC and AI Cluster Engineer NVIDIA is seeking an HPC and AI Cluster Engineer to manage and maintain large-scale HPC/AI clusters, including Linux job scheduling, CI/CD pipelines, and troubleshooting from bare metal to application level. The role involves supporting R&D activities and POCs, working with cutting-edge hardware and software, and collaborating with researchers and customers to develop solutions. | Serve | 7 |
| Platform Architecture Engineer, GeForce NOW This role focuses on architecting and optimizing cloud infrastructure for AI workloads, specifically for the GeForce NOW service. The engineer will perform deep performance and power analysis of GPU/CPU microarchitecture for AI inference, deploy and optimize AI/gaming kernels, and build models to guide platform decisions balancing performance, power, and cost. The role requires strong programming skills and experience with AI models and performance analysis methodologies. | Serve | 7 |
| GPU Computing Engineer - Autonomous Driving NVIDIA is seeking a GPU Computing Engineer in Shanghai to analyze Deep Learning models and investigate TensorRT stability and performance issues. The role involves working with a global team on CUDA and TensorRT development, extracting feature requirements, and generating documentation. Requires strong C/C++/Python skills, knowledge of inference networks, and experience with deep learning frameworks like PyTorch. | Serve | 7 |
| Senior System Software Engineer - Video Senior System Software Engineer role focused on building and optimizing system software for NVIDIA's video subsystem, involving AI/ML and computer vision algorithms for video compression and multimedia processing on Tegra Application Processors and GPUs. Requires strong C/C++ and Python skills, experience with video compression standards, and a track record in pre/post-processing algorithms. | Serve | 7 |
| Senior AI Networking System Architect NVIDIA is seeking a Senior AI Networking System Architect to define and develop the architecture for next-generation NVL systems that power large-scale high-performance computing clusters for AI research and various industries. The role involves end-to-end system architecture, research across algorithms, software, firmware, and hardware, and developing simulation models for performance testing. | Serve | 7 |
| Senior Deep Learning Kernel Software Performance Architect Senior Kernel Performance Architect for Deep Learning Software at NVIDIA, focusing on crafting and prototyping GPU-accelerated system architectures to optimize deep learning and data analytics workloads. Requires expertise in kernel performance, math libraries, GPU computing, and parallel programming. | Serve | 7 |
| Senior Libraries Engineer – AI and HPC Senior Libraries Engineer at NVIDIA focused on building and optimizing GPU/CPU accelerated data processing software libraries for AI, data analytics, computer vision, and scientific simulations. The role involves developing scalable library software, performance tuning, optimization, and providing technical leadership. | Serve | 7 |
| Deep Learning Performance Software Engineer NVIDIA is seeking a Deep Learning Performance Software Engineer to develop GPU-accelerated deep learning software, focusing on optimizing deep learning kernels and end-to-end performance through tile-based GPU programming. The role requires strong C/C++ skills, GPU programming experience (CUDA or OpenCL), and performance modeling/optimization knowledge. | Serve | 7 |
| Senior VLSI CAD and AI Automation Engineer Senior Engineer to develop and integrate AI/ML solutions for VLSI design automation, focusing on improving workflows, deploying algorithms, and maintaining automation infrastructure. Requires strong Python, AI/ML framework experience, and knowledge of VLSI physical design and EDA tools. | Serve | 7 |
| Senior System Software Engineer - Computer Vision Algorithms and SDK Senior System Software Engineer focused on developing and optimizing computer vision, signal processing, and machine learning algorithms for specialized DSP hardware (PVA engine) and enhancing the associated SDK. The role involves working with internal and external customers to enable efficient algorithm development and optimization on the hardware. | Serve | 7 |
| Senior System Software Engineer - AI Data Platform - Inference Factory Optimization Senior Software Engineer focused on building and optimizing infrastructure for automating the deployment and performance tuning of NVIDIA's AI software offerings, impacting inference applications across various hardware. | Serve | 7 |
| Senior Software Advanced Developer Develop and prototype advancements in distributed training and inference using NVIDIA's Spectrum-X AI fabric, focusing on improving AI app-networking connections through communication refinement, congestion control, NIC firmware coding, and switch SDK features to enhance AI factory efficiency and large-scale AI system development, scaling, and speed. | Serve, Pretrain | 7 |
| Senior DGX Cloud AI Infrastructure Software Engineer Senior Software Engineer role focused on building and integrating AI infrastructure for DGX Cloud, enabling developers to access GPU-optimized virtual machines. Responsibilities include crafting IaaS API integrations, developing a two-sided marketplace, and improving testing and observability for scalable, fault-tolerant solutions. | Serve | 7 |
| Software Architect, Advanced Development Research role focused on the intersection of Networking, Security, and Communications, with a specific emphasis on applying AI to these domains. The role involves technical leadership, architecture design, SDK development for new hardware, and implementing services. A key aspect is working with AI-powered networking machines. | Serve | 7 |
| Senior Software Architect - Deep Learning and HPC Communications Senior Software Architect role at NVIDIA focusing on designing and implementing next-generation data center platforms and scalable communication software for AI and HPC workloads. The role involves investigating performance bottlenecks, exploring innovative HW/SW solutions, building proofs-of-concept, and using simulation to evaluate large GPU cluster performance. | Serve | 7 |
| Senior Storage Production Engineer - DGX Cloud NVIDIA is seeking a Senior Storage Production Engineer for their DGX Cloud service. This role focuses on designing, building, and maintaining large-scale, high-performance distributed storage systems that support AI/ML and HPC workloads. Responsibilities include ensuring reliability, scalability, optimizing data access, and automating storage operations using AI/ML-driven techniques. The engineer will work on monitoring, alerting, performance tuning, and incident response for storage infrastructure. | Serve | 5 |
| Developer Technology Engineer – AI NVIDIA Developer Technology Engineer focused on optimizing core parallel algorithms and data structures for GPUs, collaborating with application developers and internal NVIDIA teams to improve application performance and developer efficiency. Requires strong programming skills in C/C++/Python, parallel programming experience (CUDA), and mathematical fundamentals. | Serve | 5 |
| Tegra Manufacturing Test Engineer NVIDIA is seeking a Manufacturing Test Engineer to automate product definitions, data collection, test case execution, and results analysis. The role involves driving Tegra diagnosis, analyzing test issues, qualifying equipment, and developing AI for production automation. The engineer will also assist with debug tools and follow up on test setup issues. | Serve | 5 |
| Senior System Software Engineer - Windows DevOps and Test Labs NVIDIA is seeking a Senior System Software Engineer to build and maintain infrastructure for deploying AI applications and models on Windows. The role involves developing and sustaining infrastructure for AI workloads, scoping requirements for deploying AI applications, managing AI model repositories, analyzing data for insights, and collaborating with developers to debug issues. The engineer will also build CI/CD workflows and understand existing infrastructure. | Serve | 5 |
| Senior DFX Power Methodology Engineer This role focuses on Design-for-X (DFX) for power, thermal, and voltage noise methodology in semiconductor chip design, specifically for datacenter GPUs. It involves innovating low power and thermal solutions for manufacturing tests, analyzing post-silicon data for power, and developing/deploying DFT methodologies using Applied ML & Gen AI solutions. The role also includes mentoring junior engineers and requires strong programming skills for AI coding harnesses. | Serve | 5 |
| Compute Performance Developer Technology Engineer Software developer or computer scientist to join Compute Developer Technology team focusing on research and development of techniques to accelerate leading applications in scientific computing, computational engineering, data analytics, and artificial intelligence. Responsibilities include in-depth analysis and optimization for performance on CPU, GPU, and network architectures, guiding key application developers, developing reference codes or libraries, and creating/optimizing core parallel algorithms and data structures using the NVIDIA platform. The role also involves influencing next-generation architectures and software stack design. | Serve | 5 |
| Technical Marketing Engineer NVIDIA is seeking a Technical Marketing Engineer to build demos, develop media pipelines, and evangelize AI for live media. The role involves architecting and building software demos, developing end-to-end media pipelines using NVIDIA's stack, and traveling to industry tradeshows for technical evangelism. The candidate will also provide feedback to product management and collaborate with engineering teams. | Serve | 5 |
| Senior Production Engineer - DGX Cloud NVIDIA is seeking experienced Senior Production Engineers to scale its AI Infrastructure, focusing on production systems for large GPU clusters used in AI workloads. The role involves implementing monitoring, health management, and ensuring reliability and scalability of GPU assets, working with various data streams and cross-functional teams. | Serve | 5 |
| Senior Research-Ops & DevOps Engineer NVIDIA is seeking a Senior Software Engineer for their Video/Multimedia A&A team to lead infrastructure and operations. This role involves setting up compute resources (on-prem and cloud), developing distributed pipelines for large-scale regressions and experiments across hardware simulations and ML workloads, and maintaining CI/CD and development environments. The goal is to transform research workflows into reliable, automated systems. | Serve | 5 |
| Senior Physical Design Methodology Engineer, PPA Fusion Compiler Senior Physical Design Methodology Engineer focused on developing and implementing ML-based solutions to improve Power, Performance, and Area (PPA) for graphics processors and SOCs. This role involves developing efficient physical design methodologies, formulating ML-based solutions, and participating in the full chip design flow. | Serve | 5 |
| Senior HPC Architect, Automation and At-Scale Deployment The Senior HPC Architect will support the deployment and bring-up of large-scale GPU compute clusters, enabling AI and GPU computing breakthroughs. This role involves providing engineering solutions for GPU Computing products and software stacks, acting as an internal reference for system administration and large-scale GPU-accelerated systems, and working with scientific researchers and developers to craft workflows and solutions. | Serve | 5 |
| Senior System Software Engineer - Neural Graphics SDKs Senior System Software Engineer to develop and maintain NVIDIA's software ecosystem for neural graphics, including key OSS platforms like GSplat. The role involves implementing, validating, releasing, and maintaining SDKs, APIs, and libraries for Neural Reconstruction, influencing software architecture, validation strategy, and technical roadmaps. Experience with Python, C++, distributed systems, and GPU acceleration is required. | Serve | 5 |
| Senior Software Developer NVIDIA is seeking a Senior Software Developer to join their AI networking acceleration team. The role involves developing a groundbreaking open-source library using hardware offloads, GPU Kernels, and RDMA network cards, focusing on a performance-oriented, low-level inference framework. The position requires strong C++/C/Python development skills, Linux environment experience, and deep knowledge of TCP/IP networking. Experience with Linux internals, low-level optimizations, CUDA kernels, ML frameworks, LLMs, parallel programming, HPC, or RDMA is advantageous. | Serve | 5 |
| Developer Technology Engineer NVIDIA is seeking a Senior Developer Technology Engineer focused on game consoles to optimize applications for NVIDIA's GPU architectures. This role involves working with developers on software, engines, and libraries to improve performance and efficiency, with a focus on AI inferencing workloads and cutting-edge rendering techniques. | Serve | 5 |