Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, LLM observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a 50% decrease versus the prior 30 days (218 → 110).
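The period-over-period figure above can be reproduced with a short calculation. This is a minimal sketch, not the tracker's actual code: the function name `percent_change` is hypothetical, and it assumes the displayed value is the relative change rounded to the nearest whole percent.

```python
def percent_change(previous: int, current: int) -> float:
    """Relative change from `previous` to `current`, in percent."""
    if previous == 0:
        # A relative change is undefined when the baseline is zero.
        raise ValueError("previous count must be non-zero")
    return (current - previous) / previous * 100.0


# Counts taken from the text: 218 postings in the prior 30 days, 110 in the latest.
change = percent_change(218, 110)
print(f"{change:.1f}%")  # -49.5%
print(round(change))     # -50
```

The unrounded value is about -49.5%, which rounds to the -50% shown in the summary.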
| Title | Stage | AI score |
|---|---|---|
| Senior Design Automation Engineer, Applied AI NVIDIA is seeking an Applied AI Engineer to lead end-to-end solution development for timing and constraint analysis workflows in VLSI/ASIC design. The role involves data generation, model training, orchestration, and building autonomous agents that interact with timing tools. The engineer will develop AI-driven solutions, integrate data sources, implement scalable orchestration, and build interpretable AI pipelines using GNNs, LLMs, and reasoning engines. Experience with Python, PyTorch/TensorFlow, graph/agentic AI frameworks, and EDA tools is required. | Agent, Data | 8 |
| Senior AI and MLOps Engineer - Security and Networking Research Senior AI/MLOps Engineer focused on building and maintaining infrastructure, tools, and processes for the AI lifecycle in a production environment, specifically for security and networking AI models and agents. The role involves optimizing models, deploying agentic systems and LLMs, designing training/inference pipelines, and collaborating with various engineering teams. | Agent, Serve | 8 |
| Senior Manager, System Software Engineering - Metropolis Accelerated and Inferencing Software Senior Manager for System Software Engineering at NVIDIA, focusing on Metropolis Accelerated and Inferencing Software. The role involves leading engineering teams, driving strategic implementations of inference solutions (TensorRT, VLLM) for edge and enterprise devices, performance benchmarking, and technical leadership in deep learning. Requires extensive experience in machine learning/deep learning, embedded software, GPU/CPU optimization, and multimodal AI systems. | Serve, Agent | 8 |
| Senior Product Architect, Storage NVIDIA is seeking a Senior Product Architect to design and validate AI storage infrastructure, focusing on optimizing systems for large-scale foundation model training, disaggregated inference, and agentic AI pipelines. The role involves architecting end-to-end reference architectures, defining system-level architectures, and collaborating with partners and customers to deliver proof-of-concepts. | Agent, Serve | 8 |
| Solutions Architect – AI Factory Solutions Architect role focused on designing, building, and operationalizing large-scale AI factories and GenAI/Agentic AI solutions for enterprise customers, leveraging NVIDIA's technology stack. This involves hands-on work with compute, networking, software, and cluster management tools. | Agent | 8 |
| Manager, AI and Software Manager for an AI team at NVIDIA, focusing on developing and leading the implementation of cutting-edge AI applications including RAG, LLMs, AI Agents, recommendation engines, and classical AI models. The role involves managing a team of 6-8 engineers, providing technical leadership, and collaborating with cross-functional teams to identify and implement AI opportunities. | Agent, Data | 8 |
| Senior Software Engineer, Video Analytics Senior Software Engineer role focused on building large-scale distributed Vision AI platforms for video analytics using NVIDIA Metropolis. The role involves designing and developing functionalities for video processing, integrating VLMs, CV models, and LLMs, and optimizing performance on NVIDIA hardware. Requires strong software development experience with ML systems, C++, Python, and GPU acceleration. | Ship, Serve | 8 |
| Senior Solutions Architect - Physical AI NVIDIA is seeking a Senior Solutions Architect for Physical AI to support customers building robotics and Physical AI solutions on NVIDIA’s platforms. This role involves guiding architecture, prototyping, and troubleshooting across robotics deployments from simulation to training to deployment, focusing on applied AI (computer vision, GenAI) for robotics. | Agent, Data | 8 |
| Senior GPU System Architect NVIDIA is seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves architecting system topologies, defining interconnects (NVLink, Ethernet), collaborating on RDMA, using system models for analysis, and co-designing hardware-software stacks for efficient AI workload deployment. | Serve | 8 |
| Senior System Software Engineer, Speech AI Senior System Software Engineer role focused on speech AI technologies (ASR, TTS, ALM, S2S) for enterprise and developer customers. Responsibilities include implementing, troubleshooting, and optimizing GPU-accelerated speech systems in production, transitioning models from research to production, optimizing inference performance, developing core speech services using C++ and Python with CUDA, and contributing to client SDKs. Requires strong programming skills, experience with inference pipelines, understanding of modern model architectures, and knowledge of real-time streaming audio and low-latency systems. Experience with speech model fine-tuning is required. | Serve, Post-train | 8 |
| Senior System Software Engineer, Speech AI NVIDIA is seeking an experienced Software Engineer to work on their GPU-accelerated Speech AI platform, focusing on building and optimizing core speech recognition (ASR), text-to-speech (TTS), and S2S services for real-time conversational AI applications. The role involves developing C++ & Python backend implementations, optimizing inference performance, adding new features, contributing to client libraries, and performance analysis of complex systems. | Serve, Post-train | 8 |
| NIM Solutions Architect This role focuses on deploying and optimizing large models using NVIDIA's Inference Microservice (NIM) and related tools. The Solutions Architect will package optimized models (LLM, VLM, etc.) into containers for deployment, refine NIM tools for the community, and design/implement agentic AI solutions for customer scenarios. The role requires strong programming skills, experience with inference engines, and MLOps practices, with a focus on performance engineering and model optimization. | Serve, Agent | 8 |
| Solution Architecture Intern, AI in Industry - 2026 NVIDIA is seeking an AI in Industry Solution Architecture Intern to help optimize large models, develop AI workflows, and deliver advanced AI solutions. The intern will provide technical support, design and implement optimizations for AI models, and set up model training or inference to identify and resolve bottlenecks. This role involves working with various AI models and inference frameworks, conducting research, and collaborating with global teams. | Serve, Post-train | 8 |
| Senior Software Engineer – ADAS Senior Software Engineer to develop production ADAS and autonomous driving functions in C++ and Python, integrating deep learning models into real-time inference pipelines on NVIDIA GPUs for safety-critical automotive applications. | Serve, Post-train | 8 |
| Performance Engineer Intern, Deep Learning and HPC - 2026 NVIDIA is seeking a Performance Engineer Intern to support performance testing of datacenter products and applications, focusing on AI workloads like LLM training and inference, as well as HPC. The role involves benchmarking, profiling, analyzing performance, developing automation scripts, and collaborating with internal teams. The intern will aggregate and report testing data for sales, marketing, and engineering teams, and assist in developing tools and processes for automated testing. | Serve, Post-train | 8 |
| Senior Software Engineer, Robotics - Isaac Lab NVIDIA is seeking a Senior Software Engineer for their Isaac Lab team to develop features for a robot learning platform, focusing on reinforcement learning, multi-agent learning, and sim-to-real deployment. The role involves automating workflows, scaling in the cloud, and collaborating with research teams on next-generation robots. | Agent, Data | 8 |
| Software Engineering Manager, Robotics NVIDIA is seeking a Robotics Software Engineering Manager to lead a team focused on sim-first development, real-world deployment, and continuous learning for physical AI robots, such as Humanoid Robots. The role involves hands-on development, implementation, and deployment of real-time software stacks, fostering innovation, and collaborating with cross-functional teams. | Ship, Agent | 8 |
| Senior Solutions Architect, AI Factory NVIDIA is seeking a Senior Solutions Architect with expertise in AI Supercomputing to support academic and commercial groups using NVIDIA products for deep learning, data analytics, and scientific simulation. The role involves understanding customer needs, developing solutions, demonstrating workflows, and communicating requirements to NVIDIA Engineering. Requires 3+ years of Deep Learning research experience, experience with LLM training and adaptation, and familiarity with DL frameworks and Generative AI. | Post-train | 8 |
| Developer Technology Engineer - AI NVIDIA is seeking an AI Developer Technology Engineer to collaborate with developers, optimize AI workloads on GPUs, research innovative AI techniques, and ensure peak performance on GPU architectures. The role involves developing and optimizing parallel algorithms and data structures, influencing next-gen architectures, and requires proficiency in C++, AI algorithms, and specific domains like multi-modal models or RL for LLMs. | Serve, Post-train | 8 |
| Senior Software Engineer, Robotics - Isaac Lab Senior Software Engineer to join the Isaac Lab team, focusing on developing a platform for robot learning, including perception-in-the-loop reinforcement learning, multi-agent/multi-task learning, and VLA & RL integration. The role involves sim-to-real efforts, defining training workflows, and collaborating with research teams to advance humanoid robots. | Ship, Data | 8 |
| Senior HPC Performance Engineer - AI for Science at Scale Senior HPC Performance Engineer focused on optimizing large-scale, CUDA-backed ML training frameworks for AI in Science applications, particularly in digital biology and chemistry. The role involves kernel design, GPU porting, distributed learning, and algorithmic improvements within HPC software stacks. | Serve, Post-train | 8 |
| Software Engineer, Metropolis Vision AI Software Engineer for NVIDIA's Metropolis Vision AI team, focusing on building and optimizing large-scale distributed Vision AI platforms for real-time and streaming scenarios. The role involves implementing high-performance pipelines, developing distributed services for video/image/3D data processing, enhancing multi-modal perception, using simulation/synthetic data, and profiling GPU-accelerated inference. Requires strong C++/Python, Linux, computer vision, deep learning, and distributed systems experience, with practical experience in PyTorch for training and deployment. | Serve, Data | 8 |
| Deeplearning Software Engineer -- Neural 3D reconstruction Software Engineer role focused on deep learning for neural 3D reconstruction, involving research, design, implementation, optimization, and deployment of DNN models. The role requires C++, PyTorch, and ML/DL techniques, with a preference for experience in DNN development and network acceleration. | Serve, Post-train | 8 |
| Senior AI Application Developer - GPU and SOC Architecture Modeling Senior AI Application Developer role focused on developing and deploying scalable GenAI applications to accelerate GPU/SOC architecture modeling. The role involves integrating LLMs into existing workflows, collaborating with hardware architects and infrastructure engineers, and researching emerging AI technologies. Requires proficiency in C++, Python, ML frameworks, and hands-on experience with LLMs and multimodal models. | Agent | 8 |
| Architect, AI Solutions Engineering NVIDIA is looking for an AI Solutions Architect to scale internal AI platforms and solutions for thousands of developers. The role involves identifying AI opportunities, setting system outcomes, optimizing performance and cost, and collaborating with AI product vendors. Requires strong experience in building large-scale distributed systems and hands-on experience with LLMs, RAG, fine-tuning, and agentic orchestration. | Agent, Serve | 8 |
| Senior High-Performance System Architect NVIDIA is seeking a Senior High-Performance System Architect to define and research NVL system architecture for large-scale, high-performance computing clusters used to train advanced AI models. The role involves working across algorithms, software, firmware, and hardware, collaborating with cross-functional teams, and analyzing simulation results. | Serve, Pretrain | 8 |
| Manager, Deep Learning Algorithms Manager for Deep Learning Algorithms at NVIDIA, focusing on productizing DL models, optimizing inference, and leading engineering teams. The role involves working with LLMs/VLMs, inference optimization, and collaborating across NVIDIA to develop state-of-the-art algorithms for GPU-accelerated platforms. | Serve | 8 |
| Senior Deep Learning Engineer - AI for Wireless Systems NVIDIA is seeking a Senior Deep Learning Engineer to develop AI-native wireless networks, integrating deep learning into signal processing and radio access technologies. The role involves designing, prototyping, implementing, training, and optimizing deep learning models for real-time inference and deployment on GPU platforms, collaborating with researchers and system engineers. | Serve, Post-train | 8 |
| Engineering Manager - AI for RAN and 6G Wireless Systems NVIDIA is seeking an Engineering Manager to lead a team developing AI/ML models for 6G wireless networks. The role involves guiding model development, training, evaluation, and deployment, with a focus on integrating deep learning into signal processing and radio access technologies. Experience with Python, PyTorch/TensorFlow, and leading engineering teams is required. | Serve, Post-train | 8 |
| System Software Engineer - Deep Learning System Software Engineer at NVIDIA focused on accelerating deep learning inference for autonomous driving systems using NVIDIA GPUs and DL accelerators. The role involves developing SDKs/frameworks for LLMs and state-of-the-art models, benchmarking, and optimizing for latency, accuracy, and power consumption. Requires experience with deep learning frameworks, DNN optimization, and C/C++. | Serve, Post-train | 8 |
| Senior AI Infrastructure Software Engineer Senior AI Infrastructure Software Engineer at NVIDIA, focusing on building and scaling infrastructure for AI agents and applications in chip design. The role involves designing, developing, and improving scalable infrastructure, driving performance and reliability improvements, and collaborating with research and hardware teams. Requires expertise in Python, distributed systems, microservices, and integrating LLMs/agent frameworks. | Agent, Serve | 8 |
| Senior LLM Agents Architect Senior LLM Agents Architect at NVIDIA to build and deploy agentic systems integrating LLMs with domain tools for HW/SW engineering workflows. Focus on developing end-to-end agent flows for simulation analysis, kernel optimization, and developer efficiency, including prototyping, integration, evaluation, and mentoring. | Agent | 8 |
| Distinguished Engineer, JAX Distinguished Engineer to develop NVIDIA's AI platform, focusing on performance optimizations in deep learning frameworks using JAX. The role involves designing and implementing core JAX components, driving peak performance on NVIDIA products, and building tools to increase the efficiency of AI-based system development teams. It bridges numerical computing, simulation, and deep learning research with real-world applications. | Serve | 8 |
| Senior Deep Learning Compiler Engineer - PyTorch Senior Deep Learning Compiler Engineer to develop and optimize PyTorch models for NVIDIA GPUs using compiler technology like Thunder, TorchDynamo, and TorchInductor. Focus on performance analysis and contributing to open-source AI ecosystem. | Serve | 8 |
| Senior Software Architect - Deep Learning and HPC Communications Senior Software Architect at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software to accelerate AI and HPC workloads. The role involves investigating performance bottlenecks, exploring HW/SW co-design, building proofs-of-concept, and performing quantitative modeling for large GPU clusters. | Serve | 8 |
| Distinguished Engineer - Dynamo Distinguished Engineer role focused on NVIDIA Dynamo, an AI inferencing platform. The role involves technical leadership, driving product direction, and contributing to open-source projects to achieve state-of-the-art performance and scalability for AI inference across modalities on NVIDIA hardware. | Serve | 8 |
| Principal Software Engineer - Dynamo Principal Software Engineer for NVIDIA Dynamo, an open-source platform for efficient, scalable inference of large language and reasoning models in distributed GPU environments. Focuses on Kubernetes serving, scalability, disaggregated serving, dynamic GPU scheduling, intelligent routing, and distributed KV cache management. | Serve, Agent | 8 |
| Principal Software Engineer – Large-Scale LLM Memory and Storage Systems NVIDIA is seeking a Principal Systems Engineer to design and evolve a unified memory layer for large-scale LLM inference, focusing on KV-cache offload, reuse, and sharing across heterogeneous clusters. The role involves deep integration with LLM serving engines and optimizing performance across GPU, CPU, and storage tiers. | Serve | 8 |
| Senior Software Engineer, Deep Learning - MLIR TRT Senior Software Engineer focused on developing and productizing deep learning solutions for autonomous driving vehicles, specifically involving compiler technology to optimize deep learning inference on NVIDIA hardware. The role requires expertise in deep learning frameworks, compiler technologies, and GPU programming. | Serve | 8 |
| Senior Software Engineer, Real-Time AI and Rendering - Holoscan SDK Senior Software Engineer at NVIDIA to build the future of real-time AI for sensor-driven applications using the Holoscan Platform. The role involves architecting APIs, prototyping GPU-accelerated algorithms for computer vision, imaging, sensor fusion, and low-latency rendering, and integrating generative models and multimodal foundation models into real-time pipelines. Focus on enabling GPU-resident generative methods for perception, simulation, and robotics. | Agent, Serve | 8 |
| Manager, Deep Learning Algorithms Manager for Deep Learning Algorithms at NVIDIA, focusing on leading engineering efforts for productizing DL models, optimizing inference, and collaborating with research teams to implement and improve algorithms. The role involves managing a team, aligning priorities, and developing the GPU-accelerated DL platform. | Serve | 8 |
| Senior Software Architect - Deep Learning and HPC Communications This role focuses on architecting and implementing next-generation communication software and platforms for deep learning and high-performance computing (HPC) applications, specifically targeting the efficient scaling of GPU clusters. The work involves identifying performance bottlenecks, designing new communication technologies, exploring hardware/software co-design, and using simulation to evaluate performance at massive scales. | Serve | 8 |
| Senior Deep Learning Performance Architect NVIDIA is seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures for AI and high-performance computing. Responsibilities include developing HW architectures for performance and energy efficiency, benchmarking AI workloads, creating simulation tools, and evaluating hardware features. Requires MS/PhD or equivalent experience with 4+ years in parallel computing architectures, GPU/ASIC architecture evaluation for training/inference, and strong Python/C++ skills. | Serve | 8 |
| Senior Software Architect, Advanced Development Senior Software Architect focused on accelerating networking and building AI data centers, researching transport functions for AI workloads, and leading architectural efforts in distributed AI, deep learning, HPC, SDN, virtualization, and storage. | Serve | 8 |
| Senior Manager, Deep Learning Performance Architecture NVIDIA is seeking an Engineering Manager to lead a Deep Learning Performance Architect Team. This role involves managing a team focused on analyzing deep learning networks and advancing deep learning computing systems through hardware/software co-design. Responsibilities include establishing team objectives, collaborating with software framework and hardware architecture teams, characterizing deep learning workloads, performance tuning, optimizing software stacks, and driving the evolution of next-generation hardware and software architectures. | Serve | 8 |
| Deep Learning Performance Architect NVIDIA is seeking Software Engineers to join their Deep Learning Inference team, focusing on developing and optimizing GPU-accelerated deep learning kernels for inference. The role involves performance analysis, tuning, and collaboration with cross-functional teams on innovative solutions. | Serve | 8 |
| AI Developer Technology Engineer NVIDIA is seeking an AI Developer Technology Engineer to work on optimizing AI techniques on GPU architectures and collaborate with customers and internal teams to influence future designs. The role involves studying and developing cutting-edge deep learning, graphs, and machine learning techniques, with a focus on performance analysis and optimization for GPUs. The engineer will also work with customers to understand their problems and provide AI solutions using GPUs, and collaborate with NVIDIA's internal teams to shape next-generation architectures and software platforms. | Serve | 8 |
| Director, Engineering – Software Engineering and AI Inferencing Platforms NVIDIA is seeking an Engineering Director to lead and scale software engineering teams in Vietnam, focusing on AI Inferencing Platforms and AI data/factory initiatives. The role involves driving the design, architecture, and delivery of high-performance system software platforms, collaborating with global teams, and overseeing the development and optimization of AI delivery platforms like NIMs and Blueprints. Experience with cloud, data, accelerated computing, and managing large AI/ML product teams is required. | Serve, Data | 8 |
| Senior System Software Architect, HPC and AI Networking NVIDIA is seeking a Senior System Software Architect to design and prototype scalable software systems for distributed AI training and inference, focusing on optimizing throughput, latency, and memory efficiency. The role involves developing and evaluating communication libraries, collaborating with AI framework teams, co-designing hardware features for AI acceleration, and contributing to runtime systems and protocol layers. | Serve, Post-train | 8 |
| Software Engineer, LLM Inference Software Engineer focused on developing and optimizing LLM inference software and frameworks, working with GPU-accelerated libraries and deep learning frameworks. | Serve | 8 |