Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in serving infrastructure (54%), agents (21%), and application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, LLM observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles, a 50% decrease from the prior 30 days (218 → 110).
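The 30-day figure above can be reproduced with a plain percent-change calculation. The sketch below is only illustrative: the function name `percent_change` is hypothetical, and it assumes the tracker rounds to the nearest whole percent, which the page does not state.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the relative change from `previous` to `current`, in percent."""
    if previous == 0:
        raise ValueError("percent change is undefined when the previous count is 0")
    return (current - previous) / previous * 100.0


# Counts taken from the summary: 218 postings in the prior 30 days, 110 in the latest.
change = percent_change(218, 110)
print(f"{change:.1f}%")      # unrounded value
print(f"{round(change)}%")   # rounded to a whole percent (assumed display rule)
```

Under that rounding assumption, the unrounded change of roughly -49.5% displays as -50%.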
| Title | Stage | AI score |
|---|---|---|
| Senior Software Engineer, Metropolis Vision AI Senior Software Engineer to develop and optimize high-performance Vision AI pipelines and large-scale distributed services for processing video, image, and 3D data. The role involves crafting real-time systems, developing multi-modal perception, using simulation/synthetic data, and profiling/tuning GPU-accelerated inference pipelines. Collaboration with research and platform teams is key, with an emphasis on bringing research into production at scale. | Serve · Post-train | 8 |
| Senior Software Engineer, AI Networking Senior Software Engineer role focused on building and productizing ML tools for optimizing AI workloads (LLM training/inference) across GPU/CPU clusters, with a focus on networking and system resource utilization. Involves distributed deep learning, ML-based optimization techniques, and performance analysis. | Serve · Agent | 8 |
| Senior Deep Learning Scientist, Speech Synthesis NVIDIA is seeking a Senior Deep Learning Scientist to work on their Speech AI product, Riva. The role involves training speech synthesis models (mel-spectrogram and vocoder), measuring and analyzing model performance, maintaining the TTS evaluation system, and improving speech data processing and training set preparation. The ideal candidate has a Master's or PhD, 5+ years of ML/AI experience, strong Python and PyTorch skills, and hands-on experience training speech synthesis models. | Data · Post-train | 8 |
| Senior AI-Native Systems Software Engineer, TensorRT Senior engineer to architect and build an AI-native framework using AI agents for software development, focusing on scaling, performance optimization, and integrating SOTA models for inference. | Agent · Serve | 8 |
| Senior Performance Engineer - LLM Inference Frameworks NVIDIA is seeking a Senior Performance Engineer to optimize LLM inference infrastructure on GPUs, focusing on throughput, memory efficiency, and scalability. The role involves designing and implementing high-performance pipelines, profiling, tuning model execution, and innovating techniques like Speculative Decoding and quantization. Experience with deep learning frameworks and performance debugging is required. | Serve | 8 |
| Senior Staff Software Engineer - Agentic Automation Senior Staff Software Engineer to own engineering efforts for NVIDIA enterprise systems, transforming support into AI-infused automated resolution systems using LLM-based agents, tool calling, RAG, and orchestration frameworks. Requires full-stack experience, strong systems thinking, and incident management skills. | Agent | 8 |
| AI Computing Development Engineer, TensorRT-LLM NVIDIA is seeking software engineers to develop and optimize inferencing software for AI models, specifically focusing on TensorRT-LLM. This role involves performance analysis, tuning, and collaboration across teams to advance machine learning inferencing capabilities. | Serve | 8 |
| Senior Software Engineer, JAX Senior Software Engineer to develop NVIDIA's AI platform, focusing on performance optimizations in deep learning frameworks using JAX. The role involves designing and implementing JAX core components, driving performance on NVIDIA products, and building tools to increase efficiency for AI-based systems. | Serve | 8 |
| Senior Architect - Server Performance NVIDIA is seeking architects to drive architectural performance for its next-generation AI server systems. This position demands a unique capability to bridge deep architectural knowledge, workload analysis, and hands-on silicon investigations. Candidates should be adept at working directly with silicon, high-level models, and simulators. Responsibilities include conducting performance investigations on both NVIDIA and competitive platforms, and developing targeted microbenchmarks to examine specific architectural aspects. The role does not heavily involve modeling tasks (functional or performance), though occasional focused assignments may arise. | Serve | 8 |
| Principal Deep Learning Communication Architect NVIDIA is seeking a Principal Deep Learning Communication Architect to lead the technical roadmap for communication libraries across next-generation platforms, ensuring seamless scaling of models to massive clusters. The role involves designing and optimizing communication primitives for heterogeneous interconnects, co-designing with application developers and silicon architects, and developing analytical models for system behavior. Expertise in parallel computing, HPC/distributed deep learning, inference engines, and GPU architecture is required. | Serve · Agent | 8 |
| Developer Technology Engineer - AI NVIDIA Developer Technology Engineer focused on optimizing AI workloads, particularly large language models (LLMs), on NVIDIA's GPU platform. The role involves deep dives into application performance, GPU kernel optimization, distributed training and inference, and collaboration with various internal teams and external developers. It requires strong software engineering skills, parallel programming expertise, and a focus on performance analysis and tuning. | Serve · Post-train | 8 |
| Senior Solutions Architect, CSP System Senior Solutions Architect focused on building and optimizing Kubernetes infrastructure for Agentic AI and Agentic RL workloads, working with Cloud Service Providers in China. | Agent · Serve | 8 |
| Senior Integration Engineer - Autonomous Vehicles NVIDIA is seeking a Senior Integration Engineer to work on their end-to-end autonomous driving application, focusing on integrating modular software components and optimizing performance on heterogeneous hardware architectures. The role involves defining software architecture for L2/L3/L4 autonomous driving solutions, performing in-vehicle and simulation testing, and developing efficient C++ code using CUDA. | Agent | 8 |
| Senior Integration Engineer - Autonomous Vehicles Senior Integration Engineer for NVIDIA's end-to-end autonomous driving application, focusing on integrating software components, optimizing performance, and developing efficient C++ code on heterogeneous hardware architectures (including GPUs) for L2/L3/L4 autonomous driving solutions. | Agent · Serve | 8 |
| AI and FSI Developer Technology Engineer - New College Grad 2026 NVIDIA is seeking an AI and FSI Developer Technology Engineer to optimize AI and HPC workloads on NVIDIA GPUs and CPUs, focusing on performance tuning and eliminating bottlenecks for financial markets. The role involves research, development, analysis, and collaboration with experts to improve performance across the stack, from algorithms to kernels. The engineer will also publish and present their work and influence future hardware/software designs. | Serve | 8 |
| Deep Learning Algorithms Engineer - ACOT NVIDIA is looking for an AI Acceleration & Optimization Engineer to optimize the performance, scalability, and efficiency of AI models (LLMs, VLMs, diffusion, multimodal) on NVIDIA GPU platforms. The role involves profiling, identifying bottlenecks, and applying optimization techniques like quantization and kernel fusion, using tools such as CUDA, TensorRT, and Nsight. Collaboration with various teams (algorithms, systems, hardware, research, CUDA, compiler, frameworks) is key to bringing models from research to production. | Serve · Post-train | 8 |
| Principal Software Engineer - Enterprise AI Platform Principal Software Engineer to lead security foundations for autonomous, self-evolving agents in an enterprise setting. This role involves defining security requirements, designing scalable architectures with guardrails, implementing isolation and access controls, building secure data access pathways, establishing observability and auditing, and operating a continuous evaluation framework for agent behavior. The goal is to enable developer velocity while ensuring robust safety and security for agents that generate and execute code and access data. | Agent | 8 |
| Senior Machine Learning Applications and Compiler Engineer, LPX NVIDIA is seeking a Senior Machine Learning Applications and Compiler Engineer to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to map neural network workloads onto future NVIDIA platforms. | Serve | 8 |
| Senior Machine Learning Applications and Compiler Engineer, LPX Develops algorithms and optimizations for NVIDIA's LPX inference and compiler stack, focusing on mapping neural network workloads onto future NVIDIA platforms and optimizing end-to-end inference performance. Requires strong software engineering, compiler/runtime development, and deep learning framework experience. | Serve | 8 |
| Senior Software Engineer – TensorRT Edge-LLM Senior Software Engineer to develop and optimize a state-of-the-art inference framework for Large Language, Vision-Language, and Multimodal models on edge and embedded platforms, focusing on real-time performance and constrained environments. | Serve | 8 |
| Senior Performance Engineer - Deep Learning Senior Performance Engineer at NVIDIA focused on optimizing Deep Learning models and frameworks (PyTorch, JAX) for NVIDIA GPUs. The role involves building and supporting Transformer Engine, collaborating on systems research for performance improvements, implementing and benchmarking new DL models, contributing to MLPerf, and engaging with the open-source community and enterprise customers. It also involves influencing future hardware and software design. | Serve · Post-train | 8 |
| Senior Software Engineer, Quantized Inference Senior Software Engineer focused on optimizing quantized inference for LLMs by implementing recipes, developing kernels, and collaborating on inference engines like vLLM and TRT-LLM. The role involves model export pipelines, benchmarking, and data analysis tooling. | Serve | 8 |
| Senior Compiler Engineer, AI Inference Platforms NVIDIA is seeking a Senior Compiler Engineer to join its Deep Learning & AI Compiler (DLC) team. The role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and architecture teams to accelerate AI inference performance on NVIDIA GPUs. The compiler is critical for data centers, personal devices, automotive, and robotics, focusing on inference performance, build time, memory footprints, and ease of use. | Serve | 8 |
| Developer Technology Engineer, AI NVIDIA Developer Technology Engineer focused on optimizing core parallel algorithms and data structures for GPUs, specifically working with LLM training frameworks and performance optimization. Collaborates with application developers and internal NVIDIA teams to improve performance and developer efficiency. | Data | 8 |
| Principal AI Developer Technology Engineer This role focuses on researching and developing techniques to accelerate AI workloads (deep learning, machine learning) on advanced computer architectures, specifically GPUs. The engineer will perform in-depth analysis and optimization of complex AI and HPC algorithms, publish findings, and influence future hardware/software design. Requires deep C/C++ programming, parallel programming (CUDA, etc.), low-level performance optimization, and CPU/GPU architecture expertise. | Serve | 8 |
| Principal AI Developer Technology Engineer Seeking a Principal Developer Technology Engineer to research and develop techniques for GPU acceleration of AI workloads, focusing on performance optimization of deep learning and HPC algorithms on modern CPU and GPU architectures. This role involves collaborating with internal teams and the developer community, influencing hardware/software design, and publishing findings. | Serve | 8 |
| AI Chip Design Engineer - New College Grad 2026 NVIDIA is seeking an AI Chip Design Engineer to develop and integrate AI capabilities into verification tasks. The role involves creating AI agents to enhance productivity, building production infrastructure for these agents, and optimizing algorithms for enterprise data. Requires strong proficiency in LLM libraries, GPU/CPU architectures, and HW verification methodologies. | Agent | 8 |
| Senior Tools Development Engineer NVIDIA is seeking a Senior Tools Development Engineer to build agentic infrastructure for test automation and quality engineering on the Omniverse platform. The role involves designing and deploying multi-agent systems, orchestration frameworks, and evaluation systems to improve software quality and reliability. | Agent | 8 |
| Senior AI Performance and Efficiency Engineer Senior AI/ML Performance and Efficiency Engineer focused on optimizing GPU cluster performance for AI/ML researchers by addressing infrastructure and application bottlenecks. This role involves building tools, analyzing efficiency, and collaborating across teams to improve hardware, software, and infrastructure usage for various ML workloads like Robotics, Autonomous vehicles, LLMs, and Videos. | Serve | 8 |
| Senior AI Developer Technology Engineer Senior Developer Technology Engineer focused on researching and developing techniques to GPU accelerate AI workloads, optimizing performance on modern CPU and GPU architectures, and collaborating with the developer community and internal teams to influence next-generation hardware and software design. | Serve | 8 |
| Senior AI Formal Verification Engineer Senior AI Formal Verification Engineer to enhance in-house formal tools with AI, leveraging LLMs and ML to automate intent-to-proof workflows and debug complex chips. Role involves architecting methodologies, developing AI agents, and creating AI-based debug assistants. | Agent · Serve | 8 |
| Engineering Manager, AI Developer Technology Engineering Manager for NVIDIA's AI Developer Technology team, focused on leading a team to optimize and develop algorithms for Deep Learning and Machine Learning applications, influencing next-generation hardware/software, and collaborating with customers and internal teams. The role involves optimizing training and inference performance on NVIDIA hardware. | Serve · Post-train | 8 |
| Senior Developer Technology Engineer - AI Senior Developer Technology Engineer focused on researching and optimizing AI/ML workloads for GPU acceleration, involving deep analysis, performance tuning, and collaboration with the developer community and internal teams to influence next-generation hardware and software design. | Serve | 8 |
| Senior Design Automation Engineer, Applied AI NVIDIA is seeking an Applied AI Engineer to lead end-to-end solution development for timing and constraint analysis workflows in VLSI/ASIC design. The role involves data generation, model training, orchestration, and building autonomous agents that interact with timing tools. The engineer will develop AI-driven solutions, integrate data sources, implement scalable orchestration, and build interpretable AI pipelines using GNNs, LLMs, and reasoning engines. Experience with Python, PyTorch/TensorFlow, graph/agentic AI frameworks, and EDA tools is required. | Agent · Data | 8 |
| Senior AI and MLOps Engineer - Security and Networking Research Senior AI/MLOps Engineer focused on building and maintaining infrastructure, tools, and processes for the AI lifecycle in a production environment, specifically for security and networking AI models and agents. The role involves optimizing models, deploying agentic systems and LLMs, designing training/inference pipelines, and collaborating with various engineering teams. | Agent · Serve | 8 |
| Senior Product Architect, Storage NVIDIA is seeking a Senior Product Architect to design and validate AI storage infrastructure, focusing on optimizing systems for large-scale foundation model training, disaggregated inference, and agentic AI pipelines. The role involves architecting end-to-end reference architectures, defining system-level architectures, and collaborating with partners and customers to deliver proof-of-concepts. | Agent · Serve | 8 |
| Manager, AI and Software Manager for an AI team at NVIDIA, focusing on developing and leading the implementation of cutting-edge AI applications including RAG, LLMs, AI Agents, recommendation engines, and classical AI models. The role involves managing a team of 6-8 engineers, providing technical leadership, and collaborating with cross-functional teams to identify and implement AI opportunities. | Agent · Data | 8 |
| Senior GPU System Architect NVIDIA is seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves architecting system topologies, defining interconnects (NVLink, Ethernet), collaborating on RDMA, using system models for analysis, and co-designing hardware-software stacks for efficient AI workload deployment. | Serve | 8 |
| Senior System Software Engineer, Speech AI NVIDIA is seeking an experienced Software Engineer to work on their GPU-accelerated Speech AI platform, focusing on building and optimizing core speech recognition (ASR), text-to-speech (TTS), and S2S services for real-time conversational AI applications. The role involves developing C++ & Python backend implementations, optimizing inference performance, adding new features, contributing to client libraries, and performance analysis of complex systems. | Serve · Post-train | 8 |
| Senior Software Engineer – ADAS Senior Software Engineer to develop production ADAS and autonomous driving functions in C++ and Python, integrating deep learning models into real-time inference pipelines on NVIDIA GPUs for safety-critical automotive applications. | Serve · Post-train | 8 |
| Senior Software Engineer, Robotics - Isaac Lab NVIDIA is seeking a Senior Software Engineer for their Isaac Lab team to develop features for a robot learning platform, focusing on reinforcement learning, multi-agent learning, and sim-to-real deployment. The role involves automating workflows, scaling in the cloud, and collaborating with research teams on next-generation robots. | Agent · Data | 8 |
| Software Engineering Manager, Robotics NVIDIA is seeking a Robotics Software Engineering Manager to lead a team focused on sim-first development, real-world deployment, and continuous learning for physical AI robots, such as Humanoid Robots. The role involves hands-on development, implementation, and deployment of real-time software stacks, fostering innovation, and collaborating with cross-functional teams. | Ship · Agent | 8 |
| Developer Technology Engineer - AI NVIDIA is seeking an AI Developer Technology Engineer to collaborate with developers, optimize AI workloads on GPUs, research innovative AI techniques, and ensure peak performance on GPU architectures. The role involves developing and optimizing parallel algorithms and data structures, influencing next-gen architectures, and requires proficiency in C++, AI algorithms, and specific domains like multi-modal models or RL for LLMs. | Serve · Post-train | 8 |
| Senior HPC Performance Engineer - AI for Science at Scale Senior HPC Performance Engineer focused on optimizing large-scale, CUDA-backed ML training frameworks for AI in Science applications, particularly in digital biology and chemistry. The role involves kernel design, GPU porting, distributed learning, and algorithmic improvements within HPC software stacks. | Serve · Post-train | 8 |
| Senior High-Performance System Architect NVIDIA is seeking a Senior High-Performance System Architect to define and research NVL system architecture for large-scale, high-performance computing clusters used to train advanced AI models. The role involves working across algorithms, software, firmware, and hardware, collaborating with cross-functional teams, and analyzing simulation results. | Serve · Pretrain | 8 |
| Manager, Deep Learning Algorithms Manager for Deep Learning Algorithms at NVIDIA, focusing on productizing DL models, optimizing inference, and leading engineering teams. The role involves working with LLMs/VLMs, inference optimization, and collaborating across NVIDIA to develop state-of-the-art algorithms for GPU-accelerated platforms. | Serve | 8 |
| Senior Deep Learning Engineer - AI for Wireless Systems NVIDIA is seeking a Senior Deep Learning Engineer to develop AI-native wireless networks, integrating deep learning into signal processing and radio access technologies. The role involves designing, prototyping, implementing, training, and optimizing deep learning models for real-time inference and deployment on GPU platforms, collaborating with researchers and system engineers. | Serve · Post-train | 8 |
| Engineering Manager - AI for RAN and 6G Wireless Systems NVIDIA is seeking an Engineering Manager to lead a team developing AI/ML models for 6G wireless networks. The role involves guiding model development, training, evaluation, and deployment, with a focus on integrating deep learning into signal processing and radio access technologies. Experience with Python, PyTorch/TensorFlow, and leading engineering teams is required. | Serve · Post-train | 8 |
| System Software Engineer - Deep Learning System Software Engineer at NVIDIA focused on accelerating deep learning inference for autonomous driving systems using NVIDIA GPUs and DL accelerators. The role involves developing SDKs/frameworks for LLMs and state-of-the-art models, benchmarking, and optimizing for latency, accuracy, and power consumption. Requires experience with deep learning frameworks, DNN optimization, and C/C++. | Serve · Post-train | 8 |
| Senior AI Infrastructure Software Engineer Senior AI Infrastructure Software Engineer at NVIDIA, focusing on building and scaling infrastructure for AI agents and applications in chip design. The role involves designing, developing, and improving scalable infrastructure, driving performance and reliability improvements, and collaborating with research and hardware teams. Requires expertise in Python, distributed systems, microservices, and integrating LLMs/agent frameworks. | Agent · Serve | 8 |