Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
| Title | Stage | AI score |
|---|---|---|
| Senior Solutions Architect - Generative AI Senior Solutions Architect specializing in Generative AI, focusing on training LLMs, fine-tuning, and RAG implementation. The role involves architecting end-to-end solutions, collaborating with customers and sales teams, and providing technical leadership for LLM training and Agentic AI workflows on NVIDIA platforms. | Post-trainAgent | 9 |
| Senior Tools Development Engineer This role focuses on building agentic infrastructure for test automation and quality engineering within the NVIDIA Omniverse platform. The engineer will design and deploy multi-agent systems, orchestration frameworks, and autonomous pipelines, with a strong emphasis on evaluating agent output quality and establishing observability for these workflows. The goal is to enable engineers to ship high-quality software with greater speed and confidence. |
| AgentEval Gate |
| 9 |
| Senior Software Engineer, RAG and Agentic AI Senior Software Engineer to join the AI Blueprints team to build multimodal, scalable, production-grade reference RAG solutions using Agentic AI and NVIDIA Nemotron models. Develop orchestration layers to dynamically interact with proprietary unstructured and structured data sources. The role involves planning, building, and refining RAG workflows, designing and implementing AI agents for reasoning, planning, and multi-step execution, running POCs, hardening patterns, building and deploying end-to-end RAG pipelines using microservices architecture, and driving continuous improvement through evaluation and performance analysis. | AgentServe | 9 |
| Inference Optimization Architect, Speech AI NVIDIA is seeking an Inference Optimization Architect to accelerate and scale Speech AI models, focusing on reducing inference latency, improving throughput, and optimizing resource utilization across AI infrastructure. The role involves implementing model compression techniques, developing custom kernels, designing serving infrastructure, and optimizing inference across diverse GPU platforms. | Serve | 9 |
| Senior Applied Scientist - Sovereign AI Senior Applied Scientist/AI Engineer at NVIDIA focusing on Sovereign AI efforts. The role involves end-to-end model training (pre-training, CPT, SFT, alignment), rigorous evaluation and benchmarking, and inference optimization using tools like TensorRT-LLM and NIM. Requires strong Python, PyTorch, and experience with large-scale ML frameworks. | Post-trainServe | 9 |
| Manager, Test and Tools Development Engineering Manager for a test and tools development engineering team focused on building autonomous systems and AI-powered quality infrastructure for Omniverse. The role involves leading a team to design agentic test pipelines, multi-agent orchestration for test generation, failure triage, and establishing evaluation frameworks for AI-generated outputs. | Agent | 8 |
| Applied AI Engineer - DFT Methodology NVIDIA is seeking an Applied AI Engineer to explore and architect generative AI solutions, including LLMs, RAGs, and Agentic AI workflows, for Design-for-Test (DFT) and VLSI problems. The role involves deploying predictive ML models for silicon lifecycle management and collaborating with VLSI/DFX teams to integrate AI solutions. Experience in applied ML for chip design and deploying generative AI for engineering use cases is required. | Agent | 8 |
| Applied AI Engineer - DFT Methodology NVIDIA is seeking an Applied AI Engineer to explore and architect generative AI solutions, including LLMs, RAGs, and Agentic AI workflows, for Design-for-Test (DFT) and VLSI problems. The role involves deploying predictive ML models for silicon lifecycle management and collaborating with VLSI/DFX teams to integrate AI solutions. Experience in applied ML for chip design and deploying generative AI for engineering use cases is required. | Agent | 8 |
| Senior Architect - Server Performance NVIDIA is seeking architects to drive architectural performance for its next-generation AI server systems. This position demands a unique capability to bridge deep architectural knowledge, workload analysis, and hands-on silicon investigations. Candidates should be adept at working directly with silicon, high-level models, and simulators. Responsibilities include conducting performance investigations on both NVIDIA and competitive platforms, and developing targeted microbenchmarks to examine specific architectural aspects. The role does not heavily involve modeling tasks (functional or performance), though occasional focused assignments may arise. | Serve | 8 |
| Senior Software Engineer, RAG and Agentic AI Senior Software Engineer role focused on building and deploying production-grade RAG solutions and AI agents. The role involves designing and implementing scalable RAG architectures, developing AI agents with reasoning and multi-step execution capabilities, and orchestrating complex microservices deployments. Emphasis on optimizing RAG pipelines for accuracy, relevance, and performance, and driving continuous improvement through rigorous evaluation and collaboration. | AgentServe | 8 |
| Senior Tools Development Engineer NVIDIA is seeking a Senior Tools Development Engineer to build agentic infrastructure for test automation and quality engineering on the Omniverse platform. The role involves designing and deploying multi-agent systems, orchestration frameworks, and evaluation systems to improve software quality and reliability. | Agent | 8 |
| Senior Manager, System Software Engineering - Metropolis Accelerated and Inferencing Software Senior Manager for System Software Engineering at NVIDIA, focusing on Metropolis Accelerated and Inferencing Software. The role involves leading engineering teams, driving strategic implementations of inference solutions (TensorRT, VLLM) for edge and enterprise devices, performance benchmarking, and technical leadership in deep learning. Requires extensive experience in machine learning/deep learning, embedded software, GPU/CPU optimization, and multimodal AI systems. | ServeAgent | 8 |
| Senior Solutions Architect - Physical AI NVIDIA is seeking a Senior Solutions Architect for Physical AI to support customers building robotics and Physical AI solutions on NVIDIA’s platforms. This role involves guiding architecture, prototyping, and troubleshooting across robotics deployments from simulation to training to deployment, focusing on applied AI (computer vision, GenAI) for robotics. | AgentData | 8 |
| Senior GPU System Architect NVIDIA is seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves architecting system topologies, defining interconnects (NVLink, Ethernet), collaborating on RDMA, using system models for analysis, and co-designing hardware-software stacks for efficient AI workload deployment. | Serve | 8 |
| Senior System Software Engineer, Speech AI Senior System Software Engineer role focused on speech AI technologies (ASR, TTS, ALM, S2S) for enterprise and developer customers. Responsibilities include implementing, troubleshooting, and optimizing GPU-accelerated speech systems in production, transitioning models from research to production, optimizing inference performance, developing core speech services using C++ and Python with CUDA, and contributing to client SDKs. Requires strong programming skills, experience with inference pipelines, understanding of modern model architectures, and knowledge of real-time streaming audio and low-latency systems. Experience with speech model fine-tuning is required. | ServePost-train | 8 |
| Senior System Software Engineer, Speech AI NVIDIA is seeking an experienced Software Engineer to work on their GPU-accelerated Speech AI platform, focusing on building and optimizing core speech recognition (ASR), text-to-speech (TTS), and S2S services for real-time conversational AI applications. The role involves developing C++ & Python backend implementations, optimizing inference performance, adding new features, contributing to client libraries, and performance analysis of complex systems. | ServePost-train | 8 |
| System Software Engineer - Deep Learning System Software Engineer at NVIDIA focused on accelerating deep learning inference for autonomous driving systems using NVIDIA GPUs and DL accelerators. The role involves developing SDKs/frameworks for LLMs and state-of-the-art models, benchmarking, and optimizing for latency, accuracy, and power consumption. Requires experience with deep learning frameworks, DNN optimization, and C/C++. | ServePost-train | 8 |
| AI Developer Technology Engineer NVIDIA is seeking an AI Developer Technology Engineer to work on optimizing AI techniques on GPU architectures and collaborate with customers and internal teams to influence future designs. The role involves studying and developing cutting-edge deep learning, graphs, and machine learning techniques, with a focus on performance analysis and optimization for GPUs. The engineer will also work with customers to understand their problems and provide AI solutions using GPUs, and collaborate with NVIDIA's internal teams to shape next-generation architectures and software platforms. | Serve | 8 |
| GPU System Architect NVIDIA is seeking a GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves defining system architectures that tightly couple GPU compute, memory, and interconnects for optimal AI performance, scalability, and resilience. Responsibilities include architecting system topologies, defining high-speed interconnects, collaborating on RDMA hardware, using system models for analysis, and enabling hardware-software co-design. | Serve | 7 |
| Software Solutions Engineer NVIDIA is seeking a Software Solutions Engineer to support NVIDIA AI Enterprise customers. This role involves end-to-end customer issue resolution and building software features, automation, and deployment tooling to enhance product readiness and scalability in cloud and datacenter environments. The engineer will work with compute, cloud-native technologies, and GPU-accelerated AI frameworks, requiring strong debugging, communication, and ownership skills. | ServeAgent | 7 |
| SWQA Test Developer SWQA Test Developer for embedded systems in AI, Retail, Healthcare, Media, Finance. Role involves using AI tools for QA automation, CI/CD, bug fixing, and building/automating workflows/agents. Also involves training/fine-tuning models for agent optimization and developing tests for embedded/GPU systems. Requires Python/C++ skills, Linux, and knowledge of Docker/cloud platforms. | Agent | 7 |
| Senior Software Engineer, Mapping - Autonomous Vehicles NVIDIA is seeking a Senior Software Engineer for their Autonomous Vehicles Mapping team. The role involves designing and developing algorithms for map-based driving products, architecture design, and implementing efficient in-vehicle code. Key responsibilities include researching and developing transformer models for graphs, implementing evaluation frameworks for LLMs, fine-tuning pretrained models, building automated map content analysis, and creating scalable map-building workflows. The role requires a background in computer vision, 3D geometry, machine learning, and heavy AI tool usage for development. | AgentPost-train | 7 |
| Senior Site Reliability Engineer - Datacenter Automation NVIDIA is seeking an experienced Senior Site Reliability Engineer to scale its AI Infrastructure, focusing on production systems for large GPU clusters used in AI workloads. The role involves implementing monitoring, health management, and automation for GPU asset provisioning, configuration, and lifecycle management across cloud providers, ensuring reliability, availability, and scalability. The engineer will collaborate with teams to maintain reliable and performant AI clusters, evaluate system failures, and improve services. | Serve | 7 |
| Senior System Software Engineer - Video Senior System Software Engineer role focused on building and optimizing system software for NVIDIA's video subsystem, involving AI/ML and computer vision algorithms for video compression and multimedia processing on Tegra Application Processors and GPUs. Requires strong C/C++ and Python skills, experience with video compression standards, and a track record in pre/post-processing algorithms. | Serve | 7 |
| Senior System Software Engineer - Computer Vision Algorithms and SDK Senior System Software Engineer focused on developing and optimizing computer vision, signal processing, and machine learning algorithms for specialized DSP hardware (PVA engine) and enhancing the associated SDK. The role involves working with internal and external customers to enable efficient algorithm development and optimization on the hardware. | Serve | 7 |
| DGX Cloud Performance Engineer NVIDIA is seeking Parallel and Distributed Systems engineers to drive performance analysis, optimization, and modeling for their DGX Cloud AI platform. The role involves developing benchmarks, analyzing performance bottlenecks, and collaborating with AI researchers to improve system performance and usability. Expertise in large-scale parallel systems, AI workloads, performance modeling, and AI frameworks is required. | Serve | 7 |