Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles, a 50% decrease versus the prior 30 days (218 → 110).
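The 30-day figure can be checked with a short calculation. This is a minimal sketch, assuming the quoted percentage is the plain relative change between the two posting counts; the function name `percent_change` is hypothetical and is not part of any index tooling.

```python
def percent_change(previous: int, current: int) -> float:
    """Relative change from `previous` to `current`, in percent."""
    if previous == 0:
        raise ValueError("previous must be non-zero")
    return (current - previous) / previous * 100.0


# Posting counts quoted in the text: 218 in the prior window, 110 in the latest.
change = percent_change(218, 110)
print(f"{change:.1f}%")  # -49.5%, which rounds to the 50% drop quoted in the text
```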
| Title | Stage | AI score |
|---|---|---|
| Research Scientist, Deep Learning and Computer Vision - New College Graduate Research Scientist role at NVIDIA focusing on deep learning and computer vision, with an emphasis on novel methods, generative and multimodal AI, and explainable AI. The role involves research, design, implementation, publication, and technology transfer to product groups. Requires a Ph.D. and a strong publication record in top conferences. | Pretrain | 10 |
| Senior Scientist, Synthetic Data Generation Senior Scientist focused on synthetic data generation for training frontier LLMs, contributing to open-source libraries and advancing multimodal data generation. | Data, Post-train | 10 |
| AI Research Scientist AI Research Scientist at NVIDIA focusing on GPU-accelerated generative AI models for language, vision, and robotics, with a strong emphasis on publishing novel research and efficient model design. | Pretrain | 10 |
| AI Research Scientist AI Research Scientist at NVIDIA focusing on developing and optimizing GPU-accelerated generative AI models for language, vision, and robotics, with a strong emphasis on publishing research and transferring technology to products. | Pretrain | 10 |
| Research Scientist NVIDIA Research Singapore is seeking AI researchers to develop and optimize GPU-accelerated efficient AI computing for language models, visual generation, and robotics, focusing on pushing generative AI boundaries and publishing research. | Pretrain | 10 |
| Senior Research Scientist, Multimodal Foundation Models and Robotics Research Scientist role focused on building multimodal foundation models and systems for humanoid robots and embodied agents, involving algorithm design, large-scale training/inference, and deployment on physical hardware and simulations. | Post-train, Agent | 10 |
| Senior Deep Learning Researcher, Diffusion Senior Deep Learning Researcher at NVIDIA focusing on diffusion-based technologies and multi-modal learning. The role involves inventing and building new techniques, combining diffusion models with LLMs, and publishing research findings. It requires a PhD, research experience, and a strong publication record in leading AI conferences and journals. | Pretrain, Post-train | 10 |
| Senior Research Scientist, Fundamental Generative AI Senior Research Scientist focused on fundamental generative AI research, particularly for biomolecular design and scientific applications. The role involves designing and implementing novel, large-scale generative models, publishing research, and transferring technology to product groups. | Pretrain | 10 |
| AI Research Scientist AI Research Scientist at NVIDIA focusing on developing and optimizing GPU-accelerated generative AI models for language, vision, and robotics, with a strong emphasis on publishing research and transferring technology to products. | Pretrain | 10 |
| Research Scientist, Fundamental Generative AI - New College Grad 2026 Research Scientist role focused on fundamental generative AI research, particularly for scientific applications like biomolecular design. The role involves developing novel, large-scale generative models, publishing research, and potentially transferring technology to product groups. Requires a strong theoretical and practical understanding of generative AI and deep learning. | Pretrain | 10 |
| Research Scientist, Generalist Embodied Agent Research - PhD New College Grad 2026 Research Scientist role focused on building humanoid robot foundation models and general-purpose embodied agents. Involves designing and implementing novel AI algorithms, developing large-scale training and inference methods, and deploying models in simulation and on hardware. Requires strong experience in LLMs, multimodal foundation models, reinforcement learning, agent learning, and applied robotics, with a PhD and publication record. | Agent, Data | 10 |
| Senior Research Scientist, Multimodal Foundation Models and Robotics Research Scientist role focused on developing multimodal foundation models and systems for general-purpose humanoid robots and embodied agents, involving algorithm design, large-scale training/inference, and deployment on physical hardware and simulations. | Post-train, Agent | 10 |
| Deep Learning Senior Engineer, End-To-End Autonomous Driving NVIDIA is seeking a Deep Learning Senior Engineer to design, implement, and deploy end-to-end autonomous driving systems. The role focuses on AI 2.0, leveraging LLMs, VLMs, and VLAs for reasoning and planning in autonomous vehicles and robotics. Responsibilities include training large-scale models, building and fine-tuning LLM/VLM/VLA systems, exploring data generation strategies, and deploying models in production environments, integrating them with vehicle firmware. | Post-train, Agent | 9 |
| Senior Research Scientist, Nemotron Post-training Research Scientist/Engineer at NVIDIA focused on building Nemotron models, specifically working on post-training pipelines, synthetic data, agentic RL, data/training infrastructure, and large-scale model post-training. The role involves advancing open-source foundation models, developing training data, benchmarks, LLMs, and software, and solving end-to-end foundation model post-training challenges. Requires a Master's/PhD and 5+ years of experience in model post-training, RL, and agentic systems, with experience in data curation, model training, and inference/deployment environments. | Post-train, Agent | 9 |
| High-Performance LLM Training Engineer - New College Grad 2026 NVIDIA is seeking an experienced engineer to optimize LLM training workloads on high-performance computing systems, focusing on software stack optimization for thousands of GPUs and influencing future hardware roadmaps. The role involves performance analysis, profiling, and implementation across the deep learning platform, from drivers to frameworks, and contributing to MLPerf benchmarks. | Data | 9 |
| Research Scientist, Efficient Deep Learning - New College Grad 2026 Research Scientist role focused on efficient deep learning methods, including post-training optimization, efficient architecture design, and resource-efficient training/finetuning. Requires a Ph.D. or equivalent research experience, strong Python/PyTorch skills, and experience with large-scale model training and large vision-language models. The role involves research, implementation, publication, and technology transfer. | Post-train, Serve | 9 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to analyze, model, and optimize deep learning system performance, particularly for LLM workloads, on state-of-the-art hardware architectures. This role influences future hardware and software design by collaborating with various internal teams. | Serve | 9 |
| Deep Learning Performance Architect NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architectures for edge devices, workstations, and data center GPUs. The role involves benchmarking, performance modeling, bottleneck identification, and exploring new hardware/software capabilities, with a focus on LLMs and generative AI. Experience with AI agents for engineering workflows is also mentioned. | Serve, Post-train | 9 |
| Senior Scientist, Synthetic Data and Privacy Senior Scientist role focused on building LLM-based methods for synthetic data generation and privacy-preserving AI, contributing to open-source libraries within the NVIDIA NeMo ecosystem. The role involves applied research, software engineering, and optimizing LLMs for inference, with a strong emphasis on publishing original research. | Data, Serve | 9 |
| Senior Quantum Applied Research Scientist, Calibration and Decoding Research Scientist at NVIDIA focusing on developing AI models for quantum system calibration and decoding. This role involves building physics-informed synthetic data generation pipelines, developing surrogate models of quantum hardware, and architecting real-time AI systems. The work also includes applying reinforcement learning and online learning methods for optimization, with a strong emphasis on GPU acceleration and collaboration across Product, Engineering, and Applied Research teams to advance fault-tolerant quantum computing. | Post-train, Data | 9 |
| Senior Research Manager, World Model Evaluation Lead a research team focused on world-model evaluation and benchmarking for NVIDIA's Physical AI portfolio, defining the scientific roadmap for closed-system and open-system evaluations, developing benchmarks for various physical AI capabilities, and driving evaluation-to-model-improvement loops. The role requires publishing high-quality papers and establishing rigorous standards. | Eval Gate, Post-train | 9 |
| Senior Systems Software Engineer, AI Stack and Performance - DGX Station Senior Systems Software Engineer focused on optimizing AI stack performance and readiness on NVIDIA's DGX Station, a workstation-class AI computer. The role involves profiling, identifying bottlenecks, and driving optimizations across the full stack from GPU kernels to applications, ensuring AI workloads like LLM inference and agents run efficiently in multi-GPU, multi-user configurations. Collaboration with framework, compiler, and GPU architecture teams is critical. | Serve, Ship | 9 |
| Applied AI Researcher - World Reconstruction and Generation NVIDIA is seeking an Applied AI Researcher to work on NuRec-related research in world reconstruction and generation, developing and adapting Deep Learning-based methods for tasks like novel view synthesis, generative modeling, and neural rendering. The role involves prototyping with Python/PyTorch, building evaluation and agentic AI-assisted research workflows, and turning research into usable technology. Requires a PhD or MS with significant experience in ML/DL, Computer Graphics, Computer Vision, or 3D reconstruction, with strong Python/PyTorch skills. | Post-train, Agent | 9 |
| Senior Machine Learning Engineer, Perception - Autonomous Driving NVIDIA is seeking a Senior Machine Learning Engineer for their Autonomous Driving Perception team. The role involves designing and developing end-to-end deep learning solutions for perception modules, focusing on road layout detection, lane structures, and other critical driving components. The engineer will also drive data-driven development, leverage simulation and augmentation, and productize solutions meeting safety and latency requirements. Experience with deep learning frameworks, Python/C++, and perception for autonomous driving or robotics is essential. | Ship, Data | 9 |
| Senior Software Engineer, DGX Cloud AI Infrastructure Senior Software Engineer to lead the bring-up, triage, benchmarking, analysis, and optimization of distributed training and inference workloads across NVIDIA GPU platforms at scale. This role involves setting technical direction for communication libraries, model frameworks, and inference/training stacks, leading performance and reliability investigations, defining benchmarking and qualification processes, and building resilience capabilities for large clusters. | Serve, Post-train | 9 |
| Senior Deep Learning Performance Architect NVIDIA is seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures for AI and HPC applications. The role involves developing innovative architectures, analyzing performance/cost/power trade-offs using models and simulators, understanding hardware/software interplay, and evaluating PPA for architectural decisions. Collaboration with software, product, and research teams is key. Requires MS/PhD, 6+ years experience, strong background in GPU/Deep Learning ASIC architecture for distributed training/inference, performance modeling, and ML/DL fundamentals, particularly transformer architectures. Proficiency in Python, C, C++ is essential. | Serve | 9 |
| Research Scientist, Electronic Design Automation - New College Grad 2026 Research Scientist role focused on applying AI/ML techniques, including supervised, unsupervised, reinforcement learning, and agentic AI, to Electronic Design Automation (EDA) and VLSI design. The role involves defining and conducting original research, innovating in EDA software and algorithms, and applying deep learning to improve chip design tools, with a strong emphasis on publication and collaboration. | Post-train, Agent | 9 |
| AI Inference Performance Engineer - New College Grad 2026 NVIDIA is seeking an AI Inference Performance Engineer to optimize and benchmark GenAI inference on their accelerators, working with frameworks like TensorRT-LLM, SGLang, and vLLM. The role involves driving industry benchmark results, defining cutting-edge workloads, architecting distributed inference, establishing performance methodology, and influencing the ecosystem through open-source contributions and cross-functional partnerships. Requires strong programming skills, DL framework expertise, and a deep understanding of LLM inference mechanics. | Serve | 9 |
| Senior Software Engineer, Generative AI Research NVIDIA is seeking a Senior Software Engineer for Generative AI Research to build and operate scalable infrastructure for training their world foundation model for physical AI, Cosmos. This role involves designing and developing high-throughput systems for data processing, retrieval, and workflow orchestration, improving system reliability and performance, and contributing to long-term infrastructure strategy for training, data management, and large-scale compute efficiency. The role requires a strong engineering background in distributed systems, ML infrastructure, or large-scale compute/data platforms, proficiency in Python and C++/Go/Rust, and experience with orchestration systems and data pipelines. Experience with large-scale model training infrastructure, distributed compute, synthetic data, or multimodal datasets is a plus. | Data, Pretrain | 9 |
| Senior Software Manager, Agentic AI Senior Software Manager to lead a team building agentic AI solutions for chip design workflows, involving coding agents, custom skills, and integration with enterprise systems. The role requires technical leadership in designing, developing, and deploying AI applications using LLMs and agentic systems, including model customization (fine-tuning, RL, instruction tuning) and overseeing retrieval/generation algorithms for enterprise data. Collaboration with cross-functional teams and ensuring high technical standards for evaluation, guardrails, and monitoring are key. | Agent, Post-train | 9 |
| Deep Learning Performance Software Engineer Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks. | Serve | 9 |
| Senior Research Scientist NVIDIA is seeking a Senior Research Scientist to join their applied research team focused on building next-generation Conversational AI systems. The role involves developing new Deep Learning models for speech recognition, speech synthesis, neural machine translation, and natural language processing, designing large-scale training algorithms, and open-sourcing models via the NeMo framework. The position requires a PhD, at least 5 years of research experience in speech recognition or NLP, a strong understanding of Deep Learning in these areas, proficiency in Python and PyTorch, and a strong publication record. Collaboration with academic and product teams, as well as mentoring interns, are also key aspects of the role. | Post-train, Pretrain | 9 |
| Senior Data Scientist - Security and Networking Research Senior Data Scientist role focused on AI cybersecurity, developing agentic AI systems, optimizing models, and leveraging data pipelines for NVIDIA's networking and data center security products. | Agent, Post-train | 9 |
| AI Computing Architect NVIDIA is seeking an AI Computing Architect to develop innovative architectures for deep learning performance and efficiency, analyze trade-offs using models and simulators, and prototype algorithms. The role requires strong programming skills, computer architecture background, and a foundation in machine learning. | Serve, Post-train | 9 |
| Senior Software Engineer - Agentic AI Senior Software Engineer role focused on leading Agentic AI solutions, including sophisticated AI agents and fine-tuning, integrating them with enterprise production systems. The role involves designing, developing, and deploying AI applications using LLMs, Agentic frameworks, and optimizing retrieval/generation algorithms for enterprise data (text, code, images) to build advanced AI applications for engineering assistants and multi-turn, multi-modal dialogue systems, ultimately solving complex problems in chip design. | Agent, Post-train | 9 |
| AI Workload and Networking Research Architect Research Architect role focused on optimizing AI workloads and networking infrastructure for NVIDIA's AI computing platforms, involving modeling, analysis, and influencing future product roadmaps. | Serve, Post-train | 9 |
| Senior AI Safety Red Teamer NVIDIA is seeking a Senior AI Safety Red Teamer to improve the safety and security posture of their AI models, systems, and infrastructure. The role involves hands-on safety and security research, developing tools to expose weaknesses, defining safety standards, and partnering with cross-functional teams. Requires 5+ years of experience in AI safety/security and offensive cybersecurity, with knowledge of AI vulnerabilities, LLMs, MLLMs, Generative AI, Agents, and RAG workflows, and strong Python programming skills. | Eval Gate, Agent | 9 |
| Senior Quantum AI Research Scientist, Applied Research NVIDIA is seeking a Senior Quantum AI Research Scientist to architect and build AI solutions for fault-tolerant quantum computing, focusing on quantum error correction, decoding, calibration, and beyond. The role involves researching and developing open AI models, datasets, and benchmarks, fine-tuning models for specific quantum systems, and collaborating with cross-functional teams to integrate AI into quantum supercomputers. | Post-train, Data | 9 |
| Senior Applied Deep Learning Scientist - Large Vision Language Models NVIDIA is seeking a Senior Applied Deep Learning Scientist to work on multimodal language models, specifically the Nemotron Omni family. The role involves pushing the boundaries of these models for downstream applications, preparing large-scale multimodal datasets, and collaborating globally to turn research into impactful products. The position spans the full pipeline from pre-training to post-training, with a focus on open models, weights, and data for real-world applications. | Post-train, Data | 9 |
| Senior LLM Agents Architect Senior LLM Agents Architect role focused on designing and building agentic AI systems to optimize GPU compute kernels, analyze architectural simulations, and drive improvements in hardware design and developer efficiency. The role involves hands-on CUDA programming, collaboration with hardware architects, and building automated agentic workflows for performance forensics and architectural studies. | Agent | 9 |
| Senior Performance Architect, Nemotron NVIDIA is seeking a Senior Performance Architect for Nemotron to focus on deep model-system-hardware co-design. The role involves developing high-fidelity performance models to evaluate architectural choices, predict deployment efficiency, and ensure Pareto-optimal trade-offs for future Nemotron models. This position will guide future software and hardware roadmaps by modeling end-to-end performance impact of GenAI workflows and collaborating with research, framework, compiler, and hardware teams. | Serve | 9 |
| Senior Machine Learning Engineer - Physical AI and Synthetic Data Generation NVIDIA is seeking a Senior Machine Learning Engineer to join their Physical AI team. The role focuses on architecting and developing generative pipelines for high-fidelity synthetic data using multimodal and diffusion models. Responsibilities include building and fine-tuning large-scale models, applying user controls for data synthesis, establishing quality assurance pipelines, and leading the generation of massive training datasets. The role requires deep technical knowledge in image/video synthesis, strong programming skills, and experience in assessing synthetic data impact on model performance. | Data, Post-train | 9 |
| Senior DL Algorithms Engineer - Inference Performance Senior engineer to optimize LLM/Omni model inference performance on NVIDIA's accelerated inference software stack, working across hardware and software layers. Involves enabling and optimizing open models, contributing code to frameworks like TRT-LLM and vLLM, profiling bottlenecks, and benchmarking. | Serve | 9 |
| Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026 NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to optimize neural network workloads on future NVIDIA platforms. | Serve | 9 |
| Senior Research Scientist, AI-Mediated Reality and Interaction Senior Research Scientist focused on AI-Mediated Reality and Interaction, creating interactive physical AIs for dynamic 4D worlds. Research areas include AI, neural rendering, graphics, generative modeling, LLMs, and human behavior. The role involves proposing and prototyping novel research, publishing in top venues, collaborating with teams, creating demonstrations, and participating in technology transfer to products like Isaac, Omniverse, and Metropolis. | Pretrain | 9 |
| Senior Software Engineer, AI Inference Systems Senior Software Engineer focused on building and optimizing AI inference systems for large-scale models, involving GPU kernel optimization, inference framework development (vLLM), benchmarking (MLPerf), and orchestration of distributed deployments. | Serve | 9 |
| Senior Research Engineer - AI Coding Tools Senior Research Engineer at NVIDIA focused on building and improving AI coding agents, fine-tuning code LLMs, designing evaluations, and developing interfaces for AI agents to interact with NVIDIA's developer tools. The role involves shipping novel agents and features, contributing to benchmarks, and generating synthetic data for AI-for-code applications. | Agent, Post-train | 9 |
| Principal High-Performance LLM Training Engineer NVIDIA is seeking a Principal Engineer to lead performance analysis and optimization of large-scale AI training and post-training workloads on NVIDIA's hardware and software stack. The role involves deep technical analysis across compute, memory, communication, and frameworks to improve efficiency and influence future roadmaps. | Pretrain, Post-train | 9 |
| Senior Software Engineer, AI Inference Systems Senior Software Engineer focused on building and optimizing AI inference systems, including vLLM, GPU kernels, and orchestration for large-scale model deployments. The role involves performance engineering, benchmarking (MLPerf), and potentially research integration. | Serve | 9 |
| Senior Tools Development Engineer This role focuses on building agentic infrastructure for test automation and quality engineering within the NVIDIA Omniverse platform. The engineer will design and deploy multi-agent systems, orchestration frameworks, and autonomous pipelines, with a strong emphasis on evaluating agent output quality and establishing observability for these workflows. The goal is to enable engineers to ship high-quality software with greater speed and confidence. | Agent, Eval Gate | 9 |