Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Applied AI Researcher - World Reconstruction and Generation NVIDIA is seeking an Applied AI Researcher to work on NuRec-related research in world reconstruction and generation, developing and adapting Deep Learning-based methods for tasks like novel view synthesis, generative modeling, and neural rendering. The role involves prototyping with Python/PyTorch, building evaluation and agentic AI-assisted research workflows, and turning research into usable technology. Requires a PhD or MS with significant experience in ML/DL, Computer Graphics, Computer Vision, or 3D reconstruction, with strong Python/PyTorch skills. | Post-trainAgent | 9 |
| Senior Applied Deep Learning Scientist - Large Vision Language Models NVIDIA is seeking a Senior Applied Deep Learning Scientist to work on multimodal language models, specifically the Nemotron Omni family. The role involves pushing the boundaries of these models for downstream applications, preparing large-scale multimodal datasets, and collaborating globally to turn research into impactful products. The position spans the full pipeline from pre-training to post-training, with a focus on open models, weights, and data for real-world applications. |
| Post-trainData |
| 9 |
| Senior Software Engineer, AI Inference Systems Senior Software Engineer focused on building and optimizing AI inference systems for large-scale models, involving GPU kernel optimization, inference framework development (vLLM), benchmarking (MLPerf), and orchestration of distributed deployments. | Serve | 9 |
| Research Scientist, 3D Computer Vision Research Scientist role focused on 3D computer vision and deep learning, involving novel technique research, publication, and collaboration. Requires a Ph.D. and a strong publication record in top venues. | Pretrain | 9 |
| Research Scientist, 3D Computer Vision Research Scientist role focused on 3D computer vision and deep learning for scene understanding, pose estimation, and localization. The role involves publishing original research, mentoring, and collaborating with productization teams. Requires a Ph.D. and a strong publication record in top computer vision venues. | Post-train | 9 |
| Senior HPC and AI Network Software Architect NVIDIA is seeking a Senior HPC and AI Network Software Architect to design and build scalable AI infrastructure for distributed training and inference. The role involves developing software and hardware approaches to optimize communication efficiency and performance across large-scale systems, collaborating with AI framework teams and hardware teams. | ServePost-train | 9 |
| Senior Deep Learning Engineer Senior Deep Learning Engineer at NVIDIA to optimize and deploy foundation models for physical AI applications (AVs, robots, video analytics) on GPU platforms, focusing on high-performance inference. | ServePost-train | 9 |
| Deep Learning Engineer - LLM and VLM Model Compression NVIDIA is seeking a Deep Learning Engineer with 8+ years of experience to build deep learning frameworks for LLM and VLM model compression. The role involves designing and implementing algorithms for pruning, NAS, and distillation, experimenting with model compression, and collaborating with researchers. Experience with PyTorch, LLM/VLM training or inference, and DL fundamentals are required. Experience with model compression techniques, building DL frameworks, and GPU programming are preferred. The role is based in Poland or Switzerland, with a salary range of 292,500 PLN - 650,000 PLN. | Post-trainServe | 9 |
| Senior GPU Networking Architect This role focuses on building and optimizing GPU communication kernels for large-scale AI systems, linking GPU computing with networking. The Senior GPU Networking Architect will leverage deep knowledge of GPU architecture to improve kernel efficiency, minimize latency, and overlap computation with communication. Responsibilities include developing GPU-resident communication primitives, profiling and tuning kernels, and collaborating with various teams to co-design communication strategies. The role requires strong CUDA programming, GPU architecture fundamentals, and systems-level C/C++ development. | Serve | 9 |
| Robotics Research Intern - 2026 Robotics Research Intern at NVIDIA focusing on fundamental and applied research across the full robotics stack, including perception, planning, control, reinforcement learning, imitation learning, and simulation. The goal is to transform research paradigms, transfer into products, and create new markets. | AgentData | 9 |
| Senior Manager, Interactive World Model Platforms Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams) into an industry standard, focusing on production engineering, performance, and developer/researcher success across AV, robotics, rendering, and simulation. | ShipServe | 8 |
| Solutions Architect - AI for Drug Discovery NVIDIA seeks a Solutions Architect for their EMEA team to drive AI adoption in drug discovery within the biopharma industry. The role involves acting as a technical advisor to pharmaceutical companies, biotechs, and research organizations, leveraging NVIDIA's computing platform. Responsibilities include building proof-of-concept demonstrations, scaling AI deployments, and supporting business development by guiding customers on production-grade inference, model training, RL, and post-training algorithms. The role also involves exploring foundation models, agentic LLM applications, and physical AI in biopharma, providing feedback to internal teams, and documenting/teaching NVIDIA solutions. | ServePost-train | 8 |
| Senior HPC and AI Networking Performance Research and Analysis Engineer Research Engineer focused on analyzing and optimizing the performance of large-scale distributed LLM training and inference on GPU clusters, with a strong emphasis on networking aspects. | PretrainServe | 8 |
| Senior Software Engineer, JAX Senior Software Engineer to develop NVIDIA's AI platform, focusing on performance optimizations in deep learning frameworks using JAX. The role involves designing and implementing JAX core components, driving performance on NVIDIA products, and building tools to increase efficiency for AI-based systems. | Serve | 8 |
| Principal AI Developer Technology Engineer Seeking a Principal Developer Technology Engineer to research and develop techniques for GPU acceleration of AI workloads, focusing on performance optimization of deep learning and HPC algorithms on modern CPU and GPU architectures. This role involves collaborating with internal teams and the developer community, influencing hardware/software design, and publishing findings. | Serve | 8 |
| Solution Architect, Financial Services Solutions Architect for Financial Services at NVIDIA, focusing on guiding customers in leveraging NVIDIA's AI technologies, particularly in areas like model distillation, domain adaptation, reinforcement learning, and post-training algorithms. The role involves technical advocacy, collaborative innovation, and knowledge sharing within the financial services sector, requiring expertise in AI frameworks, Python, distributed computing, and the AI model lifecycle. | Post-trainPretrain | 8 |
| Solutions Architect – AI Factory Solutions Architect role focused on designing, building, and operationalizing large-scale AI factories and GenAI/Agentic AI solutions for enterprise customers, leveraging NVIDIA's technology stack. This involves hands-on work with compute, networking, software, and cluster management tools. | Agent | 8 |
| Senior Deep Learning Compiler Engineer - PyTorch Senior Deep Learning Compiler Engineer to develop and optimize PyTorch models for NVIDIA GPUs using compiler technology like Thunder, TorchDynamo, and TorchInductor. Focus on performance analysis and contributing to open-source AI ecosystem. | Serve | 8 |
| Senior Software Architect - Deep Learning and HPC Communications Senior Software Architect at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software to accelerate AI and HPC workloads. The role involves investigating performance bottlenecks, exploring HW/SW co-design, building proofs-of-concept, and performing quantitative modeling for large GPU clusters. | Serve | 8 |
| Senior Software Architect - Deep Learning and HPC Communications This role focuses on architecting and implementing next-generation communication software and platforms for deep learning and high-performance computing (HPC) applications, specifically targeting the efficient scaling of GPU clusters. The work involves identifying performance bottlenecks, designing new communication technologies, exploring hardware/software co-design, and using simulation to evaluate performance at massive scales. | Serve | 8 |
| Senior Software Architect, GPU Networking Research NVIDIA is seeking a Senior Software Architect to focus on GPU Networking Research for accelerating AI workloads and building AI data centers. The role involves leading vision, architecture, design, and proof-of-concept development for future GPU Networking offerings, identifying new technologies, and working with the community. Requires M.Sc./Ph.D. or equivalent experience, 8+ years in systems architecture, and experience in virtualization, networking, storage, and OS drivers. Experience in performance profiling, optimization, and HW offloads is crucial. A research track record and knowledge of Deep Learning frameworks are desirable. | Serve | 7 |
| Principal Simulation Engineer, Industrial Physics and Robotics NVIDIA is seeking a Principal Simulation Engineer to lead the development of advanced physically based simulation systems for robotics and industrial digital twins. This role requires deep expertise in multibody dynamics, contact, friction, and flexible bodies, with a focus on integrating simulation with robotics workflows and applying modern AI-assisted and agentic development. The ideal candidate has a track record of building production-level simulation software and experience validating simulators against physical systems. | Agent | 7 |
| Senior HPC AI Cluster Engineer NVIDIA is seeking an experienced HPC-AI Engineer to join their Networking Clusters Solutions Infrastructure team. The role involves designing, implementing, and maintaining large-scale HPC/AI clusters, managing job schedulers, developing CI/CD pipelines, and automating infrastructure deployment and monitoring. The engineer will work with cutting-edge hardware and software, support R&D, and engage in POCs for future improvements. | Serve | 7 |
| Senior Systems Software Engineer - GPU Performance at Scale Senior Systems Software Engineer focused on GPU performance at scale for AI workloads. This role involves leading performance practices, aligning AI workloads with hardware, developing insights into AI workload performance, debugging complex issues, and collaborating with various software and firmware teams to optimize AI workload performance on NVIDIA GPUs. | Serve | 7 |
| Solution Architect, Financial Services NVIDIA is seeking a Solutions Architect for Financial Services to act as a trusted technical advisor to customers, enabling their productivity and driving adoption of NVIDIA's AI technologies. The role involves working with financial institutions, providing technical guidance, developing solution prototypes, and staying updated on industry trends. Requires a BS/MS/PhD in a technical field, 5+ years of AI experience, financial services background, and expertise in coding for NVIDIA GPUs. | Serve | 7 |
| Senior Software Developer Senior Software Developer to work on an open-source AI networking acceleration library, focusing on performance-oriented low-level infrastructure for inference, utilizing hardware offloads, GPU Kernels, and RDMA. Requires strong C++/C/Rust, Linux, and networking stack knowledge, with advantages in LLM inference, distributed storage, Linux internals, CUDA, and parallel programming. | Serve | 7 |
| Senior Software Developer, AI Networking Senior Software Developer focused on AI Networking at NVIDIA, developing communication frameworks, production tools, and benchmarks for large-scale AI training and inference systems. The role involves enabling new AI models, analyzing workloads, designing automation, and collaborating with hardware teams. | ServeData | 7 |
| Senior Manager, AlpaSim & AlpaDreams Production Seeking an engineering leader to productize neural simulation technologies (AlpaDreams, AlpaSim) for AV and robotics. This role involves building and scaling a team, hardening research into a production-quality platform, and establishing an open-source standard. Requires strong technical depth in ML, distributed systems, and production infrastructure, with a proven track record of shipping research-stage systems as scalable products. | ShipPretrain | 7 |
| Senior Networking Solution Test Engineer – AI Cluster Debugging Senior Networking Solution Test Engineer focused on debugging large-scale AI clusters, NVLink, Ethernet, and InfiniBand. The role involves designing tests, building testbeds, end-to-end troubleshooting, collaborating with development teams on networking components, and profiling deep learning workloads. | Serve | 7 |
| Senior Solutions Architect, HPC and AI Senior Solutions Architect focused on deploying, debugging, and optimizing large-scale AI training and inference workloads on GPU clusters. The role involves collaborating with internal teams and external customers to solve complex HPC and AI challenges, focusing on performance, stability, and scaling of AI workloads. | ServeData | 7 |
| Senior Libraries Engineer – AI and HPC Senior Libraries Engineer at NVIDIA focused on building and optimizing GPU/CPU accelerated data processing software libraries for AI, data analytics, computer vision, and scientific simulations. The role involves developing scalable library software, performance tuning, optimization, and providing technical leadership. | Serve | 7 |