Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
| Title | Stage | AI score |
|---|---|---|
| Senior Systems Engineer, Neural Graphics Senior Systems Engineer role focused on integrating AI and traditional rendering techniques for real-time visual experiences. The role involves taking innovative techniques like AlpaDreams and driving them into production-ready, real-time pipelines, owning the end-to-end path from prototype to shipping product, and solving complex systems challenges related to latency, memory, and throughput. Requires deep expertise in graphics and AI, with a strong track record of shipping impactful products and experience with systems-level thinking. | ShipAgent | 8 |
| Senior Data and AI Solutions Engineer Senior Data and AI Solutions Engineer at NVIDIA to partner with engineering teams, transform data, BI, automation, and agentic AI into measurable engineering productivity gains. Develop and deploy production-grade AI and data solutions, including agentic systems and RAG pipelines, from concept to deployment. |
| AgentServe |
| 8 |
| Machine Learning Intern - AI Agents Conversational AI NVIDIA is seeking a Machine Learning Intern to support the development of AI agents for workflow automation and intelligent assistants. The role involves working on conversational AI, LLM applications, RAG systems, and speech AI workflows, building prototypes with NVIDIA AI technologies, and assisting with evaluation and deployment. The ideal candidate is pursuing a degree in a related field, has experience with machine learning, strong Python skills, and familiarity with Linux. | AgentServe | 8 |
| Machine Learning Intern - Multimodal Models Generative AI NVIDIA is seeking a Machine Learning Intern to support research and development of large language and multimodal models. The intern will work on model fine-tuning, parameter-efficient training, architecture exploration, experiments, benchmarking, evaluation, data analysis, and prototype development using NVIDIA AI platforms and GPU-accelerated tools. The role also involves collaborating with researchers and engineers on AI innovation projects and exploring opportunities for technical publications. | Post-trainAgent | 8 |
| Machine Learning Intern - 2026 NVIDIA is seeking a Machine Learning Intern to assist with developing demonstrations using NVIDIA SDKs, algorithmic development, and AI software development. The role involves keeping up with the latest NVIDIA technology, building demos, and engaging the AI community through workshops. | Serve | 8 |
| Solutions Architect - AI for Drug Discovery NVIDIA seeks a Solutions Architect for their EMEA team to drive AI adoption in drug discovery within the biopharma industry. The role involves acting as a technical advisor to pharmaceutical companies, biotechs, and research organizations, leveraging NVIDIA's computing platform. Responsibilities include building proof-of-concept demonstrations, scaling AI deployments, and supporting business development by guiding customers on production-grade inference, model training, RL, and post-training algorithms. The role also involves exploring foundation models, agentic LLM applications, and physical AI in biopharma, providing feedback to internal teams, and documenting/teaching NVIDIA solutions. | ServePost-train | 8 |
| Senior GPU System Architect Seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out systems for AI and HPC datacenters. The role involves defining system architectures that integrate GPU compute, memory, and interconnects for optimal AI performance and scalability. Requires deep experience in system-level fabric/networking architecture and hardware-software co-design. | Serve | 8 |
| Senior System Software Engineer Senior Software System Engineer to customize and expand NVIDIA’s autonomous driving solutions, delivering and scaling a foundation model-based end-to-end stack for autonomous driving. Involves applied research and development of deep learning models, issue triage, performance maintenance, and collaboration to build a unified production system. | ShipPost-train | 8 |
| Solution Architect, Generative AI NVIDIA is seeking a Solution Architect to promote adoption and provide technical support for their GPU-accelerated computing solutions, focusing on generative AI, machine learning, and deep learning for enterprise clients in Japan. The role involves pre-sales activities, technical support for model training and deployment, and developing solutions for inference and agent-based systems. | ServeAgent | 8 |
| Applied AI Engineer - Silicon Co-Design Group NVIDIA is seeking an Applied AI Engineer to design, develop, and integrate AI/LLM-powered systems into their chip design and automation infrastructure. The role involves architecting and implementing solutions to enhance workflow efficiency, scalability, and intelligence, driving initiatives from concept to deployment. Requires hands-on experience building and deploying ML/AI systems or data-intensive backend services, with a focus on owning AI agents or LLM-powered workflows end-to-end. | Agent | 8 |
| AI Research Engineer - Applied Scientist Compilers AI Research Engineer/Applied Scientist focused on Compilers/Low-level optimization to develop AI compiler solutions for NVIDIA's software stack and GPU acceleration. Responsibilities include applying AI to compilation, implementing AI-based solutions for GPU programming, building training pipelines (fine-tuning, RL), defining model I/O, developing evaluation frameworks, prompt engineering, integrating learned policies, prototyping models, creating datasets, and applying RL for optimization. | Post-trainServe | 8 |
| Senior Research Engineer, Robotics Systems Senior/Principal Engineer in robotics systems, focusing on foundation models and full-stack technology for humanoid robots. Responsibilities include designing teleoperation software, optimizing control stacks, deploying neural network models on hardware, and collaborating on the MLOps lifecycle. Requires strong robotics and software engineering background, with experience in real-time control and deploying ML models on robotic hardware. | ShipData | 8 |
| Senior Perception Engineer - Autonomous Vehicles Senior Perception Engineer at NVIDIA focused on developing and productizing autonomous driving solutions using deep learning and multi-sensor fusion. The role involves applied research, algorithm development, and ensuring solutions meet production requirements for safety, latency, and robustness. | ShipPost-train | 8 |
| Senior Software Engineer, Agentic AI Senior Software Engineer to develop core libraries for Agentic Applications, focusing on building foundational technology, scalable capabilities, reusable blocks, and high-quality libraries to accelerate developer productivity and ensure agent quality. The role involves benchmarking, identifying bottlenecks, and optimizing performance, cost, and latency for agents. Collaboration with teams on data pipelines, RAG, vector databases, and GPU-optimized workflows is expected. | Agent | 8 |
| Software Engineering Intern, AI Tools - Fall 2026 NVIDIA is seeking a Software Engineering Intern to join its AI Tools and Infrastructure team, focusing on Agentic AI. The intern will work with LLMs and orchestration frameworks to design, build, and deploy intelligent agents and AI tools, gaining exposure to the full lifecycle of agent development from conceptualization to deployment. | Agent | 8 |
| Solutions Architect - AI Technology Centre NVIDIA is seeking a Solutions Architect with expertise in AI for Chemistry and Materials Science to lead research and application efforts. The role involves driving collaborations, mentoring junior members, developing technical materials, and staying updated on AI advancements in the field. Requires a PhD or equivalent experience and proficiency in Python and AI/ML libraries like PyTorch. | Data | 8 |
| Applied AI Engineer - DFT Methodology NVIDIA is seeking an Applied AI Engineer to explore and architect generative AI solutions, including LLMs, RAGs, and Agentic AI workflows, for Design-for-Test (DFT) and VLSI problems. The role involves deploying predictive ML models for silicon lifecycle management and collaborating with VLSI/DFX teams to integrate AI solutions. Experience in applied ML for chip design and deploying generative AI for engineering use cases is required. | Agent | 8 |
| Senior Deep Learning Performance Architect Senior Deep Learning Performance Architect at NVIDIA to design and evaluate hardware architectures for AI/HPC applications, focusing on LLM inference and training performance, and optimizing system bottlenecks. | ServePost-train | 8 |
| Senior Data Center Performance Engineer - Benchmarking and Optimization Senior Data Center Performance Engineer at NVIDIA focused on benchmarking and optimizing data center platforms for AI training, inference, and HPC workloads. Responsibilities include designing benchmarks, characterizing workloads, identifying bottlenecks, and driving performance improvements through system tuning and architectural recommendations. | Serve | 8 |
| Senior GenAI Engagement Lead, Partner Platforms This role focuses on driving the technical integration of Generative AI software with enterprise partners, involving hands-on design and deployment of RAG, LLM inference, and Multi-Agent workflows. The position requires deep technical expertise in AI/ML, partner engagement, and understanding of the GenAI lifecycle, with a focus on production deployments and influencing product roadmaps. | AgentServe | 8 |
| NCX Engineer, AI Accelerator This role focuses on engineering and deploying AI infrastructure and solutions for strategic customers, optimizing large-scale training and inference workloads on NVIDIA's AI platform. It involves MLOps, Kubernetes, GPU scheduling, and performance tuning, with a strong emphasis on customer-facing technical support and collaboration. | ServePost-train | 8 |
| Senior HPC and AI Networking Performance Research and Analysis Engineer Research Engineer focused on analyzing and optimizing the performance of large-scale distributed LLM training and inference on GPU clusters, with a strong emphasis on networking aspects. | PretrainServe | 8 |
| Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026 NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to optimize neural network workloads on future NVIDIA platforms. The role involves building and maintaining high-performance runtime and compiler components, defining workload mappings, integrating with the SW ecosystem, benchmarking, profiling, and collaborating with hardware teams. It also includes prototyping new compilation techniques and publishing technical work. | Serve | 8 |
| Senior AI Solutions Architect NVIDIA is seeking an AI Solutions Architect with deep expertise in AI solutions and scalable data center infrastructure. The role involves embedding NVIDIA software into customer architectures, improving application performance, and establishing technical foundations for next-generation AI systems. Responsibilities include supporting business development, working directly with developers and customers, analyzing architectures for acceleration opportunities, and delivering trainings. | ServeAgent | 8 |
| Senior Deep Learning Framework Communications Engineer Senior Deep Learning Framework Communications Engineer at NVIDIA, focusing on integrating and optimizing communication libraries (NCCL, NVSHMEM) within AI frameworks (PyTorch, TRT-LLM, vLLM, JAX) to enhance performance for large-scale AI training and inference. The role involves deep analysis of AI workloads, compiler improvements, and kernel authoring for multi-GPU systems. | Serve | 8 |
| Senior Scientific Machine Learning Engineer – Earth-2 Develops and enhances machine learning frameworks (NVIDIA PhysicsNeMo, NVIDIA Earth2Studio) for scientific ML technology in weather, climate, and earth system modeling. Focuses on implementing new deep learning techniques and enhancing Earth-2 technologies. | Post-train | 8 |
| Senior Solutions Architect, Generative AI Data Processing Senior Solutions Architect role focused on assisting customers in deploying Generative AI solutions, particularly for data processing and agentic workflows, using NVIDIA's AI technology stack. The role involves technical advisory, system design, and implementation at scale, with a strong emphasis on Deep Learning, LLMs, and GPU technologies. | AgentServe | 8 |
| Director, System Software Engineering - Metropolis Accelerated and Inferencing Software NVIDIA is seeking a Director of System Software Engineering to lead teams responsible for the full lifecycle of Vision AI strategy, from model onboarding to production deployment. The role focuses on transforming foundation models into real-time, GPU-accelerated video intelligence systems, scaling multimodal reasoning, and enabling agentic development workflows. Key responsibilities include architecting and operationalizing inference acceleration, driving implementations of frameworks like TensorRT and VLLM, collaborating with partners on custom models, and ensuring performance benchmarking. The ideal candidate has extensive experience in deep learning, GPU optimization, and leading engineering teams in embedded and enterprise platforms. | ServeAgent | 8 |
| Director, Isaac for Healthcare Engineering Director of Engineering for NVIDIA's Isaac for Healthcare initiative, focusing on building a platform for healthcare robotics companies to develop, simulate, train, and deploy physical AI systems. The role involves platform leadership, team building, partner enablement, technical strategy, and cross-functional collaboration, with a strong emphasis on shipping sophisticated software platforms at scale. | ShipData | 8 |
| Manager, Solutions Architecture - Global Partner Team Manager of Solutions Architecture for NVIDIA's Global Partner Team, focusing on leading technical engagements with GSIs and AI consulting firms. The role involves building and scaling Agentic AI services, providing architectural oversight for complex AI workflows, and collaborating with product and engineering teams. Requires deep technical expertise in Generative/Agentic AI, RAG, LLM orchestration, and AI infrastructure. | Agent | 8 |
| Senior Software Architect - Deep Learning and HPC Communications Senior Software Architect role at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software for AI and HPC workloads. The role involves investigating performance bottlenecks, developing new communication technologies, exploring hardware/software co-design, and building proofs-of-concept to drive innovation in large-scale GPU clusters. | Serve | 8 |
| Senior Solutions Architect - Deep Learning Senior Solutions Architect focused on Deep Learning and Agentic AI tools, collaborating with customers to build solutions using NVIDIA technology. Responsibilities include technical sales support, integrating NVIDIA tech into HPC, championing Deep Learning internally, and developing demo solutions. | ServeAgent | 8 |
| SOC AI Application Engineer — AI Services, Agents and Knowledge Systems NVIDIA is seeking an AI Engineer to build and operate AI application-layer services for SOC design automation, including assistants, retrieval, Q&A, workflow automation, and AI agents. The role involves designing LLM-backed services, building RAG and knowledge systems, applying agent and orchestration patterns, improving developer experience with AI-assisted coding, and owning reliability and evaluation. Requires strong Python, experience shipping services, and hands-on use of LLM frameworks and RAG. | AgentServe | 8 |
| Director, Product Platform Retail and CPG Industries NVIDIA is seeking a Director to define and build the Retail & CPG Industries product platform. This role involves architecting and developing a platform leveraging NVIDIA's full stack, including Agentic AI and accelerated computing, to reshape digital commerce, supply chains, and intelligent stores. The platform will utilize NVIDIA Nemo microservices and Nemotron models, with a focus on Agentic AI for various business functions. The ideal candidate will have hands-on development experience with AI agents, LLMs, RAG, and distributed systems, and will collaborate with engineering teams to deliver a production-ready, scalable platform. | AgentServe | 8 |
| Senior Solutions Architect - AI Factory Deployment Senior Solutions Architect focused on deploying and validating AI factories, specifically running and debugging AI/LLM workloads on GPU clusters. Responsibilities include setting up environments, executing benchmarks, resolving performance issues, building observability, and recommending optimizations. | Serve | 8 |
| Senior Software Engineer, Deep Learning Inference Senior Software Engineer focused on optimizing deep learning inference for LLMs and omnimodal architectures on NVIDIA hardware, including GPU kernel tuning, distributed inference, and contributing to open-source libraries. | Serve | 8 |
| Senior Hardware Architect, Deep Learning GPU and System Senior Hardware Architect role focused on designing next-generation GPUs and systems to advance the state of AI, analyzing deep learning workloads, and proposing new features for acceleration. Requires 8+ years of experience in performance, hardware architecture, and deep learning analysis. | Serve | 8 |
| Solutions Architect - AI Development Solutions Architect role focused on leading AI research and application, mentoring technical members, and fostering AI ecosystem development using NVIDIA technologies. Requires expertise in AI research areas like Digital Twins, Synthetic Data Generation, Immersive Multimedia, and Generative AI, with a strong background in AI model training and Python/PyTorch. | DataPost-train | 8 |
| Senior Software Engineer - VLM Microservices for Neural Reconstruction Senior Software Engineer to design, build, and optimize containerized inference execution for 3D Vision Language Models (VLMs) for neural reconstruction, turning research into production-grade software (NIMs). The role involves developing benchmarks, releasing and maintaining models, contributing to open-source projects like vLLM, and collaborating with research and product teams. Requires experience with AI distributed systems, inference platforms, Python/C++, and software engineering fundamentals. | ServePost-train | 8 |
| AI Computing Software Development Engineer, TensorRT NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust inferencing software for GPUs, focusing on performance analysis, optimization, and tuning. The role involves collaborating with various teams to guide machine learning inferencing direction and potentially publishing key results. | Serve | 8 |
| Senior Applied Machine Learning Engineer - VLSI Design NVIDIA is seeking a Senior Applied Machine Learning Engineer to build AI-driven software systems for circuit design, combining automation algorithms, DL models, and agentic workflows. The role involves working on pre-silicon and post-silicon hardware design data, circuit optimization, and AI systems for EDA/design automation, translating requirements into AI/ML and agentic system problems, and testing/releasing models and AI systems. | AgentData | 8 |
| Applied Machine Learning Engineer, Circuit Design - New College Grad 2026 NVIDIA is seeking an Applied Machine Learning Engineer for their Circuit Design team, focusing on building AI-driven software systems that combine automation algorithms, DL models, and agentic workflows to accelerate end-to-end circuit design. The role involves working with hardware design data, circuit optimization, and developing AI/ML solutions for EDA, with a focus on agent-driven design exploration and optimization. | Agent | 8 |
| Senior Solutions Architect, Generative AI Senior Solutions Architect role focused on customer engagements for NVIDIA's generative AI technologies, involving AI model training and deployment optimization, particularly for LLMs and recommenders in the consumer internet industry. Requires strong coding, GPU optimization, and communication skills. | ServeData | 8 |
| Principal AI and ML Infra Software Engineer, GPU Clusters This role focuses on enhancing the efficiency of AI and ML research on GPU clusters by collaborating with researchers to identify and address infrastructure deficiencies. The engineer will optimize performance, monitor resource utilization, and contribute to the AI/ML infrastructure ecosystem, keeping up-to-date with the latest AI/ML technologies. | Serve | 8 |
| Principal Cloud Services Software Engineer NVIDIA DGX Cloud Team is seeking a Principal Cloud Services Software Engineer to develop and optimize AI infrastructure services for large-scale AI training workflows. The role involves designing and implementing resilient, efficient services orchestrated by Kubernetes, with a focus on backend development, distributed systems, and high-performance computing. | ServeAgent | 8 |
| Senior Deep Learning Software Engineer - Autonomous Vehicles Senior Deep Learning Software Engineer focused on developing and productizing deep learning solutions for autonomous vehicles. The role involves training, fine-tuning, optimizing perception DNNs, applying quantization, improving DNN architectures, and enhancing inference speed and power consumption. It requires strong programming skills, experience with deep learning frameworks, computer vision tasks, and familiarity with CNNs and Transformer architectures. Experience with low precision inference, quantization, and NVIDIA software libraries is a plus. | ServePost-train | 8 |
| Compiler Engineer - AI Inference NVIDIA is seeking an AI Compiler Engineer to optimize kernel generation and computational graph optimizations for AI inference and training workloads on next-generation GPUs. The role involves hands-on development, collaboration on hardware/software co-design, and scaling AI deployments in datacenters. | ServePost-train | 8 |
| Senior Software Engineer, Metropolis Vision AI Senior Software Engineer to develop and optimize high-performance Vision AI pipelines and large-scale distributed services for processing video, image, and 3D data. The role involves crafting real-time systems, developing multi-modal perception, using simulation/synthetic data, and profiling/tuning GPU-accelerated inference pipelines. Collaboration with research and platform teams is key, with an emphasis on bringing research into production at scale. | ServePost-train | 8 |
| Senior Software Engineer, AI Networking Senior Software Engineer role focused on building and productizing ML tools for optimizing AI workloads (LLM training/inference) across GPU/CPU clusters, with a focus on networking and system resource utilization. Involves distributed deep learning, ML-based optimization techniques, and performance analysis. | ServeAgent | 8 |
| Machine Learning Intern - 2026 NVIDIA is seeking a Machine Learning Intern to assist with AI technology development and demonstrations. The intern will work with NVIDIA SDKs, engage with the AI community, and contribute to machine learning projects and AI software development. | Serve | 8 |