Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in serving infrastructure (54%), agents (21%), and application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in the United States (286 roles), China (59 roles), Israel (50 roles), and Germany (21 roles).
Job postings at NVIDIA most frequently reference model serving, inference infra, agent orchestration, LLM observability, and multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles, a 50% decrease versus the prior 30 days (218 → 110).
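
The 30-day change quoted above can be reproduced with a short calculation. This is a minimal sketch, not the index's own method: the function name `percent_change` is hypothetical, and the only inputs taken from the text are the two counts, 218 and 110.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the relative change from `previous` to `current`, in percent."""
    if previous == 0:
        # A relative change is undefined when the baseline is zero.
        raise ValueError("previous count must be non-zero")
    return (current - previous) / previous * 100.0


# Counts taken from the text: 218 postings in the prior 30 days, 110 in the last 30.
change = percent_change(218, 110)
print(f"{change:.1f}%")       # unrounded value
print(f"{round(change)}%")    # rounded to the whole percent shown in the text
```

The unrounded figure is slightly under 50 percent in magnitude; the headline number is that value rounded to a whole percent.
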
| Title | Stage | AI score |
|---|---|---|
| Senior System Software Engineer – Embedded AI Inference Senior Software Engineer to develop production automotive software for AI inference and agent orchestration in C++ for embedded platforms. Focus on building next-generation automotive software applications, including in-car agentic AI and inference of LLM/VLM/VLA models on NVIDIA GPUs. | Serve, Agent | 8 |
| Senior Machine Learning Applications and Compiler Engineer, LPX Develops algorithms and optimizations for NVIDIA's LPX inference and compiler stack, focusing on mapping neural network workloads onto future NVIDIA platforms and optimizing end-to-end inference performance. Requires strong software engineering, compiler/runtime development, and deep learning framework experience. | Serve | 8 |
| DL System Software Engineer - AI Platform NVIDIA is seeking a DL System Software Engineer to develop an AI Platform for efficient inference and training of large-scale models on GPU clusters. The role involves designing and building solutions for scheduling workloads, resource management, and performance optimization, working with various NVIDIA AI technologies. | Serve, Post-train | 8 |
| Senior Software Engineer, TensorRT-LLM NVIDIA is seeking a Senior Software Engineer for its TensorRT-LLM team to develop and scale inferencing software for LLMs and Generative AI. The role involves crafting robust inferencing software, performing benchmarking and profiling for GPU applications, writing high-quality Python code for LLM inference, and improving the TensorRT-LLM library. Collaboration with software, research, and product teams is key. | Serve | 8 |
| Senior Software Engineer – TensorRT Edge-LLM Senior Software Engineer to develop and optimize a state-of-the-art inference framework for Large Language, Vision-Language, and Multimodal models on edge and embedded platforms, focusing on real-time performance and constrained environments. | Serve | 8 |
| Senior Performance Engineer - Deep Learning Senior Performance Engineer at NVIDIA focused on optimizing Deep Learning models and frameworks (PyTorch, JAX) for NVIDIA GPUs. The role involves building and supporting Transformer Engine, collaborating on systems research for performance improvements, implementing and benchmarking new DL models, contributing to MLPerf, and engaging with the open-source community and enterprise customers. It also involves influencing future hardware and software design. | Serve, Post-train | 8 |
| Senior System Software Engineer, 3D Computer Vision Senior System Software Engineer focused on 3D Computer Vision at NVIDIA, involving the development and deployment of advanced neural reconstruction models for generating 3D scenes. The role requires strong programming skills in Python and C/C++, a background in computer vision and deep learning, and experience with production-grade software development. | Post-train, Serve | 8 |
| Senior AI Research Scientist, Robotics Digital Twins Senior AI Research Scientist role focused on developing digital twins for chemical, biological, and physical laboratories, integrating AI agents with science experiments, and collaborating with robotics and software engineers. Requires a Ph.D. and 5+ years of AI research experience in robotics. | Agent | 8 |
| Senior Software Engineer, Quantized Inference Senior Software Engineer focused on optimizing quantized inference for LLMs by implementing recipes, developing kernels, and collaborating on inference engines like vLLM and TRT-LLM. The role involves model export pipelines, benchmarking, and data analysis tooling. | Serve | 8 |
| Senior Compiler Engineer, AI Inference Performance NVIDIA is seeking a Senior Compiler Engineer to optimize AI inference performance for their Deep Learning & AI Compiler (DLC) team. The role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and architecture teams to accelerate next-generation deep learning software for various AI applications. | Serve | 8 |
| Senior Compiler Engineer, AI Inference Platforms NVIDIA is seeking a Senior Compiler Engineer to join its Deep Learning & AI Compiler (DLC) team. The role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and architecture teams to accelerate AI inference performance on NVIDIA GPUs. The compiler is critical for data centers, personal devices, automotive, and robotics, focusing on inference performance, build time, memory footprints, and ease of use. | Serve | 8 |
| Research Scientist, Security and Privacy - PhD New College Grad 2026 Research Scientist focused on security and privacy for AI systems, aiming to develop hardware, software, and algorithms for trustworthy AI with verifiable protection. Requires a PhD and expertise in areas like computer architecture, programming languages, applied cryptography, or AI/ML algorithms, with a strong publication record. | Post-train | 8 |
| Principal GenAI Engagement Lead, Partner Platforms This role focuses on driving the technical integration of NVIDIA's Generative AI software with enterprise partners, including ISVs and CSPs. The Principal GenAI Engagement Lead will build trusted relationships, accelerate adoption, and influence product direction by designing and shipping methodologies, code, and reference architectures for RAG, LLM inference, and Multi-Agent workflows. The role requires a strong background in AI/ML, deep learning, and enterprise-grade GenAI systems, with experience in various LLM application stages and MLOps. The individual will act as the key technical lead, ensuring the deployment of robust, scalable GenAI solutions. | Agent, Serve | 8 |
| Developer Technology Engineer, AI NVIDIA Developer Technology Engineer focused on optimizing core parallel algorithms and data structures for GPUs, specifically working with LLM training frameworks and performance optimization. Collaborates with application developers and internal NVIDIA teams to improve performance and developer efficiency. | Data | 8 |
| Solutions Architect - CPU and LPU NVIDIA Solutions Architect focused on optimizing AI inference workloads across CPU, GPU, and LPU platforms for customers. The role involves technical expertise, proof-of-concept development, and optimizing AI efficiency in heterogeneous environments. | Serve, Agent | 8 |
| Principal AI Developer Technology Engineer This role focuses on researching and developing techniques to accelerate AI workloads (deep learning, machine learning) on advanced computer architectures, specifically GPUs. The engineer will perform in-depth analysis and optimization of complex AI and HPC algorithms, publish findings, and influence future hardware/software design. Requires deep C/C++ programming, parallel programming (CUDA, etc.), low-level performance optimization, and CPU/GPU architecture expertise. | Serve | 8 |
| Principal AI Developer Technology Engineer Seeking a Principal Developer Technology Engineer to research and develop techniques for GPU acceleration of AI workloads, focusing on performance optimization of deep learning and HPC algorithms on modern CPU and GPU architectures. This role involves collaborating with internal teams and the developer community, influencing hardware/software design, and publishing findings. | Serve | 8 |
| Solution Architect, Financial Services Solutions Architect for Financial Services at NVIDIA, focusing on guiding customers in leveraging NVIDIA's AI technologies, particularly in areas like model distillation, domain adaptation, reinforcement learning, and post-training algorithms. The role involves technical advocacy, collaborative innovation, and knowledge sharing within the financial services sector, requiring expertise in AI frameworks, Python, distributed computing, and the AI model lifecycle. | Post-train, Pretrain | 8 |
| AI Chip Design Engineer - New College Grad 2026 NVIDIA is seeking an AI Chip Design Engineer to develop and integrate AI capabilities into verification tasks. The role involves creating AI agents to enhance productivity, building production infrastructure for these agents, and optimizing algorithms for enterprise data. Requires strong proficiency in LLM libraries, GPU/CPU architectures, and HW verification methodologies. | Agent | 8 |
| Senior Solutions Architect – Simulation Solutions 3D Reconstruction This role focuses on developing and scaling AI platforms for simulation and 3D reconstruction, particularly within the Omniverse ecosystem. The Senior Solutions Architect will act as a technical advisor, prototype solutions, implement intricate technical systems, provide technical enablement, and advocate for partner needs. The role requires expertise in AI, systems knowledge, autonomous systems, simulation, generative AI, Python, C++, DL/RL frameworks, computer vision, and 3D reconstruction. | Agent, Serve | 8 |
| AI Chip Design Engineer - New College Grad 2026 NVIDIA is seeking an AI Chip Design Engineer to develop and integrate AI capabilities into verification tasks, focusing on building and maintaining infrastructure for AI agents that process large codebases and optimize verification flows. The role involves developing retrieval and generation algorithms, integrating AI optimizations, and working with HW engineering teams. | Agent | 8 |
| Solution Architect, Energy NVIDIA is seeking a Solution Architect with deep expertise in AI solutions to drive the efficient use of compute platforms in the Energy Industry. The role involves being a trusted technical advisor to developers and customers, embedding NVIDIA software, improving application performance, and establishing the foundation for next-generation AI systems. Responsibilities include supporting business development, working directly with customers, assisting in the adoption of NVIDIA software, analyzing architectures for acceleration opportunities, providing feedback to engineering teams, and delivering trainings and demonstrations. Requires an MS/PhD in a technical field, 5+ years of experience in AI/ML/DL/NLP/Generative AI, and 5+ years of industrial experience in power grid software and advanced ML for grid operations. Familiarity with accelerated computing, GPU systems, Python/C/C++, major AI frameworks, containers, and version control is essential. Experience designing and building complex AI/ML solutions, and reasoning across various system components is also required. | Serve | 8 |
| Senior Tools Development Engineer NVIDIA is seeking a Senior Tools Development Engineer to build agentic infrastructure for test automation and quality engineering on the Omniverse platform. The role involves designing and deploying multi-agent systems, orchestration frameworks, and evaluation systems to improve software quality and reliability. | Agent | 8 |
| Senior Software Engineer, 3D/4D Reconstruction Senior Software Engineer at NVIDIA focused on 3D/4D reconstruction for autonomous driving products. The role involves building and optimizing systems using deep learning, computer vision, and generative AI techniques, including large geometry models, Gaussian splatting, and diffusion models. Responsibilities include developing reconstruction systems, inventing evaluation methods, creating visualization tools, building automated workflows, and optimizing neural network performance for training and deployment. The goal is to improve reconstruction fidelity and simulation realism for end-to-end driving models. | Agent, Eval Gate | 8 |
| Developer Relations Manager – AI Natives NVIDIA is seeking a Developer Relations Manager to engage with AI-native companies, helping them design, optimize, and scale their AI platforms on NVIDIA technologies. The role involves advising founders and engineering teams on building agentic systems, AI copilots, and multimodal applications, with a focus on accelerating training, optimizing inference, and delivering AI experiences. The ideal candidate has deep technical expertise in AI systems, developer platforms, and large-scale inference infrastructure. | Serve, Agent | 8 |
| Senior AI Performance and Efficiency Engineer Senior AI/ML Performance and Efficiency Engineer focused on optimizing GPU cluster performance for AI/ML researchers by addressing infrastructure and application bottlenecks. This role involves building tools, analyzing efficiency, and collaborating across teams to improve hardware, software, and infrastructure usage for various ML workloads like Robotics, Autonomous vehicles, LLMs, and Videos. | Serve | 8 |
| Senior AI Developer Technology Engineer Senior Developer Technology Engineer focused on researching and developing techniques to GPU accelerate AI workloads, optimizing performance on modern CPU and GPU architectures, and collaborating with the developer community and internal teams to influence next-generation hardware and software design. | Serve | 8 |
| Senior AI Formal Verification Engineer Senior AI Formal Verification Engineer to enhance in-house formal tools with AI, leveraging LLMs and ML to automate intent-to-proof workflows and debug complex chips. Role involves architecting methodologies, developing AI agents, and creating AI-based debug assistants. | Agent, Serve | 8 |
| Engineering Manager, AI Developer Technology Engineering Manager for NVIDIA's AI Developer Technology team, focused on leading a team to optimize and develop algorithms for Deep Learning and Machine Learning applications, influencing next-generation hardware/software, and collaborating with customers and internal teams. The role involves optimizing training and inference performance on NVIDIA hardware. | Serve, Post-train | 8 |
| Senior Developer Technology Engineer - AI Senior Developer Technology Engineer focused on researching and optimizing AI/ML workloads for GPU acceleration, involving deep analysis, performance tuning, and collaboration with the developer community and internal teams to influence next-generation hardware and software design. | Serve | 8 |
| Senior Design Automation Engineer, Applied AI NVIDIA is seeking an Applied AI Engineer to lead end-to-end solution development for timing and constraint analysis workflows in VLSI/ASIC design. The role involves data generation, model training, orchestration, and building autonomous agents that interact with timing tools. The engineer will develop AI-driven solutions, integrate data sources, implement scalable orchestration, and build interpretable AI pipelines using GNNs, LLMs, and reasoning engines. Experience with Python, PyTorch/TensorFlow, graph/agentic AI frameworks, and EDA tools is required. | Agent, Data | 8 |
| Senior AI and MLOps Engineer - Security and Networking Research Senior AI/MLOps Engineer focused on building and maintaining infrastructure, tools, and processes for the AI lifecycle in a production environment, specifically for security and networking AI models and agents. The role involves optimizing models, deploying agentic systems and LLMs, designing training/inference pipelines, and collaborating with various engineering teams. | Agent, Serve | 8 |
| Senior Manager, System Software Engineering - Metropolis Accelerated and Inferencing Software Senior Manager for System Software Engineering at NVIDIA, focusing on Metropolis Accelerated and Inferencing Software. The role involves leading engineering teams, driving strategic implementations of inference solutions (TensorRT, vLLM) for edge and enterprise devices, performance benchmarking, and technical leadership in deep learning. Requires extensive experience in machine learning/deep learning, embedded software, GPU/CPU optimization, and multimodal AI systems. | Serve, Agent | 8 |
| Senior Product Architect, Storage NVIDIA is seeking a Senior Product Architect to design and validate AI storage infrastructure, focusing on optimizing systems for large-scale foundation model training, disaggregated inference, and agentic AI pipelines. The role involves architecting end-to-end reference architectures, defining system-level architectures, and collaborating with partners and customers to deliver proof-of-concepts. | Agent, Serve | 8 |
| Solutions Architect – AI Factory Solutions Architect role focused on designing, building, and operationalizing large-scale AI factories and GenAI/Agentic AI solutions for enterprise customers, leveraging NVIDIA's technology stack. This involves hands-on work with compute, networking, software, and cluster management tools. | Agent | 8 |
| Technical Marketing Engineer This role leads complex, cross-functional programs for next-generation generative AI systems, focusing on media content creation. It involves translating research into execution roadmaps, defining program plans, and managing model release readiness across various stages from research to product integration. The role requires strong program management skills in AI/ML and a solid understanding of generative AI systems. | Ship, Pretrain | 8 |
| Manager, AI and Software Manager for an AI team at NVIDIA, focusing on developing and leading the implementation of cutting-edge AI applications including RAG, LLMs, AI Agents, recommendation engines, and classical AI models. The role involves managing a team of 6-8 engineers, providing technical leadership, and collaborating with cross-functional teams to identify and implement AI opportunities. | Agent, Data | 8 |
| Senior Software Engineer, Video Analytics Senior Software Engineer role focused on building large-scale distributed Vision AI platforms for video analytics using NVIDIA Metropolis. The role involves designing and developing functionalities for video processing, integrating VLMs, CV models, and LLMs, and optimizing performance on NVIDIA hardware. Requires strong software development experience with ML systems, C++, Python, and GPU acceleration. | Ship, Serve | 8 |
| Senior Solutions Architect - Physical AI NVIDIA is seeking a Senior Solutions Architect for Physical AI to support customers building robotics and Physical AI solutions on NVIDIA’s platforms. This role involves guiding architecture, prototyping, and troubleshooting across robotics deployments from simulation to training to deployment, focusing on applied AI (computer vision, GenAI) for robotics. | Agent, Data | 8 |
| Senior GPU System Architect NVIDIA is seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves architecting system topologies, defining interconnects (NVLink, Ethernet), collaborating on RDMA, using system models for analysis, and co-designing hardware-software stacks for efficient AI workload deployment. | Serve | 8 |
| Senior System Software Engineer, Speech AI Senior System Software Engineer role focused on speech AI technologies (ASR, TTS, ALM, S2S) for enterprise and developer customers. Responsibilities include implementing, troubleshooting, and optimizing GPU-accelerated speech systems in production, transitioning models from research to production, optimizing inference performance, developing core speech services using C++ and Python with CUDA, and contributing to client SDKs. Requires strong programming skills, experience with inference pipelines, understanding of modern model architectures, and knowledge of real-time streaming audio and low-latency systems. Experience with speech model fine-tuning is required. | Serve, Post-train | 8 |
| Senior System Software Engineer, Speech AI NVIDIA is seeking an experienced Software Engineer to work on their GPU-accelerated Speech AI platform, focusing on building and optimizing core speech recognition (ASR), text-to-speech (TTS), and S2S services for real-time conversational AI applications. The role involves developing C++ & Python backend implementations, optimizing inference performance, adding new features, contributing to client libraries, and performance analysis of complex systems. | Serve, Post-train | 8 |
| NIM Solutions Architect This role focuses on deploying and optimizing large models using NVIDIA's Inference Microservice (NIM) and related tools. The Solutions Architect will package optimized models (LLM, VLM, etc.) into containers for deployment, refine NIM tools for the community, and design/implement agentic AI solutions for customer scenarios. The role requires strong programming skills, experience with inference engines, and MLOps practices, with a focus on performance engineering and model optimization. | Serve, Agent | 8 |
| Solution Architecture Intern, AI in Industry - 2026 NVIDIA is seeking an AI in Industry Solution Architecture Intern to help optimize large models, develop AI workflows, and deliver advanced AI solutions. The intern will provide technical support, design and implement optimizations for AI models, and set up model training or inference to identify and resolve bottlenecks. This role involves working with various AI models and inference frameworks, conducting research, and collaborating with global teams. | Serve, Post-train | 8 |
| Senior Software Engineer – ADAS Senior Software Engineer to develop production ADAS and autonomous driving functions in C++ and Python, integrating deep learning models into real-time inference pipelines on NVIDIA GPUs for safety-critical automotive applications. | Serve, Post-train | 8 |
| Performance Engineer Intern, Deep Learning and HPC - 2026 NVIDIA is seeking a Performance Engineer Intern to support performance testing of datacenter products and applications, focusing on AI workloads like LLM training and inference, as well as HPC. The role involves benchmarking, profiling, analyzing performance, developing automation scripts, and collaborating with internal teams. The intern will aggregate and report testing data for sales, marketing, and engineering teams, and assist in developing tools and processes for automated testing. | Serve, Post-train | 8 |
| Senior Scientist, Synthetic Data and Privacy This role focuses on research and development of synthetic data generation and privacy-preserving AI techniques, contributing to open-source libraries within the NVIDIA NeMo ecosystem. It involves building advanced pipelines, researching privacy methods like DP-SGD and NER for PII, and designing software libraries. The role requires a PhD, significant research experience in data privacy and synthetic data, a strong publication record, and expertise in PyTorch, HuggingFace, and LLM inference frameworks. | Data, Post-train | 8 |
| Senior Software Engineer, Robotics - Isaac Lab NVIDIA is seeking a Senior Software Engineer for their Isaac Lab team to develop features for a robot learning platform, focusing on reinforcement learning, multi-agent learning, and sim-to-real deployment. The role involves automating workflows, scaling in the cloud, and collaborating with research teams on next-generation robots. | Agent, Data | 8 |
| Software Engineering Manager, Robotics NVIDIA is seeking a Robotics Software Engineering Manager to lead a team focused on sim-first development, real-world deployment, and continuous learning for physical AI robots, such as Humanoid Robots. The role involves hands-on development, implementation, and deployment of real-time software stacks, fostering innovation, and collaborating with cross-functional teams. | Ship, Agent | 8 |
| Senior Solutions Architect, AI Factory NVIDIA is seeking a Senior Solutions Architect with expertise in AI Supercomputing to support academic and commercial groups using NVIDIA products for deep learning, data analytics, and scientific simulation. The role involves understanding customer needs, developing solutions, demonstrating workflows, and communicating requirements to NVIDIA Engineering. Requires 3+ years of Deep Learning research experience, experience with LLM training and adaptation, and familiarity with DL frameworks and Generative AI. | Post-train | 8 |