Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).
NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.
NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.
NVIDIA's active AI hiring is concentrated in serving infrastructure (54%), agents (21%), and application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).
Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, LLM observability, and multimodal.
In the past 30 days, NVIDIA has posted 110 new AI-related roles, a 50% decrease from the prior 30 days (218 → 110).
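The period-over-period figure quoted above can be reproduced from the two posting counts. The following is a minimal sketch, assuming the percentage is a simple relative change against the prior period; the function name `percent_change` is hypothetical and not part of the tracker.

```python
def percent_change(previous: int, current: int) -> float:
    """Relative change from `previous` to `current`, in percent."""
    if previous == 0:
        raise ValueError("previous count must be non-zero")
    return (current - previous) / previous * 100.0


# Counts quoted above: 218 postings in the prior 30 days, 110 in the latest 30.
change = percent_change(218, 110)
print(f"{change:.1f}%")  # prints "-49.5%"
```

Rounded to the nearest whole percent, this gives the 50% decrease reported above.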
| Title | Description | Stage | AI score |
|---|---|---|---|
| Senior Software Advanced Developer | Develop and prototype advancements in distributed training and inference using NVIDIA's Spectrum-X AI fabric, focusing on improving AI app-networking connections through communication refinement, congestion control, NIC firmware coding, and switch SDK features to enhance AI factory efficiency and large-scale AI system development, scaling, and speed. | Serve, Pretrain | 7 |
| Senior DGX Cloud AI Infrastructure Software Engineer | Senior Software Engineer role focused on building and integrating AI infrastructure for DGX Cloud, enabling developers to access GPU-optimized virtual machines. Responsibilities include crafting IaaS API integrations, developing a two-sided marketplace, and improving testing and observability for scalable, fault-tolerant solutions. | Serve | 7 |
| Director, Software Architecture | NVIDIA is seeking a Director of Software Architecture to lead the development of AI data centers and networking technologies. The role involves identifying and evaluating new technologies, leading the development of new networking applications using data plane programming, engaging with customers, defining a strategic vision, and managing technical teams. Requires an M.Sc. or PhD, 12+ years of software architecture experience, 8+ years of management experience, and expertise in AI inference algorithms, frameworks, and systems. | Serve | 7 |
| Software Architect, Advanced Development | Research role focused on the intersection of Networking, Security, and Communications, with a specific emphasis on applying AI to these domains. The role involves technical leadership, architecture design, SDK development for new hardware, and implementing services. A key aspect is working with AI-powered networking machines. | Serve | 7 |
| DGX Cloud Performance Engineer | NVIDIA is seeking Parallel and Distributed Systems engineers to drive performance analysis, optimization, and modeling for their DGX Cloud AI platform. The role involves developing benchmarks, analyzing performance bottlenecks, and collaborating with AI researchers to improve system performance and usability. Expertise in large-scale parallel systems, AI workloads, performance modeling, and AI frameworks is required. | Serve | 7 |
| Senior Software Architect - Deep Learning and HPC Communications | Senior Software Architect role at NVIDIA focusing on designing and implementing next-generation data center platforms and scalable communication software for AI and HPC workloads. The role involves investigating performance bottlenecks, exploring innovative HW/SW solutions, building proofs-of-concept, and using simulation to evaluate large GPU cluster performance. | Serve | 7 |