AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-386 -53%
340 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
40 new roles
22

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (53)

434 AI · 1824 total active
FilteredStageServe×CountryChina×Clear all
Show
Active onlyAI only (≥ 7)
Stage
AllData · 28Pretrain · 30Post-train · 51Serve · 356Agent · 192Eval Gate · 11Ship · 55
Function
AllEngineering · 627Research · 82Product · 14
Country
AllUnited States · 439China · 93Israel · 54Germany · 36Switzerland · 31India · 26United Kingdom · 24Poland · 17Vietnam · 13Canada · 12Singapore · 11France · 10Netherlands · 9Italy · 8Taiwan · 6Hong Kong · 4Japan · 4Spain · 3Australia · 2Czech Republic · 2Finland · 2Hungary · 2South Korea · 2Armenia · 1Brazil · 1Mexico · 1Romania · 1Saudi Arabia · 1Sweden · 1United Arab Emirates · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to analyze, model, and optimize deep learning system performance, particularly for LLM workloads, on state-of-the-art hardware architectures. This role influences future hardware and software design by collaborating with various internal teams.
ServeEngineeringShanghai, China +12w ago9
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architectures for edge devices, workstations, and data center GPUs. The role involves benchmarking, performance modeling, bottleneck identification, and exploring new hardware/software capabilities, with a focus on LLMs and generative AI. Experience with AI agents for engineering workflows is also mentioned.
ServePost-train
1–50 of 53← Prev12Next →
Engineering
Shanghai, China
2w ago
9
Deep Learning Performance Software Engineer
Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks.
ServeEngineeringShanghai, China3w ago9
AI Computing Architect
NVIDIA is seeking an AI Computing Architect to develop innovative architectures for deep learning performance and efficiency, analyze trade-offs using models and simulators, and prototype algorithms. The role requires strong programming skills, computer architecture background, and a foundation in machine learning.
ServePost-trainEngineeringShanghai, China +14w ago9
Deep Learning Solution Architect
NVIDIA is seeking a Deep Learning Solution Architect to design and optimize production-grade generative AI solutions for enterprise customers, focusing on LLM training, RAG, and agentic inference using NVIDIA's ecosystem.
ServeAgentEngineeringBeijing, China +1Apr 49
Deep Learning Performance Software Engineer
Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks.
ServeEngineeringShanghai, ChinaApr 49
AI Computing Software Development Engineer, LLM Inference
Software Development Engineer focused on LLM inference software (TensorRT LLM and TensorRT Edge LLM) at NVIDIA, involving crafting, scaling, performance analysis, optimization, and tuning of inferencing software for GPUs. The role requires strong C/C++ skills, experience with deep learning frameworks, and collaboration across teams.
ServeEngineeringShanghai, China +11w ago8
AI Computing Software Development Engineer, TensorRT
NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust, scalable inferencing software for GPUs. The role involves performance analysis, optimization, tuning, and collaborating with various teams to guide the direction of machine learning inferencing. Requires a Masters or higher degree, 2+ years of software development experience, strong C/C++ skills, and familiarity with deep learning frameworks.
ServeEngineeringShanghai, China2w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM AIGV
NVIDIA is seeking software engineers to develop and optimize inferencing software (TensorRT/TensorRT-LLM) for AI computing. The role involves performance analysis, tuning, integrating AI advancements, and collaborating across teams to shape machine learning inferencing on NVIDIA platforms. Requires strong programming skills, experience with deep learning frameworks, and a proactive approach.
ServeEngineeringShanghai, China +22w ago8
Developer Technology Engineer - AI
NVIDIA is seeking an AI Developer Technology Engineer to study and develop cutting-edge deep learning techniques, analyze and optimize performance on GPU architectures, and work with customers to provide AI solutions using GPUs. The role involves close collaboration with internal NVIDIA teams to influence future architectures and software platforms.
ServeEngineeringShanghai, China +23w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM
NVIDIA is seeking software engineers to develop and optimize AI inference software (TensorRT/TensorRT-LLM) for GPUs. The role involves performance analysis, tuning, integrating new advancements, and collaborating across teams to shape the future of machine learning inferencing.
ServeEngineeringShanghai, China4w ago8
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to develop and optimize GPU-accelerated deep learning inference software, focusing on highly optimized kernels, performance analysis, and tuning. The role involves collaboration across various domains like automotive, image, and speech understanding, and requires strong C/C++ skills and GPU programming experience.
ServeEngineeringShanghai, China +15w ago8
Senior DGX Cloud AI Infrastructure Software Engineer
NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to design, build, and maintain AI infrastructure for large-scale AI training and inferencing. The role involves optimizing efficiency and resiliency of AI workloads, developing scalable AI and Data infrastructure tools, and ensuring high availability of AI systems.
ServeDataEngineeringShanghai, China5w ago8
AI Computing Software Development Engineer, TensorRT
NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust inferencing software for GPUs, focusing on performance analysis, optimization, and tuning. The role involves collaborating with various teams to guide machine learning inferencing direction and potentially publishing key results.
ServeEngineeringBeijing, China8w ago8
AI Computing Development Engineer, TensorRT-LLM
NVIDIA is seeking software engineers to develop and optimize inferencing software for AI models, specifically focusing on TensorRT-LLM. This role involves performance analysis, tuning, and collaboration across teams to advance machine learning inferencing capabilities.
ServeEngineeringShanghai, ChinaApr 178
Developer Technology Engineer - AI
NVIDIA Developer Technology Engineer focused on optimizing AI workloads, particularly large language models (LLMs), on NVIDIA's GPU platform. The role involves deep dives into application performance, GPU kernel optimization, distributed training and inference, and collaboration with various internal teams and external developers. It requires strong software engineering skills, parallel programming expertise, and a focus on performance analysis and tuning.
ServePost-trainEngineeringShanghai, China +2Apr 138
Senior Solutions Architect - KV Cache and AI Storage
Senior Solutions Architect focused on building LLM inference platforms using NVIDIA GPUs, KV cache, and tiered memory solutions. The role involves technical exploration with customers, performance analysis, and translating customer needs into product roadmaps.
ServeEngineeringBeijing, ChinaApr 78
Solutions Architect - Top AI Labs
Solutions Architect role at NVIDIA focusing on optimizing LLM inference and training acceleration, contributing to open-source frameworks like SGLang and vLLM, and developing KV cache offloading. Requires strong programming, systems fundamentals, and experience in performance analysis.
ServePretrainEngineeringBeijing, ChinaApr 78
Solutions Architect, Generative AI - CSP
NVIDIA is seeking an AI-focused Solutions Architect with expertise in LLMs, generative AI, agentic AI, or recommender systems. The role involves providing technical expertise to customers, assisting with GPU infrastructure for AI, optimizing training and inference pipelines, and gathering customer feedback for product development. This position requires 3+ years of experience in AI for large models and proficiency with AI tools.
ServePost-trainEngineeringShenzhen, China +1Apr 48
Senior Deep Learning Solution Architect
Senior Deep Learning Solution Architect at NVIDIA, focusing on LLM inference and training acceleration, performance optimization, and contributing to open-source frameworks like SGLang and vLLM. The role involves developing and optimizing inference frameworks, KV cache offloading, and exploring distributed training performance.
ServePost-trainEngineeringBeijing, China +1Apr 48
Solutions Architect - CPU and LPU
NVIDIA Solutions Architect focused on optimizing AI inference workloads across CPU, GPU, and LPU platforms for customers. The role involves technical expertise, proof-of-concept development, and optimizing AI efficiency in heterogeneous environments.
ServeAgentEngineeringBeijing, China +1Apr 48
NIM Solutions Architect
This role focuses on deploying and optimizing large models using NVIDIA's Inference Microservice (NIM) and related tools. The Solutions Architect will package optimized models (LLM, VLM, etc.) into containers for deployment, refine NIM tools for the community, and design/implement agentic AI solutions for customer scenarios. The role requires strong programming skills, experience with inference engines, and MLOps practices, with a focus on performance engineering and model optimization.
ServeAgentEngineeringBeijing, China +3Mar 68
Solution Architecture Intern, AI in Industry - 2026
NVIDIA is seeking an AI in Industry Solution Architecture Intern to help optimize large models, develop AI workflows, and deliver advanced AI solutions. The intern will provide technical support, design and implement optimizations for AI models, and set up model training or inference to identify and resolve bottlenecks. This role involves working with various AI models and inference frameworks, conducting research, and collaborating with global teams.
ServePost-trainEngineeringBeijing, China +2Mar 68
Performance Engineer Intern, Deep Learning and HPC - 2026
NVIDIA is seeking a Performance Engineer Intern to support performance testing of datacenter products and applications, focusing on AI workloads like LLM training and inference, as well as HPC. The role involves benchmarking, profiling, analyzing performance, developing automation scripts, and collaborating with internal teams. The intern will aggregate and report testing data for sales, marketing, and engineering teams, and assist in developing tools and processes for automated testing.
ServePost-trainEngineeringShanghai, ChinaMar 38
System Software Architect, AI and GPU Networking
NVIDIA is seeking a System Software Architect to research and develop advanced networking solutions for AI data centers, focusing on accelerating AI workloads, inference, and model serving. The role involves enhancing GPU networking offerings, designing optimizations for data movement, and evaluating new technologies.
ServePost-trainResearchBeijing, China +1Feb 268
Deeplearning Software Engineer -- Neural 3D reconstruction
Software Engineer role focused on deep learning for neural 3D reconstruction, involving research, design, implementation, optimization, and deployment of DNN models. The role requires C++, PyTorch, and ML/DL techniques, with a preference for experience in DNN development and network acceleration.
ServePost-trainEngineeringShanghai, China +1Feb 58
Senior Manager, Deep Learning Performance Architecture
NVIDIA is seeking an Engineering Manager to lead a Deep Learning Performance Architect Team. This role involves managing a team focused on analyzing deep learning networks and advancing deep learning computing systems through hardware/software co-design. Responsibilities include establishing team objectives, collaborating with software framework and hardware architecture teams, characterizing deep learning workloads, performance tuning, optimizing software stacks, and driving the evolution of next-generation hardware and software architectures.
ServeEngineeringShanghai, China +1Dec '258
Deep Learning Performance Architect
NVIDIA is seeking Software Engineers to join their Deep Learning Inference team, focusing on developing and optimizing GPU-accelerated deep learning kernels for inference. The role involves performance analysis, tuning, and collaboration with cross-functional teams on innovative solutions.
ServeEngineeringShanghai, China +1Dec '258
Senior System Software Architect, HPC and AI Networking
NVIDIA is seeking a Senior System Software Architect to design and prototype scalable software systems for distributed AI training and inference, focusing on optimizing throughput, latency, and memory efficiency. The role involves developing and evaluating communication libraries, collaborating with AI framework teams, co-designing hardware features for AI acceleration, and contributing to runtime systems and protocol layers.
ServePost-trainEngineeringBeijing, ChinaOct '258
Software Engineer, LLM Inference
Software Engineer focused on developing and optimizing LLM inference software and frameworks, working with GPU-accelerated libraries and deep learning frameworks.
ServeEngineeringShanghai, ChinaSep '258
Compute Architecture Software Engineer
NVIDIA is seeking an LLM Inference Software Engineer to accelerate LLM inference using GPU technology on the TRTLLM project. The role involves developing and optimizing software solutions, implementing GPU-based algorithms, and improving performance across diverse computing environments.
ServeEngineeringShanghai, ChinaSep '258
Software Engineer, cuDNN - Deep Learning
Software Engineer role focused on developing and optimizing cuDNN, a GPU-accelerated library for deep neural networks, including LLM support. The role involves performance analysis, tuning, and collaboration with cross-functional teams to innovate across various AI applications.
ServeEngineeringShanghai, ChinaSep '258
Deep Learning Performance Architect, CUTLASS DSL
NVIDIA is seeking an engineer to develop and optimize CUTLASS DSL, a Python-native language for GPU kernel development, and its associated MLIR dialects and lowering passes. The role involves accelerating kernel compilation for NVIDIA's next-generation AI platforms, aiming for performance comparable to CUTLASS C++.
ServeEngineeringShanghai, China +13w ago7
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architecture, analyze performance of deep learning algorithms on different architectures, identify bottlenecks, and explore new features and hardware capabilities. Requires a strong background in computer architecture and experience with deep learning platforms and frameworks.
ServeEngineeringShanghai, China3w ago7
Deep Learning Compiler Engineer - CUDA
NVIDIA is seeking a Deep Learning Compiler Engineer to design and implement DSLs and compiler cores for emerging GPU architectures, focusing on optimizing performance for AI/LLM workloads and integrating with AI/ML frameworks.
ServeEngineeringShanghai, China +14w ago7
Developer Technology Engineer, AI
NVIDIA Developer Technology Engineer focused on optimizing AI and deep learning applications on GPU architectures, working with customers to provide AI solutions, and collaborating with internal teams to influence future hardware and software design.
ServeEngineeringBeijing, China +24w ago7
Senior System Software Engineer - AI Performance and Efficiency Tools
NVIDIA is seeking a Senior System Software Engineer to develop tools for AI researchers and SW/HW teams running AI workloads on GPU clusters. The role involves building internal profiling, analysis, debugging, benchmarking, and simulation tools to improve the performance and efficiency of AI workloads and systems. This includes partnering with HW architects and understanding deep learning frameworks, distributed training/inference, and GPU cluster technologies.
ServeDataEngineeringShanghai, China5w ago7
Senior Developer Relations Manager
NVIDIA is seeking a Senior Developer Relations Manager to engage with the China industrial and research community, focusing on integrating GPU-accelerated computing solutions, particularly in Generative AI, Agentic AI, and AI Storage. The role involves understanding community requirements, promoting NVIDIA tools, architecting solutions, and driving adoption of new products within the AI storage ecosystem.
ServeAgentEngineeringBeijing, China +17w ago7
Developer Technology Engineer – AI
NVIDIA Developer Technology Engineer focused on optimizing deep learning and machine learning workloads on NVIDIA's accelerated computing platform (GPU, CPU, DPU) for key customers. Requires strong C/C++ and CUDA experience, with an MS/PhD in CS or related field.
ServeEngineeringShanghai, China +2Apr 247
Senior Computer Vision and Deep Learning Hardware Architect
NVIDIA is seeking an Autonomous Vehicle Performance Architecture Engineer to design, model, and verify state-of-the-art programmable vision accelerators (PVA) for automotive and robotics. The role involves optimizing software for autonomous driving solutions, analyzing and prototyping applications, building performance models for future architectures, and collaborating with teams to enhance PVA architecture. Requires a Masters/PhD, 3+ years of relevant experience, strong C/C++ and computer architecture skills, and performance modeling/optimization expertise. Experience in DSP programming, autonomous vehicle software, deep learning, computer vision, and self-driving cars is a plus.
ServePost-trainEngineeringShanghai, ChinaApr 157
Senior Software Engineer, NCCL
Senior Software Engineer role focused on designing, implementing, and maintaining highly-optimized communication runtimes for Deep Learning frameworks and HPC programming interfaces on GPU clusters. This involves system software development, parallel programming interface contributions, and proof-of-concept creation for new designs and hardware features.
ServeEngineeringShanghai, ChinaApr 157
Solution Architect – Accelerated Computing Libraries
NVIDIA is seeking a Solution Architect to drive the adoption of their AI and accelerated computing libraries across industries. The role involves understanding customer workloads, designing solutions using NVIDIA libraries for LLM inference and training acceleration, and collaborating with product teams to improve features and performance. The candidate will also build technical assets and analyze industry trends.
ServeEngineeringBeijing, China +1Apr 87
Senior Deep Learning Test Development Engineer, SDET
Senior Deep Learning Test Development Engineer (SDET) at NVIDIA's AI SWQA team, responsible for validating the robustness and performance of NVIDIA's AI software and GPU Infrastructure across various AI scenarios. The role involves test planning, design, execution, automation, and bug management, with a focus on improving workflow processes and efficiency. Experience with LLM inference frameworks and AI development tools is required.
ServeEngineeringShanghai, ChinaApr 47
Senior Software Test Development Engineer - Deep Learning
NVIDIA is seeking a Senior Software Test Development Engineer for its AI SWQA team. This role involves defining, developing, and executing tests to validate the robustness and performance of NVIDIA's AI software and GPU infrastructure across various AI applications like autonomous driving, healthcare, and NLP. The engineer will collaborate with AI product teams, develop complex test plans, manage bug lifecycles, and automate test cases for CI/CD pipelines. The position requires a Master's degree, 5+ years of QA/test automation experience, strong Python skills, and direct experience with AI tools/products or using AI for major features. Experience with AI for QA automation and deep learning frameworks is a plus.
ServeEngineeringShanghai, ChinaApr 47
Senior Solutions Architect, GPU System
NVIDIA is seeking a Senior Solutions Architect with expertise in GPU server platforms and AI infrastructure to help customers design, deploy, and optimize NVIDIA-based AI factories. The role involves leading presales and architecture engagements, designing end-to-end AI data center solutions, and supporting the deployment of NVIDIA platforms for LLM training and inference workloads.
ServeAgentEngineeringBeijing, China +1Apr 27
Solution Architect - Top AI Labs
Solution Architect role focused on designing AI computing platform architectures and supporting top AI Labs and model builders in integrating NVIDIA technologies for Deep Learning, HPC, Robotics, and Signal Processing applications. Requires experience with ML, data analytics, computer vision, and parallel programming on cloud/HPC architectures.
ServeEngineeringBeijing, China +1Apr 27
Devtech Compute Engineer
NVIDIA is seeking a Devtech Compute Engineer to develop performance-critical code for deep learning applications, focusing on accelerating model training and inference on GPUs, particularly for recommender systems. The role involves optimizing CUDA kernels, integrating solutions into open-source libraries, and collaborating with hardware teams to define future solutions across various domains like LLM, Recsys, Robotics, and Assisted Driving.
ServeDataEngineeringBeijing, China +1Feb 277
Senior System Software Architect, AI and GPU Networking
This role focuses on architecting and enhancing NVIDIA's GPU Networking offerings to accelerate AI workloads, including distributed AI, deep learning, inference, and model serving. It involves co-designing hardware features and leading the architecture and design of new technologies for AI data centers.
ServePost-trainEngineeringBeijing, China +1Feb 267
Senior Developer Technology Engineer
This role focuses on optimizing GPU-accelerated code for training and inference performance of large-scale recommender systems. It involves designing and implementing high-performance C++/CUDA components, developing tests, and optimizing data flows between GPUs, NICs, and SSDs. The ideal candidate has experience with C++, CUDA, Python, GPU performance profiling, and ideally, building or optimizing recommender systems or production ML workloads on GPUs.
ServeShipEngineeringBeijing, China +1Feb 267
HPC and AI Cluster Engineer
NVIDIA is seeking an HPC and AI Cluster Engineer to manage and maintain large-scale HPC/AI clusters, including Linux job scheduling, CI/CD pipelines, and troubleshooting from bare metal to application level. The role involves supporting R&D activities and POCs, working with cutting-edge hardware and software, and collaborating with researchers and customers to develop solutions.
ServeEngineeringShanghai, China +1Feb 147