AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-386 -53%
340 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
40 new roles
22

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (1,674)

434 AI · 1824 total active
FilteredFunctionEngineering×
Show
Active onlyAI only (≥ 7)
Stage
AllData · 23Pretrain · 20Post-train · 28Serve · 265Agent · 102Eval Gate · 8Ship · 41
Function
AllEngineering · 1674Research · 68Product · 10
Country
AllUnited States · 945Israel · 413India · 146China · 119Taiwan · 78Germany · 34Switzerland · 26United Kingdom · 25Vietnam · 25Canada · 19Poland · 19France · 10Italy · 7Netherlands · 7Singapore · 6South Korea · 5Spain · 4Ukraine · 4Hungary · 3Japan · 3Romania · 3Czech Republic · 2Denmark · 2Finland · 2Palestine · 2Sweden · 2Armenia · 1Brazil · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Senior Software Engineer, AI Inference Systems
NVIDIA is seeking a Senior Software Engineer to build and optimize AI inference systems for large-scale models, focusing on extreme efficiency and performance across multi-GPU, multi-node, and multi-cloud environments. The role involves architecting inference stacks, optimizing GPU kernels and compilers, driving benchmarks (MLPerf), and orchestrating large-scale deployments.
ServeEngineeringToronto, ONApr 49
Principal Perception Engineer, Obstacle Foundation Models - Autonomous Vehicles
Principal Perception Engineer at NVIDIA for Autonomous Vehicles, focusing on designing and productizing next-generation 3D obstacle perception stacks using deep learning, transformers, and multi-modal techniques. The role involves technical leadership, hands-on algorithm development, production-grade model development, data strategy, and collaboration with safety and systems teams for large-scale deployment.
Agent
51–100 of 1,674← Prev123…34Next →
Data
Engineering
Santa Clara, CA
Apr 4
9
Senior Deep Learning Communication Architect
Senior Deep Learning Communication Architect role focused on optimizing communication performance for large-scale distributed deep learning training and inference. This involves identifying bottlenecks, designing efficient protocols, collaborating on hardware/software co-design, and exploring new communication technologies. The role requires deep understanding of parallelism techniques and experience with DNN frameworks and GPU computing.
ServePost-trainEngineeringSanta Clara, CA +1Apr 49
Senior Deep Learning Performance Architect - LPU
NVIDIA is seeking a Senior Deep Learning Performance Architect to focus on hardware-software co-design for AI Inference performance. The role involves designing GPU and system architectures, analyzing deep learning algorithms, building performance models, and collaborating with various teams to guide AI direction.
ServeEngineeringCA +1 · RemoteApr 49
Senior Systems Software Engineer - Deep Learning Solutions
Senior Systems Software Engineer focused on optimizing deep learning inference for autonomous vehicles and robotics on edge devices. Requires deep understanding of model architectures, kernel trace analysis, and evaluation of modern architectures on GPUs/SOCs, with a focus on TensorRT and compiler technology for embedded hardware.
ServePost-trainEngineeringSanta Clara, CAApr 49
AI Inference Performance Engineer
This role focuses on optimizing and benchmarking Generative AI inference performance on NVIDIA's hardware accelerators, specifically working with frameworks like TensorRT-LLM, SGLang, and vLLM. The engineer will drive industry benchmark results by implementing optimizations in quantization, scheduling, memory management, and distributed inference. They will also define and optimize cutting-edge workloads, architect distributed inference systems from single-GPU to rack-scale, establish performance methodology using profiling, and contribute to open-source projects. The role requires strong programming skills (Python/C++), expertise in DL frameworks, and a deep understanding of LLM/VLM architectures and inference mechanics.
ServeEngineeringSanta Clara, CAApr 49
Senior Deep Learning Engineer - Model Evaluation & AI Systems
Senior/Principal Deep Learning Engineer focused on building evaluation methodologies and infrastructure for AI models (LLMs, RAG, agents, vision/multimodal), including contributing to an open-source platform and collaborating with the community. The role involves working with model training, inference, and product teams to provide evaluation signals for release and optimization decisions.
Eval GateAgentEngineeringSanta Clara, CAApr 49
Senior Deep Learning Engineer
Senior Deep Learning Engineer at NVIDIA focused on optimizing inference for next-generation AI workloads including multi-agent systems and generative multimodal models. The role involves characterizing emerging workloads and developing novel optimization methods across the inference stack, from algorithmic to system level, on NVIDIA hardware. Collaboration with research, framework development, and silicon architecture teams is key.
ServeAgentEngineeringRedmond, WA +1Apr 49
Lead Principal Engineer, Enterprise Agentic AI Platform
Lead Principal Engineer for Enterprise Agentic AI Platform at NVIDIA, focusing on building and scaling production-grade agentic AI systems, including multi-agent orchestration, memory systems, and evaluation pipelines. Requires deep expertise in distributed systems, Kubernetes, GPU inference, and hands-on coding in Python/Go.
AgentServeEngineeringSanta Clara, CAApr 49
Senior Systems Software Engineer - Deep Learning Solutions
Senior Systems Software Engineer focused on deep learning inference optimization for autonomous vehicles and robotics on edge hardware. The role involves analyzing and improving deep learning models on NVIDIA platforms, benchmarking performance, evaluating emerging model architectures, and collaborating with compiler, runtime, and hardware teams to deliver inference solutions.
ServeEngineeringToronto, ON +1 · RemoteApr 49
Senior Deep Learning Compiler Engineer - XLA
Senior Deep Learning Compiler Engineer focused on optimizing inference and training performance for JAX and OpenXLA on NVIDIA GPUs. Develops compiler optimization algorithms, graph partitioning, tensor sharding, and code generation using MLIR, LLVM, and Triton.
ServePost-trainEngineeringSanta Clara, CA +5 · RemoteApr 49
Principal Software Engineer - AI Inference
Principal Software Engineer focused on advancing open-source LLM serving, specifically contributing to inference engines like vLLM and SGLang, optimizing them for NVIDIA GPUs and systems to achieve high-throughput, low-latency inference at scale. The role requires deep technical expertise in inference runtime architecture, GPU performance engineering, and distributed systems.
ServeEngineeringSanta Clara, CA +1 · RemoteApr 49
Senior DL Algorithms Engineer - Inference Performance
Senior DL Algorithms Engineer focused on optimizing inference performance for language and multimodal models using NVIDIA's inference stack (NIMs, TRT-LLM). Role involves profiling, analysis, and collaboration across hardware/software layers to maximize performance on GPUs.
ServeEngineeringSanta Clara, CA +1 · RemoteApr 49
Senior DGX Cloud AI Infrastructure Software Engineer
NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to develop and optimize infrastructure software and tools for large-scale AI training, post-training, and inference. The role focuses on improving efficiency and resiliency of AI workloads, co-designing APIs, and enhancing AI platforms, requiring strong debugging and distributed systems experience.
ServePost-trainEngineeringSanta Clara, CA +4 · RemoteApr 29
Principal Engineer, Autonomous Vehicles and Physical AI Solutions
Principal Engineer for Autonomous Vehicles and Physical AI Solutions at NVIDIA, focusing on strategic automotive and robotics partnerships in Japan. The role involves tailoring NVIDIA's full-stack AI technologies (DRIVE AGX Thor, Alpamayo, Cosmos) to meet production-grade requirements of OEMs, bridging AI, system optimization, and safety architecture. Responsibilities include innovating with reasoning VLA models, ensuring engineering alignment for partnerships, representing NVIDIA in industry forums, serving as technical authority for RFIs/RFQs, and establishing standards for physical AI production deployment.
AgentShipEngineeringTokyo, JapanApr 29
Senior GPU Networking Architect
This role focuses on building and optimizing GPU communication kernels for large-scale AI systems, linking GPU computing with networking. The Senior GPU Networking Architect will leverage deep knowledge of GPU architecture to improve kernel efficiency, minimize latency, and overlap computation with communication. Responsibilities include developing GPU-resident communication primitives, profiling and tuning kernels, and collaborating with various teams to co-design communication strategies. The role requires strong CUDA programming, GPU architecture fundamentals, and systems-level C/C++ development.
ServeEngineeringZurich, Switzerland +4 · RemoteMar 309
Agent RL Infra Engineer
NVIDIA is seeking an engineer to develop and productionize reinforcement learning (RL) capabilities for agent teams within an enterprise context. The role involves evaluating and adapting RL approaches, designing reward environments, operationalizing training backends, and integrating with existing ML services. Responsibilities include leading data curation, designing RL training loops, integrating with GPU infrastructure, building observability, and collaborating with various platform and customer teams. The ideal candidate has extensive experience in operationalizing fine-tuning and RL techniques, familiarity with distributed training frameworks and MLOps, and proficiency in relevant programming languages.
Post-trainAgentEngineeringSanta Clara, CAMar 299
Senior AI ML Solution Engineer, AI-Native Development
Senior AI/ML Solution Engineer focused on designing and building AI-powered development pipelines, evaluating ML approaches for code generation and review, and driving adoption of AI-assisted software development. The role involves architecting feedback and evaluation systems, leading proof-of-concept development, and collaborating on risk-based development levels.
AgentEval GateEngineeringTel Aviv, IsraelMar 259
Director of Engineering, End to End Autonomous Driving
NVIDIA is seeking a Director of Engineering to lead the design and deployment of end-to-end autonomous driving systems. This role focuses on leveraging LLMs, VLMs, and VLAs for advanced planning and reasoning in vehicles and robotics, involving strategic leadership, team management, and technical oversight of ML model development and integration into safety-critical production environments.
ShipPost-trainEngineeringSanta Clara, CAMar 189
Director, Perception - Autonomous Vehicles
Director of Perception for Autonomous Vehicles at NVIDIA, leading teams to develop and deploy state-of-the-art deep learning models for real-time 3D world reconstruction and navigation. This role involves end-to-end ownership of the ML lifecycle, from data generation to deployment on NVIDIA DRIVE platforms, with a strong emphasis on safety-critical systems and cross-functional collaboration.
ShipDataEngineeringSanta Clara, CAMar 129
Senior Manager, Engineering - Enterprise AI and Automation
Senior Engineering Manager to lead the strategy and execution for NVIDIA’s agentic developer platform, focusing on building, evaluating, and improving autonomous agents. The role involves identifying gaps, driving POCs, operationalizing approaches into reusable components, and establishing governance and safety mechanisms to scale autonomous systems within NVIDIA.
AgentServeEngineeringSanta Clara, CAFeb 239
Senior High-Performance AI Training Engineer
Senior engineer focused on optimizing AI training workloads for performance on NVIDIA's hardware and software stack, from drivers to DL frameworks, impacting hardware/software roadmap and contributing to MLPerf benchmarks.
DataServeEngineeringSanta Clara, CAFeb 129
Senior Research Engineer Neural Reconstruction
Senior Research Engineer focused on neural reconstruction, developing and integrating neural rendering approaches for generative video, segmentation, and 3D reconstruction. The role involves adapting and fine-tuning generative models, collaborating on ML workflows, and contributing to core NVIDIA products. Requires strong Python and ML library skills, with experience in training and optimizing models.
Post-trainServeEngineeringSanta Clara, CAFeb 129
Senior Capability Development Engineer
NVIDIA is seeking a Senior Capability Development Engineer to develop and enhance internal RAG and Agent platforms for Ops Engineering productivity. The role involves developing, training, fine-tuning, and deploying multimodal LLMs, building LLM-based applications (RAG, TEXT2SQL, Agents), applying advanced tuning techniques, measuring performance, analyzing accuracy/bias, and driving dataset development. Requires strong Python skills, familiarity with ML/DL frameworks and LLMs, and practical experience with LLM training frameworks.
AgentPost-trainEngineeringShenzhen, ChinaFeb 119
Senior Software Architect, AI Networking
NVIDIA is looking for a Senior Software Architect to design and optimize inference infrastructure for large language models running on GPU clusters. The role involves working across software and hardware domains to define deployment and scaling strategies, optimize latency and throughput, and collaborate with various teams to ensure high-performance solutions.
ServeEngineeringTel Aviv, Israel +1Feb 49
Senior AI Algorithms Software Engineer
Senior AI Engineer at NVIDIA focused on developing and deploying foundation model applications (LLMs, VLMs, multi-modal) for manufacturing AI platforms, including computer vision, video understanding, and anomaly detection. The role involves technical leadership, co-development with customers, and driving research from concept to production.
ShipPost-trainEngineeringHsinchu, Taiwan +1Jan 219
Distinguished Engineer – High Performance AI
Distinguished Engineer role focused on building groundbreaking agentic AI systems for the CUDA ecosystem, encompassing multi-agent runtimes, orchestration, data/evaluation pipelines, training/inference stacks, and GPU-accelerated execution. The role involves defining technical strategy, co-designing solutions with hardware/software teams, developing evaluation frameworks, and driving architecture across the AI stack.
AgentServeEngineeringSanta Clara, CA +5 · RemoteJan 159
Senior Deep Learning Algorithm Engineer
Senior Deep Learning Algorithm Engineer at NVIDIA focused on optimizing deep learning training and inference workloads on state-of-the-art hardware and software platforms. The role involves performance analysis, profiling, and implementation of production-quality software, with a focus on squeezing performance from hardware and software stacks.
ServePost-trainEngineeringHo Chi Minh City, Vietnam +1 · RemoteJan 119
Senior GPU Architect, Deep Learning
NVIDIA is seeking a Senior GPU Architect to design and enhance GPU architecture features specifically for deep learning workloads, covering both training and inference. The role involves developing simulators, mapping deep learning algorithms to hardware, and advancing parallel computation. Requires strong C++, C++, Perl, Python programming, and a background in computer architecture and high-performance computing.
ServeEngineeringSanta Clara, CA +2Jan 99
Senior Deep Learning Computer Architect
NVIDIA is seeking a Senior Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics algorithms. The role involves analyzing deep learning methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and core deep learning kernels.
ServeEngineeringSanta Clara, CA +1Jan 99
Senior Deep Learning Performance Architect
Senior Deep Learning Performance Architect role at NVIDIA focused on developing and analyzing next-generation architectures for AI and HPC applications. This involves performance modeling, simulation, and understanding the interplay of hardware and software for deep learning training and inference.
ServePost-trainEngineeringSanta Clara, CA +1Jan 99
Senior Deep Learning Software Engineer, Inference
Senior Software Engineer specializing in Deep Learning Inference, focusing on optimizing GPU-accelerated software for large-scale model serving and inference using frameworks like SGLang and vLLM. The role involves performance tuning, implementing latest algorithms, and scaling performance across NVIDIA accelerators.
ServeEngineeringNetherlands +2 · RemoteJan 99
Senior Research Engineer, Foundation Model Training Infrastructure
Senior/Principal Engineer to build cutting-edge infrastructure for large-scale foundation model training in the Generalist Embodied Agent Research (GEAR) group, focusing on Project GR00T for humanoid robots. Responsibilities include designing and optimizing distributed training systems, data loaders, and monitoring tools for multimodal foundation models.
PretrainPost-trainEngineeringSanta Clara, CAJan 99
Senior Software Architect, AI Networking
Senior Software Architect role focused on designing and optimizing large-scale LLM inference infrastructure on GPU clusters, involving system-level optimizations for latency, throughput, and cost-efficiency.
ServeEngineeringTel Aviv, IsraelDec '259
Senior LLM Train Framework Engineer
NVIDIA is seeking a Senior LLM Train Framework Engineer to contribute to the Megatron Core team, focusing on building and developing open-source frameworks for LLM and Multimodal foundation model pretraining and post-training. The role involves addressing AI training and inference challenges across the model lifecycle, enhancing distributed training strategies, and optimizing performance on NVIDIA GPUs.
PretrainPost-trainEngineeringShanghai, ChinaOct '259
AI Computing Software Development Engineer, TensorRT-LLM
NVIDIA is seeking a Software Development Engineer for its TensorRT-LLM team to develop and optimize LLM inference software for various platforms. The role involves performance analysis, tuning, and contributing to the architecture and hardware design, with a focus on scaling inference capabilities.
ServeEngineeringTaipei, Taiwan +1Sep '259
Senior Manager, Interactive World Model Platforms
Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams) into an industry standard, focusing on production engineering, performance, and developer/researcher success across AV, robotics, rendering, and simulation.
ShipServeEngineeringMunich, Germany +21w ago8
Senior Manager, AlpaSim and AlpaDreams Production
Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams, AlpaSim) into an industry standard, focusing on production engineering, performance, and developer ecosystem growth for applications in AV, robotics, rendering, and simulation.
ShipServeEngineeringSanta Clara, CA +21w ago8
Senior Systems Software Engineer, Semiconductor Systems Inspection
Senior Software Engineer to develop AI products for semiconductor inspection, focusing on computer vision, multimodal AI, anomaly detection, model compression, and deployment optimization. The role involves building models, adaptation workflows, and inference pipelines for production environments, with a focus on advancing roadmap progress and delivering practical systems.
ShipServeEngineeringSanta Clara, CA1w ago8
AI Computing Software Development Engineer, LLM Inference
Software Development Engineer focused on LLM inference software (TensorRT LLM and TensorRT Edge LLM) at NVIDIA, involving crafting, scaling, performance analysis, optimization, and tuning of inferencing software for GPUs. The role requires strong C/C++ skills, experience with deep learning frameworks, and collaboration across teams.
ServeEngineeringShanghai, China +11w ago8
Senior Software Engineer, AIOps
NVIDIA is seeking a Senior Software Engineer for their AIOps platform team to build core distributed systems for ingesting telemetry from GPU clusters and operationalizing predictive AI models. The role involves architecting an agentic AIOps system, handling high-scale data engineering, and building model-serving infrastructure for SaaS and on-premises deployments.
AgentServeEngineeringRaanana, Israel +11w ago8
Senior Applied AI Engineer
NVIDIA is seeking a Senior Applied AI Engineer to build AI solutions that unify data across engineering systems, enabling advanced analytics through AI agents, copilots, and workflow automation for ASIC networking product engineering. The role involves end-to-end ownership from architecture to deployment and maintenance, aiming to scale engineering productivity.
AgentEngineeringYokneam, Israel1w ago8
Senior Software Engineer, Applied AI
Senior Software Engineer, Applied AI Systems role focused on building production AI/ML and agentic solutions. Responsibilities include developing agents, workflow services, APIs, data pipelines, tool integrations, evaluation harnesses, and operational tooling. Requires strong Python skills, experience with LLMs, RAG, agentic AI, distributed systems, and system design. The role emphasizes turning ambiguous problems into durable software systems and shaping how production applied AI systems are built and measured.
AgentEngineeringMunich, Germany1w ago8
Senior Inference Engineer, AIConfigurator for Dynamo
Senior Inference Engineer role focused on optimizing LLM inference deployment configurations using AIConfigurator, integrating GPU systems, model serving, and performance modeling for NVIDIA platforms.
ServeEngineeringSanta Clara, CA +1 · Remote2w ago8
Distinguished Engineer - Wireless Infrastructure
NVIDIA is seeking a Distinguished Engineer to lead the technology strategy for next-generation wireless infrastructure, focusing on AI-RAN and Agentic Core. The role involves applying AI/ML to 6G RAN functions, transforming the wireless core into an agentic AI-based architecture, and driving rapid prototyping of GPU-accelerated platforms. Responsibilities include system architecture, design, development, and performance optimization for AI-for-RAN software stacks, as well as driving new applications in Integrated Sensing and Communications (ISAC) and Physical AI at the Edge. The position requires deep expertise in AI/ML, communication systems, and significant industry experience.
AgentDataEngineeringSanta Clara, CA +2 · Remote2w ago8
Senior System Security Architect
NVIDIA is seeking a Senior Security Architect to design, build, and deploy AI agent systems for security workflows, integrating LLMs, RAG, and automation with security data. The role involves owning the full agentic system lifecycle and partnering with product teams.
AgentEngineeringTel Aviv, Israel +22w ago8
Senior Software Engineer - Autonomous Driving Simulation
Senior Software Engineer role focused on building and scaling realistic virtual environments for autonomous vehicle (AV) training, testing, and validation. The role involves developing simulation platforms, domain adaptation technologies (Real2Sim, Sim2Real), and optimizing large-scale simulation workflows. It requires strong programming skills in Python, C/C++, PyTorch, and experience with modern software engineering and infrastructure tools, as well as a background in computer vision, deep learning, or simulation systems.
DataAgentEngineeringSanta Clara, CA2w ago8
AI Computing Software Development Engineer, TensorRT
NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust, scalable inferencing software for GPUs. The role involves performance analysis, optimization, tuning, and collaborating with various teams to guide the direction of machine learning inferencing. Requires a Masters or higher degree, 2+ years of software development experience, strong C/C++ skills, and familiarity with deep learning frameworks.
ServeEngineeringShanghai, China2w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM AIGV
NVIDIA is seeking software engineers to develop and optimize inferencing software (TensorRT/TensorRT-LLM) for AI computing. The role involves performance analysis, tuning, integrating AI advancements, and collaborating across teams to shape machine learning inferencing on NVIDIA platforms. Requires strong programming skills, experience with deep learning frameworks, and a proactive approach.
ServeEngineeringShanghai, China +22w ago8
DL System Software Engineer - AI Platform
NVIDIA is seeking a DL System Software Engineer to join their AI Platform team. The role involves developing and building solutions for scheduling large-scale AI training and inference workloads on GPU clusters, optimizing performance and efficiency for large models. The engineer will work on core infrastructure, resource management, and GPU scheduling, contributing to NVIDIA's AI platform.
ServePost-trainEngineeringToronto, ON2w ago8