AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-386 -53%
340 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
40 new roles
22

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Jobs (434)

434 AI · 1824 total active
Show
Active onlyAI only (≥ 7)
Stage
AllData · 17Pretrain · 20Post-train · 28Serve · 236Agent · 95Eval Gate · 5Ship · 33
Function
AllEngineering · 375Research · 57Product · 2
Country
AllUnited States · 259China · 55Israel · 43Germany · 21Switzerland · 18United Kingdom · 14India · 13Poland · 12Vietnam · 12Canada · 10Italy · 7Netherlands · 6Singapore · 6France · 5Taiwan · 4Finland · 2Spain · 2Armenia · 1Czech Republic · 1Hungary · 1Japan · 1Romania · 1South Korea · 1Sweden · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Senior DGX Cloud AI Infrastructure Software Engineer
NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to develop and optimize infrastructure software and tools for large-scale AI training, post-training, and inference. The role focuses on improving efficiency and resiliency of AI workloads, co-designing APIs, and enhancing AI platforms, requiring strong debugging and distributed systems experience.
ServePost-trainEngineeringSanta Clara, CA +4 · RemoteApr 29
Principal Engineer, Autonomous Vehicles and Physical AI Solutions
Principal Engineer for Autonomous Vehicles and Physical AI Solutions at NVIDIA, focusing on strategic automotive and robotics partnerships in Japan. The role involves tailoring NVIDIA's full-stack AI technologies (DRIVE AGX Thor, Alpamayo, Cosmos) to meet production-grade requirements of OEMs, bridging AI, system optimization, and safety architecture. Responsibilities include innovating with reasoning VLA models, ensuring engineering alignment for partnerships, representing NVIDIA in industry forums, serving as technical authority for RFIs/RFQs, and establishing standards for physical AI production deployment.
101–150 of 434← Prev1234…9Next →
AgentShip
Engineering
Tokyo, Japan
Apr 2
9
Senior GPU Networking Architect
This role focuses on building and optimizing GPU communication kernels for large-scale AI systems, linking GPU computing with networking. The Senior GPU Networking Architect will leverage deep knowledge of GPU architecture to improve kernel efficiency, minimize latency, and overlap computation with communication. Responsibilities include developing GPU-resident communication primitives, profiling and tuning kernels, and collaborating with various teams to co-design communication strategies. The role requires strong CUDA programming, GPU architecture fundamentals, and systems-level C/C++ development.
ServeEngineeringZurich, Switzerland +4 · RemoteMar 309
Agent RL Infra Engineer
NVIDIA is seeking an engineer to develop and productionize reinforcement learning (RL) capabilities for agent teams within an enterprise context. The role involves evaluating and adapting RL approaches, designing reward environments, operationalizing training backends, and integrating with existing ML services. Responsibilities include leading data curation, designing RL training loops, integrating with GPU infrastructure, building observability, and collaborating with various platform and customer teams. The ideal candidate has extensive experience in operationalizing fine-tuning and RL techniques, familiarity with distributed training frameworks and MLOps, and proficiency in relevant programming languages.
Post-trainAgentEngineeringSanta Clara, CAMar 299
Senior AI ML Solution Engineer, AI-Native Development
Senior AI/ML Solution Engineer focused on designing and building AI-powered development pipelines, evaluating ML approaches for code generation and review, and driving adoption of AI-assisted software development. The role involves architecting feedback and evaluation systems, leading proof-of-concept development, and collaborating on risk-based development levels.
AgentEval GateEngineeringTel Aviv, IsraelMar 259
Director of Engineering, End to End Autonomous Driving
NVIDIA is seeking a Director of Engineering to lead the design and deployment of end-to-end autonomous driving systems. This role focuses on leveraging LLMs, VLMs, and VLAs for advanced planning and reasoning in vehicles and robotics, involving strategic leadership, team management, and technical oversight of ML model development and integration into safety-critical production environments.
ShipPost-trainEngineeringSanta Clara, CAMar 189
Director, Perception - Autonomous Vehicles
Director of Perception for Autonomous Vehicles at NVIDIA, leading teams to develop and deploy state-of-the-art deep learning models for real-time 3D world reconstruction and navigation. This role involves end-to-end ownership of the ML lifecycle, from data generation to deployment on NVIDIA DRIVE platforms, with a strong emphasis on safety-critical systems and cross-functional collaboration.
ShipDataEngineeringSanta Clara, CAMar 129
Senior Manager, Engineering - Enterprise AI and Automation
Senior Engineering Manager to lead the strategy and execution for NVIDIA’s agentic developer platform, focusing on building, evaluating, and improving autonomous agents. The role involves identifying gaps, driving POCs, operationalizing approaches into reusable components, and establishing governance and safety mechanisms to scale autonomous systems within NVIDIA.
AgentServeEngineeringSanta Clara, CAFeb 239
Senior High-Performance AI Training Engineer
Senior engineer focused on optimizing AI training workloads for performance on NVIDIA's hardware and software stack, from drivers to DL frameworks, impacting hardware/software roadmap and contributing to MLPerf benchmarks.
DataServeEngineeringSanta Clara, CAFeb 129
Senior Research Engineer Neural Reconstruction
Senior Research Engineer focused on neural reconstruction, developing and integrating neural rendering approaches for generative video, segmentation, and 3D reconstruction. The role involves adapting and fine-tuning generative models, collaborating on ML workflows, and contributing to core NVIDIA products. Requires strong Python and ML library skills, with experience in training and optimizing models.
Post-trainServeEngineeringSanta Clara, CAFeb 129
Senior Capability Development Engineer
NVIDIA is seeking a Senior Capability Development Engineer to develop and enhance internal RAG and Agent platforms for Ops Engineering productivity. The role involves developing, training, fine-tuning, and deploying multimodal LLMs, building LLM-based applications (RAG, TEXT2SQL, Agents), applying advanced tuning techniques, measuring performance, analyzing accuracy/bias, and driving dataset development. Requires strong Python skills, familiarity with ML/DL frameworks and LLMs, and practical experience with LLM training frameworks.
AgentPost-trainEngineeringShenzhen, ChinaFeb 119
Senior Software Architect, AI Networking
NVIDIA is looking for a Senior Software Architect to design and optimize inference infrastructure for large language models running on GPU clusters. The role involves working across software and hardware domains to define deployment and scaling strategies, optimize latency and throughput, and collaborate with various teams to ensure high-performance solutions.
ServeEngineeringTel Aviv, Israel +1Feb 49
Research Scientist, AI for Graphics and Gaming - New College Grad 2026
Research Scientist role focused on Generative AI for Graphics and Gaming, involving research, training, and prototyping of AI models for real-time graphics, world models, LLM-powered game experiences, and AI-driven characters. The role emphasizes step-change research and collaboration with product, driver, and hardware teams to ship features.
Post-trainPretrainResearchSanta Clara, CAJan 279
Research Scientist, Human‑AI Perception and Interaction Research - PhD New College Grad 2026
Research Scientist role focused on advancing AI in areas like gaming and robotics by understanding and shaping human perception, learning, and behavior through the lens of vision science, HCI, and HRI. The role involves proposing, researching, prototyping, and testing innovative ideas, publishing at top conferences, and collaborating with researchers and product engineers. Requires a PhD or equivalent research experience and a strong publication record.
Post-trainResearchSanta Clara, CAJan 219
Senior AI Algorithms Software Engineer
Senior AI Engineer at NVIDIA focused on developing and deploying foundation model applications (LLMs, VLMs, multi-modal) for manufacturing AI platforms, including computer vision, video understanding, and anomaly detection. The role involves technical leadership, co-development with customers, and driving research from concept to production.
ShipPost-trainEngineeringHsinchu, Taiwan +1Jan 219
Distinguished Engineer – High Performance AI
Distinguished Engineer role focused on building groundbreaking agentic AI systems for the CUDA ecosystem, encompassing multi-agent runtimes, orchestration, data/evaluation pipelines, training/inference stacks, and GPU-accelerated execution. The role involves defining technical strategy, co-designing solutions with hardware/software teams, developing evaluation frameworks, and driving architecture across the AI stack.
AgentServeEngineeringSanta Clara, CA +5 · RemoteJan 159
Research Scientist, Robotics Research - PhD New College Grad 2026
Research Scientist role focused on developing and integrating algorithms, models, and methods for robotic manipulation and loco-manipulation. The role involves contributing to multi-person research projects, publishing in top conferences, collaborating with product teams for research transfer, and working with real-world robotic systems and simulation. Requires a PhD and a strong research track record in robotics, ML, or related fields, with expertise in Python, deep learning frameworks, and robotics/simulation frameworks.
AgentDataResearchSeattle, WAJan 149
Senior Deep Learning Algorithm Engineer
Senior Deep Learning Algorithm Engineer at NVIDIA focused on optimizing deep learning training and inference workloads on state-of-the-art hardware and software platforms. The role involves performance analysis, profiling, and implementation of production-quality software, with a focus on squeezing performance from hardware and software stacks.
ServePost-trainEngineeringHo Chi Minh City, Vietnam +1 · RemoteJan 119
Research Scientist, AI-Mediated Reality and Interaction Research - PhD New College Grad 2026
Research Scientist role focused on fundamental research in AI-Mediated Reality and Interaction, involving interactive physical AIs, 4D world modeling, and human-AI interaction. The role requires proposing, researching, and prototyping innovative ideas, publishing at top conferences, and collaborating with engineers for technology transfer. Requires a Ph.D. and a strong research track record in AI and computer vision.
PretrainResearchSanta Clara, CA +2Jan 99
Research Scientist, ML Systems - PhD New College Grad 2026
Research Scientist role focused on ML Systems, contributing to hardware, software, and infrastructure for ML systems at various scales. The role involves understanding and developing solutions for efficiency, scaling, and resilience in ML systems, with a focus on co-design of algorithms and systems. Requires a PhD and expertise in areas like OS, distributed systems, inference/training systems, data management, cloud computing, or computer architecture.
ServePost-trainResearchSanta Clara, CA +3Jan 99
Senior Research Scientist, Efficient Deep Learning
Senior Research Scientist at NVIDIA focusing on efficient deep learning methods, including post-training optimization, architecture design, and resource-efficient training/fine-tuning. The role involves research, implementation, publication, collaboration, and technology transfer to products.
Post-trainServeResearchSanta Clara, CAJan 99
Senior GPU Architect, Deep Learning
NVIDIA is seeking a Senior GPU Architect to design and enhance GPU architecture features specifically for deep learning workloads, covering both training and inference. The role involves developing simulators, mapping deep learning algorithms to hardware, and advancing parallel computation. Requires strong C++, C++, Perl, Python programming, and a background in computer architecture and high-performance computing.
ServeEngineeringSanta Clara, CA +2Jan 99
Senior Research Scientist - Autonomous Vehicles
Research Scientist role focused on AI for autonomous vehicles, involving designing and implementing techniques, publishing research, and collaborating with product teams for deployment. The role emphasizes agent behavior, foundation models, closed-loop training, and AI safety within the robotics domain.
AgentResearchSanta Clara, CAJan 99
Senior Deep Learning Computer Architect
NVIDIA is seeking a Senior Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics algorithms. The role involves analyzing deep learning methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and core deep learning kernels.
ServeEngineeringSanta Clara, CA +1Jan 99
Senior Deep Learning Performance Architect
Senior Deep Learning Performance Architect role at NVIDIA focused on developing and analyzing next-generation architectures for AI and HPC applications. This involves performance modeling, simulation, and understanding the interplay of hardware and software for deep learning training and inference.
ServePost-trainEngineeringSanta Clara, CA +1Jan 99
Senior Research Engineer - Autonomous Vehicles
Senior Research Engineer at NVIDIA focusing on AI for Autonomous Vehicles. The role involves developing large-scale training frameworks for multimodal foundation models, optimizing GPU utilization, implementing data loaders, building simulation infrastructure, integrating new architectures, developing sim-to-real pipelines, combining LLMs with policy learning, and applying RL for fine-tuning LLMs. Requires expertise in deep learning, reinforcement learning, generative modeling, distributed training systems, and GPU acceleration.
Post-trainAgentResearchSanta Clara, CAJan 99
Senior Robotics Research Scientist
NVIDIA's Seattle Robotics Lab is seeking a Senior Robotics Research Scientist to develop algorithms, models, and methods for robotic manipulation and loco-manipulation, integrating them into real-world systems and transferring research into NVIDIA products. The role involves fundamental and applied research across the robotics stack, with a focus on enabling companies to become robotics companies.
ShipDataResearchSeattle, WAJan 99
Senior Deep Learning Software Engineer, Inference
Senior Software Engineer specializing in Deep Learning Inference, focusing on optimizing GPU-accelerated software for large-scale model serving and inference using frameworks like SGLang and vLLM. The role involves performance tuning, implementing latest algorithms, and scaling performance across NVIDIA accelerators.
ServeEngineeringNetherlands +2 · RemoteJan 99
Senior Research Engineer, Foundation Model Training Infrastructure
Senior/Principal Engineer to build cutting-edge infrastructure for large-scale foundation model training in the Generalist Embodied Agent Research (GEAR) group, focusing on Project GR00T for humanoid robots. Responsibilities include designing and optimizing distributed training systems, data loaders, and monitoring tools for multimodal foundation models.
PretrainPost-trainEngineeringSanta Clara, CAJan 99
Research Scientist, ML Systems - PhD New College Grad 2026
Research Scientist role focusing on ML Systems, contributing to hardware, software, and infrastructure for training, fine-tuning, and serving ML models at scale. Requires a PhD and expertise in systems research areas.
ServePost-trainResearchSingapore, Singapore · RemoteDec '259
Senior Software Architect, AI Networking
Senior Software Architect role focused on designing and optimizing large-scale LLM inference infrastructure on GPU clusters, involving system-level optimizations for latency, throughput, and cost-efficiency.
ServeEngineeringTel Aviv, IsraelDec '259
Senior Software Research Architect, AI Networking
NVIDIA is seeking a Senior Software Research Architect to improve the framework for large-scale LLM learning and prediction. This role focuses on designing and optimizing systems for generative AI workloads on advanced GPU clusters, specifically leveraging the NVIDIA Spectrum-X Networking Platform to define deployment and scaling strategies. The architect will work on inter-node communication, compute scheduling, and system-level optimization, collaborating with engineers and researchers to enable generative AI technologies in real-world applications.
ServePretrainResearchTel Aviv, IsraelNov '259
Senior LLM Train Framework Engineer
NVIDIA is seeking a Senior LLM Train Framework Engineer to contribute to the Megatron Core team, focusing on building and developing open-source frameworks for LLM and Multimodal foundation model pretraining and post-training. The role involves addressing AI training and inference challenges across the model lifecycle, enhancing distributed training strategies, and optimizing performance on NVIDIA GPUs.
PretrainPost-trainEngineeringShanghai, ChinaOct '259
AI Computing Software Development Engineer, TensorRT-LLM
NVIDIA is seeking a Software Development Engineer for its TensorRT-LLM team to develop and optimize LLM inference software for various platforms. The role involves performance analysis, tuning, and contributing to the architecture and hardware design, with a focus on scaling inference capabilities.
ServeEngineeringTaipei, Taiwan +1Sep '259
Senior Deep Learning Researcher, Diffusion
Senior Deep Learning Researcher at NVIDIA focusing on diffusion-based technologies and multi-modality. The role involves inventing and building new techniques, publishing findings, and contributing to NVIDIA's AI enterprise software. Requires a PhD, research experience, and publications in top-tier conferences. Experience with image/video understanding and LLMs is essential.
PretrainPost-trainResearchTel Aviv, IsraelMay '259
Senior Product Engineer, Agentic AI
Product leader and builder to define and drive Agentic AI products and platforms, bridging product vision with deep technical execution. Focus on translating early-stage innovations into scalable, production-ready systems for agentic AI workflows.
AgentProductHo Chi Minh City, Vietnam +11w ago8
Senior Manager, Interactive World Model Platforms
Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams) into an industry standard, focusing on production engineering, performance, and developer/researcher success across AV, robotics, rendering, and simulation.
ShipServeEngineeringMunich, Germany +21w ago8
Senior Manager, AlpaSim and AlpaDreams Production
Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams, AlpaSim) into an industry standard, focusing on production engineering, performance, and developer ecosystem growth for applications in AV, robotics, rendering, and simulation.
ShipServeEngineeringSanta Clara, CA +21w ago8
Senior Systems Software Engineer, Semiconductor Systems Inspection
Senior Software Engineer to develop AI products for semiconductor inspection, focusing on computer vision, multimodal AI, anomaly detection, model compression, and deployment optimization. The role involves building models, adaptation workflows, and inference pipelines for production environments, with a focus on advancing roadmap progress and delivering practical systems.
ShipServeEngineeringSanta Clara, CA1w ago8
AI Computing Software Development Engineer, LLM Inference
Software Development Engineer focused on LLM inference software (TensorRT LLM and TensorRT Edge LLM) at NVIDIA, involving crafting, scaling, performance analysis, optimization, and tuning of inferencing software for GPUs. The role requires strong C/C++ skills, experience with deep learning frameworks, and collaboration across teams.
ServeEngineeringShanghai, China +11w ago8
Senior Software Engineer, AIOps
NVIDIA is seeking a Senior Software Engineer for their AIOps platform team to build core distributed systems for ingesting telemetry from GPU clusters and operationalizing predictive AI models. The role involves architecting an agentic AIOps system, handling high-scale data engineering, and building model-serving infrastructure for SaaS and on-premises deployments.
AgentServeEngineeringRaanana, Israel +11w ago8
Senior Applied AI Engineer
NVIDIA is seeking a Senior Applied AI Engineer to build AI solutions that unify data across engineering systems, enabling advanced analytics through AI agents, copilots, and workflow automation for ASIC networking product engineering. The role involves end-to-end ownership from architecture to deployment and maintenance, aiming to scale engineering productivity.
AgentEngineeringYokneam, Israel1w ago8
Senior Software Engineer, Applied AI
Senior Software Engineer, Applied AI Systems role focused on building production AI/ML and agentic solutions. Responsibilities include developing agents, workflow services, APIs, data pipelines, tool integrations, evaluation harnesses, and operational tooling. Requires strong Python skills, experience with LLMs, RAG, agentic AI, distributed systems, and system design. The role emphasizes turning ambiguous problems into durable software systems and shaping how production applied AI systems are built and measured.
AgentEngineeringMunich, Germany1w ago8
Senior Inference Engineer, AIConfigurator for Dynamo
Senior Inference Engineer role focused on optimizing LLM inference deployment configurations using AIConfigurator, integrating GPU systems, model serving, and performance modeling for NVIDIA platforms.
ServeEngineeringSanta Clara, CA +1 · Remote2w ago8
Distinguished Engineer - Wireless Infrastructure
NVIDIA is seeking a Distinguished Engineer to lead the technology strategy for next-generation wireless infrastructure, focusing on AI-RAN and Agentic Core. The role involves applying AI/ML to 6G RAN functions, transforming the wireless core into an agentic AI-based architecture, and driving rapid prototyping of GPU-accelerated platforms. Responsibilities include system architecture, design, development, and performance optimization for AI-for-RAN software stacks, as well as driving new applications in Integrated Sensing and Communications (ISAC) and Physical AI at the Edge. The position requires deep expertise in AI/ML, communication systems, and significant industry experience.
AgentDataEngineeringSanta Clara, CA +2 · Remote2w ago8
Senior System Security Architect
NVIDIA is seeking a Senior Security Architect to design, build, and deploy AI agent systems for security workflows, integrating LLMs, RAG, and automation with security data. The role involves owning the full agentic system lifecycle and partnering with product teams.
AgentEngineeringTel Aviv, Israel +22w ago8
Senior Software Engineer - Autonomous Driving Simulation
Senior Software Engineer role focused on building and scaling realistic virtual environments for autonomous vehicle (AV) training, testing, and validation. The role involves developing simulation platforms, domain adaptation technologies (Real2Sim, Sim2Real), and optimizing large-scale simulation workflows. It requires strong programming skills in Python, C/C++, PyTorch, and experience with modern software engineering and infrastructure tools, as well as a background in computer vision, deep learning, or simulation systems.
DataAgentEngineeringSanta Clara, CA2w ago8
AI Computing Software Development Engineer, TensorRT
NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust, scalable inferencing software for GPUs. The role involves performance analysis, optimization, tuning, and collaborating with various teams to guide the direction of machine learning inferencing. Requires a Masters or higher degree, 2+ years of software development experience, strong C/C++ skills, and familiarity with deep learning frameworks.
ServeEngineeringShanghai, China2w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM AIGV
NVIDIA is seeking software engineers to develop and optimize inferencing software (TensorRT/TensorRT-LLM) for AI computing. The role involves performance analysis, tuning, integrating AI advancements, and collaborating across teams to shape machine learning inferencing on NVIDIA platforms. Requires strong programming skills, experience with deep learning frameworks, and a proactive approach.
ServeEngineeringShanghai, China +22w ago8
DL System Software Engineer - AI Platform
NVIDIA is seeking a DL System Software Engineer to join their AI Platform team. The role involves developing and building solutions for scheduling large-scale AI training and inference workloads on GPU clusters, optimizing performance and efficiency for large models. The engineer will work on core infrastructure, resource management, and GPU scheduling, contributing to NVIDIA's AI platform.
ServePost-trainEngineeringToronto, ON2w ago8