AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

© 2026 AI Hire Signal · Not affiliated with companies shown

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring: 440 / 623
Momentum (4w): ↓ -366 (-50%) · 360 opens last 4w · 726 prior 4w
Salary range: $100k–$575k · avg $262k · USD · disclosed roles only
Tracked since: May '25 · last role 5w ago
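The momentum figures above can be reproduced with simple period-over-period arithmetic. The sketch below assumes the dashboard uses a plain percent change of the last 4 weeks against the prior 4 weeks, rounded to a whole percent. The function name `period_change` is hypothetical and is not taken from the site.

```python
def period_change(current: int, prior: int) -> tuple[int, float]:
    """Return (absolute change, percent change) of current versus prior.

    Assumption: a plain percent change, 100 * (current - prior) / prior.
    """
    if prior == 0:
        raise ValueError("prior period has no roles; percent change is undefined")
    delta = current - prior
    return delta, 100.0 * delta / prior


# Figures from the stats card: 360 opens in the last 4 weeks, 726 in the prior 4.
delta, pct = period_change(current=360, prior=726)
print(f"{delta:+d} roles ({pct:+.0f}%)")  # -366 roles (-50%)
```

The same calculation applied to the 30-day figures in the FAQ (218 → 110) also rounds to -50%.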
Hiring velocity (new roles per week, oldest first)
Dec 30: 1
Mar 10: 1 · Mar 24: 1
Apr 28: 1
May 12: 4 · May 19: 5 · May 26: 3
Jun 2: 3 · Jun 9: 2 · Jun 16: 1 · Jun 23: 2 · Jun 30: 3
Jul 7: 4 · Jul 14: 1 · Jul 28: 2
Aug 11: 4 · Aug 18: 6 · Aug 25: 2
Sep 1: 3 · Sep 15: 8 · Sep 22: 3 · Sep 29: 6
Oct 6: 2 · Oct 13: 2 · Oct 20: 3 · Oct 27: 6
Nov 3: 9 · Nov 10: 8 · Nov 17: 8 · Nov 24: 4
Dec 1: 11 · Dec 8: 9 · Dec 15: 14 · Dec 22: 10 · Dec 29: 8
Jan 5: 107 · Jan 12: 22 · Jan 19: 45 · Jan 26: 32
Feb 2: 59 · Feb 9: 64 · Feb 16: 63 · Feb 23: 83
Mar 2: 83 · Mar 9: 88 · Mar 16: 97 · Mar 23: 72 · Mar 30: 215
Apr 6: 158 · Apr 13: 250 · Apr 20: 199 · Apr 27: 332
May 4: 304 · May 11: 189 · May 18: 131 · May 25: 102
Jun 1: 129 · Jun 8: 122 · Jun 15: 49 · Jun 22: 60

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (439)

434 AI · 1824 total active
Filtered: Country = United States
Show: Active only · AI only (≥ 7)
Stage: Data 28 · Pretrain 30 · Post-train 51 · Serve 356 · Agent 192 · Eval Gate 11 · Ship 55
Function: Engineering 627 · Research 82 · Product 14
Country: United States 439 · China 93 · Israel 54 · Germany 36 · Switzerland 31 · India 26 · United Kingdom 24 · Poland 17 · Vietnam 13 · Canada 12 · Singapore 11 · France 10 · Netherlands 9 · Italy 8 · Taiwan 6 · Hong Kong 4 · Japan 4 · Spain 3 · Australia 2 · Czech Republic 2 · Finland 2 · Hungary 2 · South Korea 2 · Armenia 1 · Brazil 1 · Mexico 1 · Romania 1 · Saudi Arabia 1 · Sweden 1 · United Arab Emirates 1
Columns: Title · Stage · Function · Location · First seen · AI score
Solutions Architect, Agentic AI
NVIDIA is seeking Solutions Architects to build and deploy agentic AI applications at scale for enterprises, focusing on integrating enterprise data, developing multi-modal dialogue systems, and task-specific agents. The role involves working with agentic frameworks, providing feedback to improve software products, and educating vertical teams.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 15 | AI score: 8
Senior Solutions Architect, Generative AI
Senior Solutions Architect role focused on customer engagements, improving AI workload performance, and developing proof-of-concepts for Generative AI solutions (LLMs, recommenders) using NVIDIA software and technologies. Requires strong coding, GPU optimization, and communication skills.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 15 | AI score: 8
Principal Deep Learning Communication Architect
NVIDIA is seeking a Principal Deep Learning Communication Architect to lead the technical roadmap for communication libraries across next-generation platforms, ensuring seamless scaling of models to massive clusters. The role involves designing and optimizing communication primitives for heterogeneous interconnects, co-designing with application developers and silicon architects, and developing analytical models for system behavior. Expertise in parallel computing, HPC/distributed deep learning, inference engines, and GPU architecture is required.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA +2 · Remote | First seen: Apr 14 | AI score: 8
Technical Lead, GenAI - Autonomous Vehicles
This role is a Technical Lead focused on Generative AI within Autonomous Vehicles, engaging with developer ecosystems and partners to promote NVIDIA's AI platforms. The candidate will act as a technical advisor, develop expertise in NVIDIA's platforms, create enablement resources, and represent partner needs internally. Requires a strong technical background in AI, AV systems, and GenAI model development, with experience in production code, DevOps, and DL/RL frameworks.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 8
Senior Software Engineer, Computer Vision - Autonomous Vehicles
Senior Software Engineer at NVIDIA for Autonomous Vehicles, focusing on Computer Vision and Machine Learning for offline perception tasks. Responsibilities include advancing DL components for training and inference, developing tools for large datasets, and integrating DL algorithms into large-scale pipelines.
Stage: Data, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 8
Senior Architect
NVIDIA is seeking a Senior Architect to lead the development of software infrastructure for AI-driven scientific discovery in chemistry and materials science. The role involves shaping NVIDIA ALCHEMI and its ecosystem, translating AI research (ML interatomic potentials, generative modeling) into product direction, and engaging with internal/external stakeholders. The ideal candidate has a PhD or equivalent experience, 8+ years of AI/ML software development for chemistry/materials, strong GPU computing and ML framework experience, and expertise in scientific software architecture.
Stage: Ship | Function: Product | Location: CA +1 · Remote | First seen: Apr 13 | AI score: 8
AI for Design Engineer
Develop and deploy AI agents and frameworks for hardware verification tasks, processing codebases and optimizing retrieval/generation algorithms for enterprise data.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 8
Engineering Manager, Prediction and Planning - Autonomous Vehicles
Engineering Manager for NVIDIA's Autonomous Vehicles division, leading teams to build and scale AI-native autonomous driving systems, integrating classical safety stacks with foundation models and large-scale AI systems from research to production.
Stage: Ship, Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 8
Senior Integration Engineer - Autonomous Vehicles
NVIDIA is seeking a Senior Integration Engineer to work on their end-to-end autonomous driving application, focusing on integrating modular software components and optimizing performance on heterogeneous hardware architectures. The role involves defining software architecture for L2/L3/L4 autonomous driving solutions, performing in-vehicle and simulation testing, and developing efficient C++ code using CUDA.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 8
Senior Integration Engineer - Autonomous Vehicles
Senior Integration Engineer for NVIDIA's end-to-end autonomous driving application, focusing on integrating software components, optimizing performance, and developing efficient C++ code on heterogeneous hardware architectures (including GPUs) for L2/L3/L4 autonomous driving solutions.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 8
Senior Product Manager, AI Frameworks
Product Manager for AI Frameworks at NVIDIA, focusing on Recommender Systems and Generative Recommendation Models. The role involves building products for frontier RecSys and Generative Recommendation Models on NVIDIA systems, enabling researchers and operators, and pushing the boundaries of what is possible in research-to-production. Responsibilities include creating and optimizing pre-training, inference, and post-training frameworks, developing product strategy, roadmaps, and go-to-market plans, and collaborating with internal and external customers. Requires experience with training, inference, post-training, and optimization software, GenAI/ML concepts, large-scale distributed systems, and technical product management.
Stage: Post-train, Serve | Function: Product | Location: Santa Clara, CA | First seen: Apr 12 | AI score: 8
Senior Product Manager, AI Inference - Dynamo
Product Manager for NVIDIA Dynamo, a distributed inference framework for LLMs and Generative AI. Focuses on defining the roadmap for high-scale serving, optimizing hardware-software co-design, and developing agentic inference capabilities. Collaborates with engineering, open-source communities, and customers to integrate model evaluation into workflows.
Stage: Serve, Agent | Function: Product | Location: Santa Clara, CA +4 · Remote | First seen: Apr 9 | AI score: 8
AI and FSI Developer Technology Engineer - New College Grad 2026
NVIDIA is seeking an AI and FSI Developer Technology Engineer to optimize AI and HPC workloads on NVIDIA GPUs and CPUs, focusing on performance tuning and eliminating bottlenecks for financial markets. The role involves research, development, analysis, and collaboration with experts to improve performance across the stack, from algorithms to kernels. The engineer will also publish and present their work and influence future hardware/software designs.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +3 · Remote | First seen: Apr 9 | AI score: 8
Senior Software Engineer, Platform Engineering
Senior Software Engineer to build next-generation AI platforms and products, focusing on agentic AI systems, RAG, and scalable infrastructure for enterprise workflows.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 7 | AI score: 8
Solutions Architect, Physical AI and Robotics
NVIDIA is looking for a Solutions Architect to guide partners in building enterprise Physical AI systems using Omniverse, Cosmos, synthetic data, and coding-agent-assisted digital twins workflows. The role involves technical advising on simulation, digital twins, robotics, industrial autonomy, and auto, focusing on architecture, compute, testing, and rollout strategies. Key responsibilities include guiding partners on synthetic data generation, evaluation methods, using coding agents for development acceleration, defining benchmarks, advising on compute infrastructure for simulation and inference, and building reference architectures.
Stage: Agent, Data | Function: Engineering | Location: CA · Remote | First seen: Apr 7 | AI score: 8
Senior Systems Software Engineer, E-commerce AI Platform - GeForce NOW
Senior Systems Software Engineer to architect and deploy production-grade AI agents for NVIDIA's e-commerce platform, focusing on personalization, logistics, and customer experience. Requires expertise in Python, Java, GoLang, distributed systems, and AI frameworks like LangChain/LangGraph.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 6 | AI score: 8
Senior SOC Product Architect Physical AI Platforms
This role focuses on architecting physical AI platforms for automotive and robotics, specifically defining the SoC architecture for embedded computer vision and AI systems. The individual will analyze use cases, map requirements to hardware/software features, define system requirements, and drive recommendations into product roadmaps. The role involves deep benchmarking, customer interaction, technical leadership, and mentorship, with a strong emphasis on functional safety (ISO 26262, SOTIF).
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 4 | AI score: 8
Senior Technical Program Manager - Agentic System
Senior Technical Program Manager to drive and coordinate cross-functional teams for large-scale technical projects in agentic AI, connecting foundation models with real-world applications for edge deployment and AI workflows.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 4 | AI score: 8
Principal Software Engineer - Enterprise AI Platform
Principal Software Engineer to lead security foundations for autonomous, self-evolving agents in an enterprise setting. This role involves defining security requirements, designing scalable architectures with guardrails, implementing isolation and access controls, building secure data access pathways, establishing observability and auditing, and operating a continuous evaluation framework for agent behavior. The goal is to enable developer velocity while ensuring robust safety and security for agents that generate and execute code and access data.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 4 | AI score: 8
Senior Power Analysis and Optimization Engineer
Senior Engineer to apply AI/ML and LLMs to power analysis and optimization for NVIDIA's GPUs and SoCs. Focus on developing and productionizing ML/RL models and custom LLMs to improve energy efficiency, interpret power data, and recommend optimizations. Involves RTL analysis, Verilog prototyping, and automation.
Stage: Serve, Data | Function: Engineering | Location: Santa Clara, CA +1 | First seen: Apr 4 | AI score: 8
Senior Machine Learning Applications and Compiler Engineer, LPX
Develops algorithms and optimizations for NVIDIA's LPX inference and compiler stack, focusing on mapping neural network workloads onto future NVIDIA platforms and optimizing end-to-end inference performance. Requires strong software engineering, compiler/runtime development, and deep learning framework experience.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 4 | AI score: 8
Senior Software Engineer, TensorRT-LLM
NVIDIA is seeking a Senior Software Engineer for its TensorRT-LLM team to develop and scale inferencing software for LLMs and Generative AI. The role involves crafting robust inferencing software, performing benchmarking and profiling for GPU applications, writing high-quality Python code for LLM inference, and improving the TensorRT-LLM library. Collaboration with software, research, and product teams is key.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 4 | AI score: 8
Senior Software Engineer – TensorRT Edge-LLM
Senior Software Engineer to develop and optimize a state-of-the-art inference framework for Large Language, Vision-Language, and Multimodal models on edge and embedded platforms, focusing on real-time performance and constrained environments.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +2 · Remote | First seen: Apr 4 | AI score: 8
Senior Performance Engineer - Deep Learning
Senior Performance Engineer at NVIDIA focused on optimizing Deep Learning models and frameworks (PyTorch, JAX) for NVIDIA GPUs. The role involves building and supporting Transformer Engine, collaborating on systems research for performance improvements, implementing and benchmarking new DL models, contributing to MLPerf, and engaging with the open-source community and enterprise customers. It also involves influencing future hardware and software design.
Stage: Serve, Post-train | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 4 | AI score: 8
Senior System Software Engineer, 3D Computer Vision
Senior System Software Engineer focused on 3D Computer Vision at NVIDIA, involving the development and deployment of advanced neural reconstruction models for generating 3D scenes. The role requires strong programming skills in Python and C/C++, a background in computer vision and deep learning, and experience with production-grade software development.
Stage: Post-train, Serve | Function: Engineering | Location: Santa Clara, CA +3 · Remote | First seen: Apr 4 | AI score: 8
Senior AI Research Scientist, Robotics Digital Twins
Senior AI Research Scientist role focused on developing digital twins for chemical, biological, and physical laboratories, integrating AI agents with science experiments, and collaborating with robotics and software engineers. Requires a Ph.D. and 5+ years of AI research experience in robotics.
Stage: Agent | Function: Research | Location: Santa Clara, CA | First seen: Apr 4 | AI score: 8
Senior Software Engineer, Quantized Inference
Senior Software Engineer focused on optimizing quantized inference for LLMs by implementing recipes, developing kernels, and collaborating on inference engines like vLLM and TRT-LLM. The role involves model export pipelines, benchmarking, and data analysis tooling.
Stage: Serve | Function: Engineering | Location: Redmond, WA +1 | First seen: Apr 4 | AI score: 8
Senior Compiler Engineer, AI Inference Performance
NVIDIA is seeking a Senior Compiler Engineer to optimize AI inference performance for their Deep Learning & AI Compiler (DLC) team. The role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and architecture teams to accelerate next-generation deep learning software for various AI applications.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: Apr 4 | AI score: 8
Senior Compiler Engineer, AI Inference Platforms
NVIDIA is seeking a Senior Compiler Engineer to join its Deep Learning & AI Compiler (DLC) team. The role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and architecture teams to accelerate AI inference performance on NVIDIA GPUs. The compiler is critical for data centers, personal devices, automotive, and robotics, focusing on inference performance, build time, memory footprints, and ease of use.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +4 · Remote | First seen: Apr 4 | AI score: 8
Research Scientist, Security and Privacy - PhD New College Grad 2026
Research Scientist focused on security and privacy for AI systems, aiming to develop hardware, software, and algorithms for trustworthy AI with verifiable protection. Requires a PhD and expertise in areas like computer architecture, programming languages, applied cryptography, or AI/ML algorithms, with a strong publication record.
Stage: Post-train | Function: Research | Location: Westford, MA +1 | First seen: Apr 4 | AI score: 8
Principal GenAI Engagement Lead, Partner Platforms
This role focuses on driving the technical integration of NVIDIA's Generative AI software with enterprise partners, including ISVs and CSPs. The Principal GenAI Engagement Lead will build trusted relationships, accelerate adoption, and influence product direction by designing and shipping methodologies, code, and reference architectures for RAG, LLM inference, and Multi-Agent workflows. The role requires a strong background in AI/ML, deep learning, and enterprise-grade GenAI systems, with experience in various LLM application stages and MLOps. The individual will act as the key technical lead, ensuring the deployment of robust, scalable GenAI solutions.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 4 | AI score: 8
AI Chip Design Engineer - New College Grad 2026
NVIDIA is seeking an AI Chip Design Engineer to develop and integrate AI capabilities into verification tasks. The role involves creating AI agents to enhance productivity, building production infrastructure for these agents, and optimizing algorithms for enterprise data. Requires strong proficiency in LLM libraries, GPU/CPU architectures, and HW verification methodologies.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Mar 31 | AI score: 8
Senior Solutions Architect – Simulation Solutions 3D Reconstruction
This role focuses on developing and scaling AI platforms for simulation and 3D reconstruction, particularly within the Omniverse ecosystem. The Senior Solutions Architect will act as a technical advisor, prototype solutions, implement intricate technical systems, provide technical enablement, and advocate for partner needs. The role requires expertise in AI, systems knowledge, autonomous systems, simulation, generative AI, Python, C++, DL/RL frameworks, computer vision, and 3D reconstruction.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Mar 31 | AI score: 8
AI Chip Design Engineer - New College Grad 2026
NVIDIA is seeking an AI Chip Design Engineer to develop and integrate AI capabilities into verification tasks, focusing on building and maintaining infrastructure for AI agents that process large codebases and optimize verification flows. The role involves developing retrieval and generation algorithms, integrating AI optimizations, and working with HW engineering teams.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Mar 30 | AI score: 8
Developer Relations Manager – AI Natives
NVIDIA is seeking a Developer Relations Manager to engage with AI-native companies, helping them design, optimize, and scale their AI platforms on NVIDIA technologies. The role involves advising founders and engineering teams on building agentic systems, AI copilots, and multimodal applications, with a focus on accelerating training, optimizing inference, and delivering AI experiences. The ideal candidate has deep technical expertise in AI systems, developer platforms, and large-scale inference infrastructure.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Mar 24 | AI score: 8
Senior AI Performance and Efficiency Engineer
Senior AI/ML Performance and Efficiency Engineer focused on optimizing GPU cluster performance for AI/ML researchers by addressing infrastructure and application bottlenecks. This role involves building tools, analyzing efficiency, and collaborating across teams to improve hardware, software, and infrastructure usage for various ML workloads like Robotics, Autonomous vehicles, LLMs, and Videos.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Mar 19 | AI score: 8
Engineering Manager, AI Developer Technology
Engineering Manager for NVIDIA's AI Developer Technology team, focused on leading a team to optimize and develop algorithms for Deep Learning and Machine Learning applications, influencing next-generation hardware/software, and collaborating with customers and internal teams. The role involves optimizing training and inference performance on NVIDIA hardware.
Stage: Serve, Post-train | Function: Engineering | Location: Santa Clara, CA +4 | First seen: Mar 17 | AI score: 8
Senior Developer Technology Engineer - AI
Senior Developer Technology Engineer focused on researching and optimizing AI/ML workloads for GPU acceleration, involving deep analysis, performance tuning, and collaboration with the developer community and internal teams to influence next-generation hardware and software design.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: Mar 17 | AI score: 8
Senior Design Automation Engineer, Applied AI
NVIDIA is seeking an Applied AI Engineer to lead end-to-end solution development for timing and constraint analysis workflows in VLSI/ASIC design. The role involves data generation, model training, orchestration, and building autonomous agents that interact with timing tools. The engineer will develop AI-driven solutions, integrate data sources, implement scalable orchestration, and build interpretable AI pipelines using GNNs, LLMs, and reasoning engines. Experience with Python, PyTorch/TensorFlow, graph/agentic AI frameworks, and EDA tools is required.
Stage: Agent, Data | Function: Engineering | Location: Santa Clara, CA +1 | First seen: Mar 17 | AI score: 8
Senior Product Architect, Storage
NVIDIA is seeking a Senior Product Architect to design and validate AI storage infrastructure, focusing on optimizing systems for large-scale foundation model training, disaggregated inference, and agentic AI pipelines. The role involves architecting end-to-end reference architectures, defining system-level architectures, and collaborating with partners and customers to deliver proof-of-concepts.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Mar 13 | AI score: 8
Technical Marketing Engineer
This role leads complex, cross-functional programs for next-generation generative AI systems, focusing on media content creation. It involves translating research into execution roadmaps, defining program plans, and managing model release readiness across various stages from research to product integration. The role requires strong program management skills in AI/ML and a solid understanding of generative AI systems.
Stage: Ship, Pretrain | Function: Product | Location: Santa Clara, CA | First seen: Mar 12 | AI score: 8
Senior Software Engineer, Video Analytics
Senior Software Engineer role focused on building large-scale distributed Vision AI platforms for video analytics using NVIDIA Metropolis. The role involves designing and developing functionalities for video processing, integrating VLMs, CV models, and LLMs, and optimizing performance on NVIDIA hardware. Requires strong software development experience with ML systems, C++, Python, and GPU acceleration.
Stage: Ship, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Mar 11 | AI score: 8
Senior Software Engineer, Robotics - Isaac Lab
Senior Software Engineer to join the Isaac Lab team, focusing on developing a platform for robot learning, including perception-in-the-loop reinforcement learning, multi-agent/multi-task learning, and VLA & RL integration. The role involves sim-to-real efforts, defining training workflows, and collaborating with research teams to advance humanoid robots.
Stage: Ship, Data | Function: Engineering | Location: Santa Clara, CA | First seen: Feb 20 | AI score: 8
Senior HPC Performance Engineer - AI for Science at Scale
Senior HPC Performance Engineer focused on optimizing large-scale, CUDA-backed ML training frameworks for AI in Science applications, particularly in digital biology and chemistry. The role involves kernel design, GPU porting, distributed learning, and algorithmic improvements within HPC software stacks.
Stage: Serve, Post-train | Function: Engineering | Location: Santa Clara, CA | First seen: Feb 17 | AI score: 8
Senior AI Application Developer - GPU and SOC Architecture Modeling
Senior AI Application Developer role focused on developing and deploying scalable GenAI applications to accelerate GPU/SOC architecture modeling. The role involves integrating LLMs into existing workflows, collaborating with hardware architects and infrastructure engineers, and researching emerging AI technologies. Requires proficiency in C++, Python, ML frameworks, and hands-on experience with LLMs and multimodal models.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Feb 4 | AI score: 8
Architect, AI Solutions Engineering
NVIDIA is looking for an AI Solutions Architect to scale internal AI platforms and solutions for thousands of developers. The role involves identifying AI opportunities, setting system outcomes, optimizing performance and cost, and collaborating with AI product vendors. Requires strong experience in building large-scale distributed systems and hands-on experience with LLMs, RAG, fine-tuning, and agentic orchestration.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Jan 28 | AI score: 8
Manager, Deep Learning Algorithms
Manager for Deep Learning Algorithms at NVIDIA, focusing on productizing DL models, optimizing inference, and leading engineering teams. The role involves working with LLMs/VLMs, inference optimization, and collaborating across NVIDIA to develop state-of-the-art algorithms for GPU-accelerated platforms.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Jan 27 | AI score: 8
Distinguished Engineer, JAX
Distinguished Engineer to develop NVIDIA's AI platform, focusing on performance optimizations in deep learning frameworks using JAX. The role involves designing and implementing core JAX components, driving peak performance on NVIDIA products, and building tools to increase the efficiency of AI-based system development teams. It bridges numerical computing, simulation, and deep learning research with real-world applications.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Jan 12 | AI score: 8
Distinguished Engineer - Dynamo
Distinguished Engineer role focused on NVIDIA Dynamo, an AI inferencing platform. The role involves technical leadership, driving product direction, and contributing to open-source projects to achieve state-of-the-art performance and scalability for AI inference across modalities on NVIDIA hardware.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +4 · Remote | First seen: Jan 9 | AI score: 8
Principal Software Engineer - Dynamo
Principal Software Engineer for NVIDIA Dynamo, an open-source platform for efficient, scalable inference of large language and reasoning models in distributed GPU environments. Focuses on Kubernetes serving, scalability, disaggregated serving, dynamic GPU scheduling, intelligent routing, and distributed KV cache management.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Jan 9 | AI score: 8