AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-386 -53%
340 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
40 new roles
22

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (380)

434 AI · 1824 total active
FilteredFunctionEngineering×CountryUnited States×Clear all
Show
Active onlyAI only (≥ 7)
Stage
AllData · 28Pretrain · 30Post-train · 51Serve · 356Agent · 192Eval Gate · 11Ship · 55
Function
AllEngineering · 627Research · 82Product · 14
Country
AllUnited States · 439China · 93Israel · 54Germany · 36Switzerland · 31India · 26United Kingdom · 24Poland · 17Vietnam · 13Canada · 12Singapore · 11France · 10Netherlands · 9Italy · 8Taiwan · 6Hong Kong · 4Japan · 4Spain · 3Australia · 2Czech Republic · 2Finland · 2Hungary · 2South Korea · 2Armenia · 1Brazil · 1Mexico · 1Romania · 1Saudi Arabia · 1Sweden · 1United Arab Emirates · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
High-Performance LLM Training Engineer - New College Grad 2026
NVIDIA is seeking an experienced engineer to optimize LLM training workloads on high-performance computing systems, focusing on software stack optimization for thousands of GPUs and influencing future hardware roadmaps. The role involves performance analysis, profiling, and implementation across the deep learning platform, from drivers to frameworks, and contributing to MLPerf benchmarks.
DataEngineeringSanta Clara, CA1w ago9
Senior Systems Software Engineer, AI Stack and Performance - DGX Station
Senior Systems Software Engineer focused on optimizing AI stack performance and readiness on NVIDIA's DGX Station, a workstation-class AI computer. The role involves profiling, identifying bottlenecks, and driving optimizations across the full stack from GPU kernels to applications, ensuring AI workloads like LLM inference and agents run efficiently in multi-GPU, multi-user configurations. Collaboration with framework, compiler, and GPU architecture teams is critical.
1–50 of 380← Prev12…8Next →
ServeShip
Engineering
Santa Clara, CA +1 · Remote
3w ago
9
Senior Machine Learning Engineer, Perception - Autonomous Driving
NVIDIA is seeking a Senior Machine Learning Engineer for their Autonomous Driving Perception team. The role involves designing and developing end-to-end deep learning solutions for perception modules, focusing on road layout detection, lane structures, and other critical driving components. The engineer will also drive data-driven development, leverage simulation and augmentation, and productize solutions meeting safety and latency requirements. Experience with deep learning frameworks, Python/C++, and perception for autonomous driving or robotics is essential.
ShipDataEngineeringSanta Clara, CA +2 · Remote3w ago9
Senior Software Engineer, DGX Cloud AI Infrastructure
Senior Software Engineer to lead the bring-up, triage, benchmarking, analysis, and optimization of distributed training and inference workloads across NVIDIA GPU platforms at scale. This role involves setting technical direction for communication libraries, model frameworks, and inference/training stacks, leading performance and reliability investigations, defining benchmarking and qualification processes, and building resilience capabilities for large clusters.
ServePost-trainEngineeringSanta Clara, CA +4 · Remote3w ago9
Software Engineer, DGX Cloud AI Infrastructure
Software Engineer role focused on AI infrastructure, specifically distributed training and inference workloads on NVIDIA GPU platforms. Responsibilities include bring-up, triage, benchmarking, analysis, and optimization of these workloads at scale. Requires experience with multi-GPU/multi-node systems, debugging distributed environments, and strong Python/C++ skills.
ServePost-trainEngineeringSanta Clara, CA +4 · Remote3w ago9
Senior Deep Learning Performance Architect
NVIDIA is seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures for AI and HPC applications. The role involves developing innovative architectures, analyzing performance/cost/power trade-offs using models and simulators, understanding hardware/software interplay, and evaluating PPA for architectural decisions. Collaboration with software, product, and research teams is key. Requires MS/PhD, 6+ years experience, strong background in GPU/Deep Learning ASIC architecture for distributed training/inference, performance modeling, and ML/DL fundamentals, particularly transformer architectures. Proficiency in Python, C, C++ is essential.
ServeEngineeringSanta Clara, CA +13w ago9
AI Inference Performance Engineer - New College Grad 2026
NVIDIA is seeking an AI Inference Performance Engineer to optimize and benchmark GenAI inference on their accelerators, working with frameworks like TensorRT-LLM, SGLang, and vLLM. The role involves driving industry benchmark results, defining cutting-edge workloads, architecting distributed inference, establishing performance methodology, and influencing the ecosystem through open-source contributions and cross-functional partnerships. Requires strong programming skills, DL framework expertise, and a deep understanding of LLM inference mechanics.
ServeEngineeringSanta Clara, CA3w ago9
Senior Software Engineer, Generative AI Research
NVIDIA is seeking a Senior Software Engineer for Generative AI Research to build and operate scalable infrastructure for training their world foundation model for physical AI, Cosmos. This role involves designing and developing high-throughput systems for data processing, retrieval, and workflow orchestration, improving system reliability and performance, and contributing to long-term infrastructure strategy for training, data management, and large-scale compute efficiency. The role requires a strong engineering background in distributed systems, ML infrastructure, or large-scale compute/data platforms, proficiency in Python and C++/Go/Rust, and experience with orchestration systems and data pipelines. Experience with large-scale model training infrastructure, distributed compute, synthetic data, or multimodal datasets is a plus.
DataPretrainEngineeringSanta Clara, CA3w ago9
Senior Software Manager, Agentic AI
Senior Software Manager to lead a team building agentic AI solutions for chip design workflows, involving coding agents, custom skills, and integration with enterprise systems. The role requires technical leadership in designing, developing, and deploying AI applications using LLMs and agentic systems, including model customization (fine-tuning, RL, instruction tuning) and overseeing retrieval/generation algorithms for enterprise data. Collaboration with cross-functional teams and ensuring high technical standards for evaluation, guardrails, and monitoring are key.
AgentPost-trainEngineeringSanta Clara, CA3w ago9
Senior Software Engineer - Agentic AI
Senior Software Engineer role focused on leading Agentic AI solutions, including sophisticated AI agents and fine-tuning, integrating them with enterprise production systems. The role involves designing, developing, and deploying AI applications using LLMs, Agentic frameworks, and optimizing retrieval/generation algorithms for enterprise data (text, code, images) to build advanced AI applications for engineering assistants and multi-turn, multi-modal dialogue systems, ultimately solving complex problems in chip design.
AgentPost-trainEngineeringSanta Clara, CA4w ago9
Senior Systems Software Engineer, Machine Learning
Senior Systems Software Engineer focused on Machine Learning, specifically generative AI, LLMs/VLMs, computer vision, and agentic systems. The role involves converting research into production products, building and shipping ML workflows/pipelines, and leveraging AI in data generation. Key responsibilities include defining evaluation criteria and running offline evals. Experience with multi-agent pipelines, VLMs in production, and shipping AI-powered features to users is highly valued.
AgentDataEngineeringSanta Clara, CA +14w ago9
Senior High Performance AI Engineer
Senior High Performance AI Engineer to build multi-agent systems for the CUDA ecosystem, focusing on agentic runtimes, compiler-integrated orchestration, and GPU acceleration for agent workloads like planning, tool-use, and code generation. Collaborates across the AI stack from hardware to model/agent teams.
AgentServeEngineeringSanta Clara, CA +5 · Remote5w ago9
Senior Performance Architect, Nemotron
NVIDIA is seeking a Senior Performance Architect for Nemotron to focus on deep model-system-hardware co-design. The role involves developing high-fidelity performance models to evaluate architectural choices, predict deployment efficiency, and ensure Pareto-optimal trade-offs for future Nemotron models. This position will guide future software and hardware roadmaps by modeling end-to-end performance impact of GenAI workflows and collaborating with research, framework, compiler, and hardware teams.
ServeEngineeringSanta Clara, CA +25w ago9
Software Engineer, AI and DL Kernel Libraries - New College Grad 2026
Software Engineer role focused on developing AI systems software for efficient inference, including libraries, code generators, and GPU kernels for NVIDIA's hardware. The role involves designing abstractions, optimizing kernels, building LLM serving runtimes, and contributing to open-source projects like FlashInfer and vLLM.
ServeEngineeringSanta Clara, CA +1 · Remote5w ago9
Senior Machine Learning Engineer - Physical AI and Synthetic Data Generation
NVIDIA is seeking a Senior Machine Learning Engineer to join their Physical AI team. The role focuses on architecting and developing generative pipelines for high-fidelity synthetic data using multimodal and diffusion models. Responsibilities include building and fine-tuning large-scale models, applying user controls for data synthesis, establishing quality assurance pipelines, and leading the generation of massive training datasets. The role requires deep technical knowledge in image/video synthesis, strong programming skills, and experience in assessing synthetic data impact on model performance.
DataPost-trainEngineeringSanta Clara, CA6w ago9
Senior Software Engineer, Agentic AI – Nvidia Blueprints and NIM Integrations
Senior Software Engineer focused on integrating NVIDIA's NIM microservices and Blueprints into agentic AI frameworks. The role involves building and maintaining agentic workflows, developing test harnesses, and contributing to the open-source agentic AI ecosystem.
AgentEngineeringSanta Clara, CA +3 · Remote7w ago9
Senior DL Algorithms Engineer - Inference Performance
Senior engineer to optimize LLM/Omni model inference performance on NVIDIA's accelerated inference software stack, working across hardware and software layers. Involves enabling and optimizing open models, contributing code to frameworks like TRT-LLM and vLLM, profiling bottlenecks, and benchmarking.
ServeEngineeringSanta Clara, CA +1 · Remote7w ago9
Senior Deep Learning Software Engineer, Inference
Senior Software Engineer specializing in Deep Learning Inference to optimize GPU-accelerated software for AI applications. Focus on high-performance deep learning frameworks like SGLang and vLLM for efficient model serving and inference, improving performance across NVIDIA accelerators.
ServeEngineeringSanta Clara, CA +1 · Remote7w ago9
Senior GenAI Technical Lead, Partner Platforms
Senior Technical Lead for GenAI Product Integration at NVIDIA, focusing on integrating NVIDIA's GenAI software with enterprise ISV and CSP partners. The role involves defining technical strategy, building trusted relationships, and driving adoption of NVIDIA's offerings. Responsibilities include hands-on design and shipping of RAG, LLM inference, and Multi-Agent workflows, owning technical engagements, and representing partner needs to Product and Engineering teams. Requires strong background in AI/ML, Deep Learning, and building enterprise-grade GenAI systems, with experience in relevant programming languages and LLM application stages.
AgentServeEngineeringSanta Clara, CA +1 · Remote7w ago9
Senior Solutions Architect, Autonomous Driving - GenAI
Senior Solutions Architect focused on Generative AI and Autonomous Vehicles, engaging with customers to guide adoption of NVIDIA's full-stack technologies, including AI platforms, CUDA-X libraries, and GenAI/Physical AI solutions. Responsibilities include technical mentorship, developing AV perception and planning models, simulations, synthetic data generation, AI-enhanced manipulation/navigation, and building collateral for AI workflows. Requires strong experience in AV systems, GenAI model development, Python/C++, Linux, DevOps, and DL/RL frameworks.
AgentDataEngineeringSanta Clara, CA7w ago9
Senior Systems Software Engineer, Machine Learning
Senior Systems Software Engineer focused on building and shipping machine learning workflows and agentic systems, particularly leveraging LLMs/VLMs and computer vision for data generation and product features. The role involves converting research into production products, defining evaluation criteria, and iterating quickly.
AgentDataEngineeringSanta Clara, CA +17w ago9
 Senior AI Architect, Computer Use Agents
Senior Software Engineer role focused on building multi-modal agentic AI solutions for NVIDIA's software stack, aiming to accelerate various stages of the SDLC. Responsibilities include leading design and development, and creating benchmarks.
AgentEngineeringAustin, TX +4 · Remote7w ago9
Senior Machine Learning Engineer, End‑to‑End Autonomous Driving
Senior Machine Learning Engineer at NVIDIA focused on building, training, and deploying large-scale end-to-end autonomous driving models using VLM/VLA architectures and a data flywheel for continuous improvement. The role involves designing models, driving data collection and iteration, curating multimodal datasets, developing data-centric algorithms, exploring new data sources, and creating agentic data workflows.
ShipDataEngineeringSanta Clara, CA8w ago9
Senior Research Engineer - AI Coding Tools
Senior Research Engineer at NVIDIA focused on building and improving AI coding agents, fine-tuning code LLMs, designing evaluations, and developing interfaces for AI agents to interact with NVIDIA's developer tools. The role involves shipping novel agents and features, contributing to benchmarks, and generating synthetic data for AI-for-code applications.
AgentPost-trainEngineeringSanta Clara, CA8w ago9
Tech Engagement Lead - Model Builder
This role focuses on engaging with leading AI model builders to drive the adoption and optimize the performance of NVIDIA's hardware, systems, and software (e.g., GPUs, DGX, CUDA-X, NeMo, TensorRT) within their generative AI workflows, specifically for training and inference. The role involves technical integration, strengthening partnerships, influencing product roadmaps, and showcasing best practices for scalable AI model development pipelines.
ServePost-trainEngineeringSanta Clara, CA8w ago9
ML and Agentic Systems Engineer
NVIDIA's Cosmos team is seeking an ML and Agentic Systems Engineer to build AI-native systems and agentic workflows across the ML lifecycle. The role focuses on creating the meta-layer for ML development, enabling AI agents to interact with code, data, experiments, and evaluations to accelerate ML processes. Responsibilities include designing agentic workflows, building AI-native systems, creating self-improving loops, owning large-scale Python/PyTorch codebases, and scaling evaluation platforms.
AgentEval GateEngineeringSanta Clara, CA8w ago9
Senior Software Engineer, Agentic AI
Senior Software Engineer role focused on building and scaling agentic AI systems for high-performance code generation. Responsibilities include architecting agentic systems, scaling distributed systems, developing evaluation frameworks, optimizing for performance on NVIDIA GPUs, and establishing engineering standards. Requires experience in building coding agents, AI evaluation, and distributed systems.
AgentEval GateEngineeringRedmond, WA +18w ago9
Principal High-Performance LLM Training Engineer
NVIDIA is seeking a Principal Engineer to lead performance analysis and optimization of large-scale AI training and post-training workloads on NVIDIA's hardware and software stack. The role involves deep technical analysis across compute, memory, communication, and frameworks to improve efficiency and influence future roadmaps.
PretrainPost-trainEngineeringSanta Clara, CA8w ago9
Senior Software Engineer, AI Inference Systems
Senior Software Engineer focused on building and optimizing AI inference systems, including vLLM, GPU kernels, and orchestration for large-scale model deployments. The role involves performance engineering, benchmarking (MLPerf), and potentially research integration.
ServeEngineeringSanta Clara, CA8w ago9
Senior Deep Learning Software Engineer
Senior Deep Learning Software Engineer to design and build an automated inference and deployment solution with a scalable architecture focusing on ease-of-use and compute efficiency. The role involves developing features in high-level frameworks, implementing a high-performance execution environment, and low-level GPU optimizations.
ServeEngineeringSanta Clara, CA +1Apr 249
Senior Software Engineer, RL Post-Training Frameworks
NVIDIA is seeking a Senior Software Engineer to build and scale RL post-training infrastructure, focusing on distributed systems, high-performance computing, and deep learning infrastructure. The role involves architecting and optimizing RL training-inference-rollout loops, ensuring fault tolerance and elastic scaling, and collaborating with researchers and hardware teams.
Post-trainServeEngineeringSanta Clara, CA +1 · RemoteApr 239
Manager, Deep Learning – Autonomous Vehicles and Robotics
Manager for a Deep Learning Engineering team focused on delivering production-quality deep learning solutions for autonomous vehicles and robotics on edge hardware. The role involves leading a team, defining technical initiatives, and collaborating with automotive OEMs and robotics partners to optimize solutions on NVIDIA platforms, working at the intersection of model architectures, compiler technology, and embedded deployment.
ServePost-trainEngineeringSanta Clara, CAApr 229
Senior AI Software Engineer, Kernel Libraries
Senior AI Software Engineer focused on developing kernel libraries and inference systems software to accelerate AI workloads, including LLMs and agents, on NVIDIA's hardware. Responsibilities include innovating and optimizing kernels, designing abstractions for serving engines, and building compilers/runtimes.
ServeEngineeringSanta Clara, CA +1 · RemoteApr 229
Senior Software Engineer, AI and DL Kernel Libraries
Develops libraries, code generators, and GPU kernel technologies for NVIDIA's AI inference systems software stack, focusing on accelerating AI inference through efficient kernels, abstractions, and runtimes for LLMs and agents.
ServeEngineeringSanta Clara, CA +7 · RemoteApr 229
Senior AI Compiler Engineer, MLIR
NVIDIA is hiring a Senior AI Compiler Engineer to build an MLIR-based AI compiler for their inference engine, focusing on performance, low memory usage, and usability across data center and edge. The role involves developing graph representations, optimizations, defining APIs, and implementing compiler optimizations and kernel generation for neural networks.
ServeEngineeringSanta Clara, CA +5 · RemoteApr 229
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
Senior DL Software Engineer focused on optimizing and deploying large multimodal models (LLMs/VLMs) for real-time robotic execution in autonomous vehicles. The role involves advanced model compression, quantization, pruning, distillation, and inference optimization techniques for edge deployment on NVIDIA hardware, integrating with C++ production environments.
ServeAgentEngineeringSanta Clara, CAApr 219
Solutions Architect, AI Models
NVIDIA is seeking a Solutions Architect to help enterprise customers adopt NVIDIA AI software and models. This role involves developing end-to-end AI solutions, tackling complex challenges across the AI model lifecycle (data processing, orchestration, training, post-training, RL, evaluation, optimization), and supporting a broad model portfolio. The architect will partner with customers to understand their needs and deliver customized AI solutions, contributing to product improvement and sharing knowledge through open-source projects, product engineering, or training.
ShipPost-trainEngineeringSanta Clara, CA +1 · RemoteApr 219
Senior Solutions Architect, Retail
Senior Solutions Architect for Retail at NVIDIA, focusing on developing and deploying Agentic AI solutions for enterprise clients. The role involves building complex agentic systems, RAG pipelines, and optimizing inference performance using NVIDIA's AI infrastructure. Requires strong programming skills, experience with LLM applications, and agentic frameworks.
AgentEngineeringCA · RemoteApr 219
Senior Research Engineer - Video Search
Senior Research Engineer at NVIDIA to design and build video search technologies for Autonomous Vehicles, Robotics, and Medical applications, focusing on exabyte scale and agentic search. The role involves developing and integrating innovative video search approaches, benchmarking retrieval methods, and collaborating with researchers and product teams to build robust physical AI dataset search workflows.
AgentDataEngineeringSanta Clara, CAApr 199
Senior Deep Learning Software Engineer, LLM Performance
Senior Deep Learning Software Engineer focused on optimizing LLM inference performance on NVIDIA accelerators using frameworks like TensorRT LLM, VLLM, and Triton. The role involves implementing and scaling inference, serving, and deployment algorithms, collaborating with various teams, and contributing to NVIDIA/OSS LLM frameworks.
ServeEngineeringSanta Clara, CAApr 169
Senior Solutions Architect, Generative AI Specialist
Senior Solutions Architect specializing in Generative AI, focusing on developing end-to-end AI solutions, reference architectures, and proof-of-concept engagements for agentic AI systems and LLM-powered workflows. The role involves designing multi-cloud strategies, leading workshops, and advising on MLOps principles and emerging standards for agentic AI.
AgentEngineeringSanta Clara, CAApr 159
Senior Machine Learning and Simulation Engineer - Autonomous Vehicles
Senior ML Engineer focused on building and optimizing large-scale Reinforcement Learning (RL) training frameworks for multi-modal Autonomous Vehicle (AV) foundation models. This role involves designing simulation and data processing pipelines, refining reward functions, and ensuring the reliability of training workflows on GPU clusters, with a focus on closed-loop simulation for training end-to-end AV models.
Post-trainAgentEngineeringSanta Clara, CAApr 159
Solutions Architect, Generative AI
NVIDIA is seeking an AI Engineer or Solutions Architect to enable ecosystem partners for Generative AI. The role involves building innovative proof-of-concept solutions and reference architectures for AI agents, demonstrating NVIDIA's full-stack accelerated Generative AI platforms. Responsibilities include acting as a technical expert, developing foundational solutions, providing technical blueprints, advising on deployment, and enabling partners to build their own services and products. The role requires experience in deploying AI models at scale, building enterprise-grade agentic AI systems, and proficiency in LLM/VLM frameworks and Python/C++.
AgentServeEngineeringSanta Clara, CAApr 159
Senior ML Evaluation Engineer - Autonomous Vehicles
NVIDIA is seeking a Senior ML Evaluation Engineer for their Autonomous Vehicles team. The role involves designing and building learned evaluation pipelines using LLMs, VLMs, and agentic workflows to assess driving behavior. The engineer will define evaluation methodologies, build golden-set frameworks, and contribute to the transition from rule-based to learned evaluation systems. This position requires a strong background in ML system development, software engineering, and experience with large-scale data processing, with a focus on shipping production ML systems.
Eval GateAgentEngineeringSanta Clara, CA +4 · RemoteApr 159
Senior Software Engineer - AI Inference
Senior Software Engineer focused on optimizing and contributing to open-source LLM inference serving engines like vLLM and SGLang to run efficiently on NVIDIA GPUs, focusing on high-throughput, low-latency inference at scale.
ServeEngineeringSanta Clara, CA +3 · RemoteApr 149
Senior Solutions Architect, Autonomous Vehicles - Data Center
NVIDIA is seeking a Senior Solutions Architect for Autonomous Vehicles and Robotics to help customers accelerate Physical AI workloads using NVIDIA's full-stack technologies. The role involves engaging with customers to optimize training, simulations, and synthetic data generation for AV perception and planning models, providing technical expertise, and driving full-stack adoption. The candidate will analyze and optimize AI models for GPU performance, build collateral for various AI workflows, and provide technical leadership. Requires 8+ years of ML/DL Infra experience in AVs, proficiency in Python, CUDA/C++, Linux, DevOps tools, and a strong understanding of AV models and simulations. Experience with model deployment at scale and robotics model development is a plus. The role focuses on the data and infrastructure aspects of AI model development and deployment in the AV domain.
DataServeEngineeringSanta Clara, CAApr 149
Principal Engineer - AI Agents and Systems
Principal Engineer to lead the deployment of advanced AI agent frameworks and local runtimes on Windows and NVIDIA GPUs, focusing on open-source agents, local inference, privacy, and security for consumer PCs.
AgentServeEngineeringSanta Clara, CA +1Apr 139
Senior Software Engineer - Agentic Memory
Senior Software Engineer role focused on developing and researching agentic memory systems, including designing benchmarks, generating synthetic data, running experiments, and contributing to open-source evaluation tools. The role involves partnering with other NVIDIA teams deploying agents and advancing the state of the art in agentic memory evaluation.
AgentEval GateEngineeringCA +4 · RemoteApr 89
Senior Machine Learning Engineer, Perception - Autonomous Driving
NVIDIA is seeking a Senior Machine Learning Engineer for their autonomous driving perception team. The role involves designing and developing end-to-end deep learning solutions for perception modules, focusing on road layout detection and other critical driving components. Responsibilities include applied research, data-driven development, and productizing solutions with a focus on safety, latency, and robustness. Experience with deep learning frameworks, Python/C++, and perception for autonomous driving or robotics is required.
ShipDataEngineeringSanta Clara, CA +2 · RemoteApr 89
Senior High-Performance LLM Training Engineer
NVIDIA is seeking an experienced Senior High-Performance LLM Training Engineer to optimize LLM training workloads on advanced computing systems. The role focuses on improving the efficiency of NVIDIA's high-performance LLM software stack using frameworks like PyTorch and JAX for training on thousands of GPUs, and influencing future hardware roadmaps.
DataEngineeringSanta Clara, CAApr 89