AI Hire Signal
Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

© 2026 AI Hire Signal·Not affiliated with companies shown

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring: 440 / 623
Momentum (4w): ↓ -386 (-53%) · 340 opens last 4w · 726 prior 4w
Salary range: $100k–$575k · avg $262k · USD, disclosed roles only
Tracked since: May '25 · last role 4w ago
Hiring velocity (new roles per week, oldest first)
Dec 30 · 1 new role
Mar 10 · 1 new role
Mar 24 · 1 new role
Apr 28 · 1 new role
May 12 · 4 new roles
May 19 · 5 new roles
May 26 · 3 new roles
Jun 2 · 3 new roles
Jun 9 · 2 new roles
Jun 16 · 1 new role
Jun 23 · 2 new roles
Jun 30 · 3 new roles
Jul 7 · 4 new roles
Jul 14 · 1 new role
Jul 28 · 2 new roles
Aug 11 · 4 new roles
Aug 18 · 6 new roles
Aug 25 · 2 new roles
Sep 1 · 3 new roles
Sep 15 · 8 new roles
Sep 22 · 3 new roles
Sep 29 · 6 new roles
Oct 6 · 2 new roles
Oct 13 · 2 new roles
Oct 20 · 3 new roles
Oct 27 · 6 new roles
Nov 3 · 9 new roles
Nov 10 · 8 new roles
Nov 17 · 8 new roles
Nov 24 · 4 new roles
Dec 1 · 11 new roles
Dec 8 · 9 new roles
Dec 15 · 14 new roles
Dec 22 · 10 new roles
Dec 29 · 8 new roles
Jan 5 · 107 new roles
Jan 12 · 22 new roles
Jan 19 · 45 new roles
Jan 26 · 32 new roles
Feb 2 · 59 new roles
Feb 9 · 64 new roles
Feb 16 · 63 new roles
Feb 23 · 83 new roles
Mar 2 · 83 new roles
Mar 9 · 88 new roles
Mar 16 · 97 new roles
Mar 23 · 72 new roles
Mar 30 · 215 new roles
Apr 6 · 158 new roles
Apr 13 · 250 new roles
Apr 20 · 199 new roles
Apr 27 · 332 new roles
May 4 · 304 new roles
May 11 · 189 new roles
May 18 · 131 new roles
May 25 · 102 new roles
Jun 1 · 129 new roles
Jun 8 · 122 new roles
Jun 15 · 49 new roles
Jun 22 · 40 new roles
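
The 4-week momentum figures reported above (340 opens in the last 4 weeks, 726 in the prior 4, a change of -386 or -53%) appear to follow from the last eight weekly counts in the hiring-velocity series. The sketch below reproduces that arithmetic. The function name `four_week_momentum` and the assumption that the site sums plain weekly counts over two adjacent 4-week windows are illustrative, not taken from the site's documentation.

```python
from typing import Sequence


def four_week_momentum(
    weekly_counts: Sequence[int], window: int = 4
) -> tuple[int, int, int, float]:
    """Return (recent, prior, delta, pct_change) for the two latest windows.

    Hypothetical helper: assumes momentum is the sum of the most recent
    `window` weekly counts compared with the sum of the `window` before it.
    """
    if len(weekly_counts) < 2 * window:
        raise ValueError("need at least two full windows of weekly counts")
    recent = sum(weekly_counts[-window:])
    prior = sum(weekly_counts[-2 * window:-window])
    delta = recent - prior
    # Guard against an empty prior window to avoid dividing by zero.
    pct_change = 100.0 * delta / prior if prior else float("nan")
    return recent, prior, delta, pct_change


# Last eight weekly counts from the series (May 4 through Jun 22).
last_eight_weeks = [304, 189, 131, 102, 129, 122, 49, 40]

recent, prior, delta, pct = four_week_momentum(last_eight_weeks)
print(f"{recent} opens last 4w, {prior} prior 4w, change {delta:+d} ({pct:+.0f}%)")
# prints: 340 opens last 4w, 726 prior 4w, change -386 (-53%)
```

The same helper works for other window sizes by passing `window`, as long as the series holds at least two full windows.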

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (434)

434 AI · 1824 total active
Show: Active only, AI only (≥ 7)
Stage: All, Data · 17, Pretrain · 20, Post-train · 28, Serve · 236, Agent · 95, Eval Gate · 5, Ship · 33
Function: All, Engineering · 375, Research · 57, Product · 2
Country: All, United States · 259, China · 55, Israel · 43, Germany · 21, Switzerland · 18, United Kingdom · 14, India · 13, Poland · 12, Vietnam · 12, Canada · 10, Italy · 7, Netherlands · 6, Singapore · 6, France · 5, Taiwan · 4, Finland · 2, Spain · 2, Armenia · 1, Czech Republic · 1, Hungary · 1, Japan · 1, Romania · 1, South Korea · 1, Sweden · 1
Sort: AI score, Recent, Title
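
The per-stage counts in the Stage filter sum to 434, the number of AI jobs listed, which suggests each role is counted once under a single stage. Under that assumption, stage shares like those quoted in the FAQ can be derived as below. This is a sketch only: the helper name `stage_shares` is hypothetical, and because the FAQ percentages come from a different snapshot of the index they need not match these values exactly.

```python
def stage_shares(counts: dict[str, int]) -> dict[str, float]:
    """Convert per-stage role counts into percentage shares of the total."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no roles to compute shares from")
    return {stage: 100.0 * n / total for stage, n in counts.items()}


# Counts copied from the Stage filter on this page.
stage_counts = {
    "Data": 17,
    "Pretrain": 20,
    "Post-train": 28,
    "Serve": 236,
    "Agent": 95,
    "Eval Gate": 5,
    "Ship": 33,
}

shares = stage_shares(stage_counts)

# List stages from largest to smallest share.
for stage, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{stage:<10} {share:5.1f}%")
```

With these counts, Serve comes out at about 54.4% and Agent at about 21.9% of the 434 listed roles.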
Title | Stage | Function | Location | First seen | AI score
Senior Deep Learning Engineer - AI for Wireless Systems
NVIDIA is seeking a Senior Deep Learning Engineer to develop AI-native wireless networks, integrating deep learning into signal processing and radio access technologies. The role involves designing, prototyping, implementing, training, and optimizing deep learning models for real-time inference and deployment on GPU platforms, collaborating with researchers and system engineers.
Stage: Serve, Post-train | Function: Engineering | Location: Hanoi, Vietnam +1 | First seen: Jan 21 | AI score: 8
Engineering Manager - AI for RAN and 6G Wireless Systems
NVIDIA is seeking an Engineering Manager to lead a team developing AI/ML models for 6G wireless networks. The role involves guiding model development, training, evaluation, and deployment, with a focus on integrating deep learning into signal processing and radio access technologies. Experience with Python, PyTorch/TensorFlow, and leading engineering teams is required.
Stage: Serve, Post-train | Function: Engineering | Location: Hanoi, Vietnam +1 | First seen: Jan 21 | AI score: 8
System Software Engineer - Deep Learning
System Software Engineer at NVIDIA focused on accelerating deep learning inference for autonomous driving systems using NVIDIA GPUs and DL accelerators. The role involves developing SDKs/frameworks for LLMs and state-of-the-art models, benchmarking, and optimizing for latency, accuracy, and power consumption. Requires experience with deep learning frameworks, DNN optimization, and C/C++.
Stage: Serve, Post-train | Function: Engineering | Location: Bangalore, India | First seen: Jan 21 | AI score: 8
Senior AI Infrastructure Software Engineer
Senior AI Infrastructure Software Engineer at NVIDIA, focusing on building and scaling infrastructure for AI agents and applications in chip design. The role involves designing, developing, and improving scalable infrastructure, driving performance and reliability improvements, and collaborating with research and hardware teams. Requires expertise in Python, distributed systems, microservices, and integrating LLMs/agent frameworks.
Stage: Agent, Serve | Function: Engineering | Location: Shanghai, China | First seen: Jan 19 | AI score: 8
Senior Deep Learning Compiler Engineer - PyTorch
Senior Deep Learning Compiler Engineer to develop and optimize PyTorch models for NVIDIA GPUs using compiler technology like Thunder, TorchDynamo, and TorchInductor. Focus on performance analysis and contributing to open-source AI ecosystem.
Stage: Serve | Function: Engineering | Location: Berlin, Germany +4 | First seen: Jan 10 | AI score: 8
Distinguished Engineer - Dynamo
Distinguished Engineer role focused on NVIDIA Dynamo, an AI inferencing platform. The role involves technical leadership, driving product direction, and contributing to open-source projects to achieve state-of-the-art performance and scalability for AI inference across modalities on NVIDIA hardware.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +4 · Remote | First seen: Jan 9 | AI score: 8
Principal Software Engineer – Large-Scale LLM Memory and Storage Systems
NVIDIA is seeking a Principal Systems Engineer to design and evolve a unified memory layer for large-scale LLM inference, focusing on KV-cache offload, reuse, and sharing across heterogeneous clusters. The role involves deep integration with LLM serving engines and optimizing performance across GPU, CPU, and storage tiers.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +2 · Remote | First seen: Jan 9 | AI score: 8
Senior Software Engineer, Deep Learning - MLIR TRT
Senior Software Engineer focused on developing and productizing deep learning solutions for autonomous driving vehicles, specifically involving compiler technology to optimize deep learning inference on NVIDIA hardware. The role requires expertise in deep learning frameworks, compiler technologies, and GPU programming.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Jan 9 | AI score: 8
Senior Software Engineer, Real-Time AI and Rendering - Holoscan SDK
Senior Software Engineer at NVIDIA to build the future of real-time AI for sensor-driven applications using the Holoscan Platform. The role involves architecting APIs, prototyping GPU-accelerated algorithms for computer vision, imaging, sensor fusion, and low-latency rendering, and integrating generative models and multimodal foundation models into real-time pipelines. Focus on enabling GPU-resident generative methods for perception, simulation, and robotics.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Jan 9 | AI score: 8
Manager, Deep Learning Algorithms
Manager for Deep Learning Algorithms at NVIDIA, focusing on leading engineering efforts for productizing DL models, optimizing inference, and collaborating with research teams to implement and improve algorithms. The role involves managing a team, aligning priorities, and developing the GPU-accelerated DL platform.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Jan 9 | AI score: 8
Senior Software Architect - Deep Learning and HPC Communications
This role focuses on architecting and implementing next-generation communication software and platforms for deep learning and high-performance computing (HPC) applications, specifically targeting the efficient scaling of GPU clusters. The work involves identifying performance bottlenecks, designing new communication technologies, exploring hardware/software co-design, and using simulation to evaluate performance at massive scales.
Stage: Serve | Function: Engineering | Location: Germany +4 · Remote | First seen: Jan 9 | AI score: 8
Senior Deep Learning Performance Architect
NVIDIA is seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures for AI and high-performance computing. Responsibilities include developing HW architectures for performance and energy efficiency, benchmarking AI workloads, creating simulation tools, and evaluating hardware features. Requires MS/PhD or equivalent experience with 4+ years in parallel computing architectures, GPU/ASIC architecture evaluation for training/inference, and strong Python/C++ skills.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +2 | First seen: Jan 9 | AI score: 8
Senior Research Engineer, Simulation
NVIDIA is seeking a Senior Research Engineer specializing in physics simulation for their General Embodied Agent Research (GEAR) group, focusing on Project GR00T, an initiative to build foundation models and full-stack technology for humanoid robots. The role involves developing and optimizing simulation environments, implementing control algorithms, building procedural generation pipelines, and deploying learned models to physical robots, with a strong emphasis on sim2real transfer.
Stage: Data, Ship | Function: Research | Location: Santa Clara, CA | First seen: Jan 9 | AI score: 8
Senior Software Architect, Advanced Development
Senior Software Architect focused on accelerating networking and building AI data centers, researching transport functions for AI workloads, and leading architectural efforts in distributed AI, deep learning, HPC, SDN, virtualization, and storage.
Stage: Serve | Function: Engineering | Location: Yokneam, Israel +1 | First seen: Dec '25 | AI score: 8
Senior Manager, Deep Learning Performance Architecture
NVIDIA is seeking an Engineering Manager to lead a Deep Learning Performance Architect Team. This role involves managing a team focused on analyzing deep learning networks and advancing deep learning computing systems through hardware/software co-design. Responsibilities include establishing team objectives, collaborating with software framework and hardware architecture teams, characterizing deep learning workloads, performance tuning, optimizing software stacks, and driving the evolution of next-generation hardware and software architectures.
Stage: Serve | Function: Engineering | Location: Shanghai, China +1 | First seen: Dec '25 | AI score: 8
Deep Learning Performance Architect
NVIDIA is seeking Software Engineers to join their Deep Learning Inference team, focusing on developing and optimizing GPU-accelerated deep learning kernels for inference. The role involves performance analysis, tuning, and collaboration with cross-functional teams on innovative solutions.
Stage: Serve | Function: Engineering | Location: Shanghai, China +1 | First seen: Dec '25 | AI score: 8
AI Developer Technology Engineer
NVIDIA is seeking an AI Developer Technology Engineer to work on optimizing AI techniques on GPU architectures and collaborate with customers and internal teams to influence future designs. The role involves studying and developing cutting-edge deep learning, graphs, and machine learning techniques, with a focus on performance analysis and optimization for GPUs. The engineer will also work with customers to understand their problems and provide AI solutions using GPUs, and collaborate with NVIDIA's internal teams to shape next-generation architectures and software platforms.
Stage: Serve | Function: Engineering | Location: Bangalore, India +1 | First seen: Nov '25 | AI score: 8
Director, Engineering – Software Engineering and AI Inferencing Platforms
NVIDIA is seeking an Engineering Director to lead and scale software engineering teams in Vietnam, focusing on AI Inferencing Platforms and AI data/factory initiatives. The role involves driving the design, architecture, and delivery of high-performance system software platforms, collaborating with global teams, and overseeing the development and optimization of AI delivery platforms like NIMs and Blueprints. Experience with cloud, data, accelerated computing, and managing large AI/ML product teams is required.
Stage: Serve, Data | Function: Engineering | Location: Hanoi, Vietnam +1 | First seen: Nov '25 | AI score: 8
Senior System Software Architect, HPC and AI Networking
NVIDIA is seeking a Senior System Software Architect to design and prototype scalable software systems for distributed AI training and inference, focusing on optimizing throughput, latency, and memory efficiency. The role involves developing and evaluating communication libraries, collaborating with AI framework teams, co-designing hardware features for AI acceleration, and contributing to runtime systems and protocol layers.
Stage: Serve, Post-train | Function: Engineering | Location: Beijing, China | First seen: Oct '25 | AI score: 8
Compute Architecture Software Engineer
NVIDIA is seeking an LLM Inference Software Engineer to accelerate LLM inference using GPU technology on the TRTLLM project. The role involves developing and optimizing software solutions, implementing GPU-based algorithms, and improving performance across diverse computing environments.
Stage: Serve | Function: Engineering | Location: Shanghai, China | First seen: Sep '25 | AI score: 8
Senior AI Storage Software Architect
NVIDIA is seeking a Senior AI Storage Software Architect to define and design the next generation of storage solutions for AI workloads, including training, inferencing, KV cache, and RAG. The role involves researching AI storage workloads, optimizing them, designing the storage software stack and APIs, leading POCs, and driving hardware features for DPUs and NICs. Requires 5+ years of storage experience and familiarity with AI applications and technologies.
Stage: Serve, Data | Function: Engineering | Location: Raanana, Israel +2 | First seen: May '25 | AI score: 8
Senior Manager, Engineering - AI Developer Tools
Senior Engineering Manager to lead a team building and evolving AI developer tools and technology for local and cloud GPUs, focusing on the developer experience for AI workflows and managing AI workloads on accelerated infrastructure.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: 1w ago | AI score: 7
Senior Software Engineer, AI Developer Tools
Senior Software Engineer to craft intuitive AI developer tools that make advanced AI workflows accessible and scalable across diverse accelerated infrastructure.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: 1w ago | AI score: 7
Senior DL Compiler Engineer -CUDA Tile
NVIDIA is hiring a Senior DL Compiler Engineer for the CUDA Tile team. This role involves designing and implementing compiler transformations, developing MLIR-based dialects and lowering passes, and optimizing performance for tile-based kernels on NVIDIA GPUs. The CUDA Tile programming model is a new addition to CUDA, shipped with CUDA 13.1.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: 1w ago | AI score: 7
Senior Software Engineer - Storage
Software Engineer role focused on designing, building, and operating exascale infrastructure for AI research and development at NVIDIA. The role involves managing distributed systems, large-scale storage, compute orchestration, and automation to support AI workloads across thousands of GPUs and petabytes of storage.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +3 · Remote | First seen: 2w ago | AI score: 7
Principal Developer, AI Networking
This role focuses on optimizing AI workloads, specifically LLM training and inference, on large-scale GPU and CPU clusters. The core responsibility is to profile, analyze, and optimize the performance of distributed systems with a strong emphasis on high-performance networking and communication libraries. The engineer will develop tools for performance analysis and collaborate across hardware and software teams to identify and resolve bottlenecks.
Stage: Serve, Pretrain | Function: Engineering | Location: Santa Clara, CA +3 · Remote | First seen: 2w ago | AI score: 7
AI Automation Engineer, Security
NVIDIA is seeking an AI Automation Engineer to build an AI-native, agent-enabled security organization. The role involves developing AI agents for security programs, building and maintaining infrastructure for agent workflows, translating business needs into agent solutions, architecting integrations for agents to interact with data systems, owning ETL and agentic data pipelines, and ensuring data security and governance. The engineer will also monitor and optimize data infrastructure, pipelines, and agents, and mentor other engineers.
Stage: Agent, Data | Function: Engineering | Location: Santa Clara, CA +3 · Remote | First seen: 2w ago | AI score: 7
Software R&D Engineer, RTL Optimization Tools
Software R&D Engineer at NVIDIA focused on developing internal EDA tools for RTL optimization. The role involves fusing parallel computing, machine learning, and novel algorithms to improve hardware design productivity. It explores the use of LLMs, GNNs, GANs, and Reinforcement Learning for optimization tasks, and requires strong C++ development skills with a focus on graph-based algorithms and optimization.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 | First seen: 2w ago | AI score: 7
Senior Software Engineer, AI Speed Infrastructure
Senior Software Engineer to build AI speed infrastructure for Tegra, focusing on a fast build, test, and validation system. The role involves designing AI-native, self-healing CI workflows, integrating reasoning agents for failure triage and automation, and optimizing the entire code-to-merge pipeline for C/C++ codebases, with a strong emphasis on performance engineering and developer experience.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA +1 | First seen: 2w ago | AI score: 7
Senior Systems Software Engineer, Kubernetes Scale - DGX Cloud
Senior Systems Software Engineer focused on scaling NVIDIA DGX Cloud's AI infrastructure, specifically optimizing Kubernetes and distributed inference serving for performance, cost, and reliability. The role involves end-to-end performance characterization, developing automated tests for AI workloads, debugging complex distributed systems, and contributing to open-source communities.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA +1 | First seen: 2w ago | AI score: 7
Senior Software Engineer, Mapping - Autonomous Vehicles
NVIDIA is seeking a Senior Software Engineer for their Autonomous Vehicles mapping team. The role involves designing and developing algorithms for map-based driving products, including architecture design, efficient C++ development, and integrating algorithmic solutions. Key responsibilities include researching and developing transformer-based models for graphs, implementing evaluation frameworks for LLMs, fine-tuning pre-trained models, and building automated map content analysis and map-building workflows. The role requires a background in computer vision, 3D geometry, and machine learning, with heavy AI tool usage for development, and strong prompt-crafting skills. The position is focused on building AI-powered solutions for self-driving cars, with a primary focus on agentic systems for navigation and map content, and secondary involvement in model fine-tuning.
Stage: Agent, Post-train | Function: Engineering | Location: Santa Clara, CA +1 | First seen: 2w ago | AI score: 7
GPU Architect - New College Grad 2026
NVIDIA is seeking new college graduates for its GPU Architecture Group to design and validate GPU profiling and performance telemetry features. The role involves hardware modeling, test development, and infrastructure, with a focus on the world's leading AI platform. Responsibilities include building and maintaining hardware models, writing and executing test plans, contributing to development infrastructure, and collaborating with cross-functional teams.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: 2w ago | AI score: 7
GPU System Architect
NVIDIA is seeking a GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves defining system architectures that tightly couple GPU compute, memory, and interconnects for optimal AI performance, scalability, and resilience. Responsibilities include architecting system topologies, defining high-speed interconnects, collaborating on RDMA hardware, using system models for analysis, and enabling hardware-software co-design.
Stage: Serve | Function: Engineering | Location: Bangalore, India | First seen: 2w ago | AI score: 7
NBU Manufacturing Test Engineer
NVIDIA is seeking a Manufacturing Test Engineer to design tools for product definition, data collection, test case execution, and results analysis. The role involves driving NBU diagnosis, analyzing test issues, qualifying equipment, and automating NBU product testing using AI and machine learning techniques. The engineer will also provide feedback on debug tools and support NBU product test setup.
Stage: Serve | Function: Engineering | Location: Hanoi, Vietnam | First seen: 2w ago | AI score: 7
Senior ML Platform Engineer
Senior ML Platform Engineer at NVIDIA responsible for architecting, building, and scaling high-performance ML infrastructure using Infrastructure-as-Code (IaC) practices. The role focuses on creating reliable, automated platforms for training and deploying advanced ML models on GPU systems, applying SRE principles, and developing internal automation for ML workflows. Requires strong software engineering skills in Python/Go, experience with Kubernetes/Docker, and a solid understanding of ML workflows.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: 3w ago | AI score: 7
Senior Software Engineer - Developer Tools for Deep Learning
Senior Software Engineer to enhance NVIDIA's developer tools for deep learning, focusing on neural network design and performance efficiency. The role involves partnering with management and architects, staying updated on research, and working with SOTA computer vision and LLMs.
Stage: Serve, Post-train | Function: Engineering | Location: MA · Remote | First seen: 3w ago | AI score: 7
Senior Deep Learning Hardware Modeling Architect - LPU
NVIDIA is seeking a Senior Deep Learning Hardware Modeling Architect to optimize AI inference speed and efficiency. The role involves driving architectural specifications, developing written specifications for component-level and system-level designs, and embodying these specifications in an executable model. The candidate will ensure high performance using C++ software practices, solid algorithms, and parallelism, and resolve performance and correctness issues across chip and hardware subsystems.
Stage: Serve | Function: Engineering | Location: CA +2 · Remote | First seen: 3w ago | AI score: 7
Senior AI Infrastructure Engineer - DGX Cloud
Senior AI Infrastructure Engineer responsible for designing, building, and maintaining large-scale production systems for NVIDIA's DGX Cloud, focusing on AI training and inferencing platforms. This role involves infrastructure automation, distributed systems, performance characterization, and ensuring reliability and availability of GPU cloud services.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: 3w ago | AI score: 7
Senior Compiler Engineer - DL
NVIDIA is seeking a Senior Compiler Engineer for its Deep Learning Compiler (DLC) team. This role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and hardware teams to accelerate deep learning inference performance. The compiler is critical for data centers, personal devices, automotive, and robotics, aiming for leading inference performance, fast build times, and reduced memory footprints.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: 3w ago | AI score: 7
Deep Learning Performance Architect, CUTLASS DSL
NVIDIA is seeking an engineer to develop and optimize CUTLASS DSL, a Python-native language for GPU kernel development, and its associated MLIR dialects and lowering passes. The role involves accelerating kernel compilation for NVIDIA's next-generation AI platforms, aiming for performance comparable to CUTLASS C++.
Stage: Serve | Function: Engineering | Location: Shanghai, China +1 | First seen: 3w ago | AI score: 7
Senior Software Architect, GPU Networking Research
NVIDIA is seeking a Senior Software Architect to focus on GPU Networking Research for accelerating AI workloads and building AI data centers. The role involves leading vision, architecture, design, and proof-of-concept development for future GPU Networking offerings, identifying new technologies, and working with the community. Requires M.Sc./Ph.D. or equivalent experience, 8+ years in systems architecture, and experience in virtualization, networking, storage, and OS drivers. Experience in performance profiling, optimization, and HW offloads is crucial. A research track record and knowledge of Deep Learning frameworks are desirable.
Stage: Serve | Function: Engineering | Location: Zurich, Switzerland +1 · Remote | First seen: 3w ago | AI score: 7
Principal Simulation Engineer, Industrial Physics and Robotics
NVIDIA is seeking a Principal Simulation Engineer to lead the development of advanced physically based simulation systems for robotics and industrial digital twins. This role requires deep expertise in multibody dynamics, contact, friction, and flexible bodies, with a focus on integrating simulation with robotics workflows and applying modern AI-assisted and agentic development. The ideal candidate has a track record of building production-level simulation software and experience validating simulators against physical systems.
Stage: Agent | Function: Engineering | Location: Switzerland +5 · Remote | First seen: 3w ago | AI score: 7
Senior Software Engineer, CUTLASS Kernels 
Senior Software Engineer to develop and optimize high-performance deep learning kernels (e.g., GEMM, attention, convolution) using CUTLASS CUDA C++ and Python DSL for NVIDIA GPUs and future architectures. The role involves optimizing kernels for peak throughput, collaborating with various NVIDIA teams (architecture, compiler, libraries, DL frameworks), and requires strong C++ and CUDA experience, understanding of computer architecture, and experience with parallel programming languages targeting accelerators.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +4 | First seen: 3w ago | AI score: 7
Senior Software Engineer, CUTLASS Performance
Senior Software Engineer role focused on optimizing the performance of CUTLASS, a high-performance linear algebra and Tensor Core primitive ecosystem for NVIDIA GPUs. The role involves benchmarking deep learning models, identifying performance gaps, developing tooling for optimization, and acting as a performance representative across NVIDIA teams.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +4 | First seen: 3w ago | AI score: 7
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architecture, analyze performance of deep learning algorithms on different architectures, identify bottlenecks, and explore new features and hardware capabilities. Requires a strong background in computer architecture and experience with deep learning platforms and frameworks.
Stage: Serve | Function: Engineering | Location: Shanghai, China | First seen: 3w ago | AI score: 7
Principal Architect, System Software - Orbital Data Center
NVIDIA is seeking a Principal Architect to lead the system software architecture for their Orbital Data Center (ODC) modules, specifically Space-1. This role involves designing and implementing a resilient, production-ready inference platform for the harsh environment of low-Earth orbit, covering the full stack from firmware to AI workloads. The architect will collaborate with hardware teams, drive customer use cases, and ensure the platform operates reliably for 5-year missions, enabling AI adoption in space.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: 4w ago | AI score: 7
Software Engineer, TensorRT Specialized Platforms - New College Grad 2025
Software Engineer role focused on developing and optimizing high-performance deep learning inference software (TensorRT) for specialized platforms. Requires strong C++ skills, familiarity with deep learning frameworks, and interest in performance optimization and systems programming.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: 4w ago | AI score: 7
Senior Software Engineer, Agentic Systems
Senior Software Engineer to build NeMo Platform, focusing on NeMo Evaluator for developing, evaluating, deploying, and operating AI systems at scale. The role involves designing and implementing Python APIs, SDK workflows, and plugin interfaces for building, measuring, and improving agents, with a strong emphasis on agentic development and automated improvement.
Stage: Eval Gate, Agent | Function: Engineering | Location: Santa Clara, CA +4 · Remote | First seen: 4w ago | AI score: 7
Infrastructure Software Engineer, Deep Learning Libraries
NVIDIA is seeking an Infrastructure Software Engineer to enable next-generation deep learning libraries by designing and developing scalable automation for build, test, integration, and release processes. The role involves developing and deploying AI agents to automate the software development cycle and configuring industry-standard tools, with a focus on open-source products like CUTLASS.
Stage: Agent | Function: Engineering | Location: Shanghai, China +1 | First seen: 4w ago | AI score: 7
Deep Learning Compiler Engineer - CUDA
NVIDIA is seeking a Deep Learning Compiler Engineer to design and implement DSLs and compiler cores for emerging GPU architectures, focusing on optimizing performance for AI/LLM workloads and integrating with AI/ML frameworks.
Stage: Serve | Function: Engineering | Location: Shanghai, China +1 | First seen: 4w ago | AI score: 7