AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

© 2026 AI Hire Signal·Not affiliated with companies shown

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring: 440 / 623
Momentum (4w): ↓ -366 (-50%) · 360 opens last 4w · 726 prior 4w
Salary range: $100k–$575k (avg $262k) · USD · disclosed roles only
Tracked since: May '25 · last role 5w ago
Hiring velocity (new roles per week, oldest first)

Dec 30: 1
Mar 10: 1 · Mar 24: 1
Apr 28: 1
May 12: 4 · May 19: 5 · May 26: 3
Jun 2: 3 · Jun 9: 2 · Jun 16: 1 · Jun 23: 2 · Jun 30: 3
Jul 7: 4 · Jul 14: 1 · Jul 28: 2
Aug 11: 4 · Aug 18: 6 · Aug 25: 2
Sep 1: 3 · Sep 15: 8 · Sep 22: 3 · Sep 29: 6
Oct 6: 2 · Oct 13: 2 · Oct 20: 3 · Oct 27: 6
Nov 3: 9 · Nov 10: 8 · Nov 17: 8 · Nov 24: 4
Dec 1: 11 · Dec 8: 9 · Dec 15: 14 · Dec 22: 10 · Dec 29: 8
Jan 5: 107 · Jan 12: 22 · Jan 19: 45 · Jan 26: 32
Feb 2: 59 · Feb 9: 64 · Feb 16: 63 · Feb 23: 83
Mar 2: 83 · Mar 9: 88 · Mar 16: 97 · Mar 23: 72 · Mar 30: 215
Apr 6: 158 · Apr 13: 250 · Apr 20: 199 · Apr 27: 332
May 4: 304 · May 11: 189 · May 18: 131 · May 25: 102
Jun 1: 129 · Jun 8: 122 · Jun 15: 49 · Jun 22: 60

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
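The page does not say how its percentage changes are computed. A minimal sketch, assuming they are plain relative change against the prior period, reproduces both the 30-day figure above (218 → 110) and the 4-week momentum card (726 → 360). The function name `pct_change` is hypothetical and not taken from the site.

```python
def pct_change(prior: int, current: int) -> float:
    """Relative change from the prior period to the current one, in percent."""
    if prior == 0:
        # Assumption: with no roles in the prior period the ratio is undefined.
        raise ValueError("prior period has no roles; percent change is undefined")
    return (current - prior) / prior * 100


# 30-day FAQ figures quoted on the page: 218 -> 110
faq_change = pct_change(218, 110)

# 4-week momentum figures quoted on the page: 726 -> 360
momentum_change = pct_change(726, 360)
momentum_delta = 360 - 726  # absolute difference shown next to the arrow

print(f"30-day change: {faq_change:.1f}%")
print(f"4-week change: {momentum_change:.1f}% ({momentum_delta:+d} roles)")
```

Under this assumption both values round to the -50% shown on the page, and the absolute 4-week difference matches the -366 displayed in the momentum card.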

Jobs (1,674)

434 AI · 1824 total active
Filtered by: Function = Engineering
Show: Active only · AI only (≥ 7)
Stage: Data (23), Pretrain (20), Post-train (28), Serve (265), Agent (102), Eval Gate (8), Ship (41)
Function: Engineering (1674), Research (68), Product (10)
Country: United States (945), Israel (413), India (146), China (119), Taiwan (78), Germany (34), Switzerland (26), United Kingdom (25), Vietnam (25), Canada (19), Poland (19), France (10), Italy (7), Netherlands (7), Singapore (6), South Korea (5), Spain (4), Ukraine (4), Hungary (3), Japan (3), Romania (3), Czech Republic (2), Denmark (2), Finland (2), Palestine (2), Sweden (2), Armenia (1), Brazil (1)
Sort: AI score · Recent · Title

Title | Stage | Function | Location | First seen | AI score
Senior Deep Learning Compiler Engineer - PyTorch
Senior Deep Learning Compiler Engineer to develop and optimize PyTorch models for NVIDIA GPUs using compiler technology like Thunder, TorchDynamo, and TorchInductor. Focus on performance analysis and contributing to open-source AI ecosystem.
Serve | Engineering | Berlin, Germany +4 | Jan 10 | 8
Distinguished Engineer - Dynamo
Distinguished Engineer role focused on NVIDIA Dynamo, an AI inferencing platform. The role involves technical leadership, driving product direction, and contributing to open-source projects to achieve state-of-the-art performance and scalability for AI inference across modalities on NVIDIA hardware.
Serve | Engineering | Santa Clara, CA +4 · Remote | Jan 9 | 8
Principal Software Engineer – Large-Scale LLM Memory and Storage Systems
NVIDIA is seeking a Principal Systems Engineer to design and evolve a unified memory layer for large-scale LLM inference, focusing on KV-cache offload, reuse, and sharing across heterogeneous clusters. The role involves deep integration with LLM serving engines and optimizing performance across GPU, CPU, and storage tiers.
Serve | Engineering | Santa Clara, CA +2 · Remote | Jan 9 | 8
Senior Software Engineer, Deep Learning - MLIR TRT
Senior Software Engineer focused on developing and productizing deep learning solutions for autonomous driving vehicles, specifically involving compiler technology to optimize deep learning inference on NVIDIA hardware. The role requires expertise in deep learning frameworks, compiler technologies, and GPU programming.
Serve | Engineering | Santa Clara, CA | Jan 9 | 8
Senior Software Engineer, Real-Time AI and Rendering - Holoscan SDK
Senior Software Engineer at NVIDIA to build the future of real-time AI for sensor-driven applications using the Holoscan Platform. The role involves architecting APIs, prototyping GPU-accelerated algorithms for computer vision, imaging, sensor fusion, and low-latency rendering, and integrating generative models and multimodal foundation models into real-time pipelines. Focus on enabling GPU-resident generative methods for perception, simulation, and robotics.
Agent, Serve | Engineering | Santa Clara, CA | Jan 9 | 8
Manager, Deep Learning Algorithms
Manager for Deep Learning Algorithms at NVIDIA, focusing on leading engineering efforts for productizing DL models, optimizing inference, and collaborating with research teams to implement and improve algorithms. The role involves managing a team, aligning priorities, and developing the GPU-accelerated DL platform.
Serve | Engineering | Santa Clara, CA | Jan 9 | 8
Senior Software Architect - Deep Learning and HPC Communications
This role focuses on architecting and implementing next-generation communication software and platforms for deep learning and high-performance computing (HPC) applications, specifically targeting the efficient scaling of GPU clusters. The work involves identifying performance bottlenecks, designing new communication technologies, exploring hardware/software co-design, and using simulation to evaluate performance at massive scales.
Serve | Engineering | Germany +4 · Remote | Jan 9 | 8
Senior Deep Learning Performance Architect
NVIDIA is seeking a Senior Deep Learning Performance Architect to analyze and develop next-generation architectures for AI and high-performance computing. Responsibilities include developing HW architectures for performance and energy efficiency, benchmarking AI workloads, creating simulation tools, and evaluating hardware features. Requires MS/PhD or equivalent experience with 4+ years in parallel computing architectures, GPU/ASIC architecture evaluation for training/inference, and strong Python/C++ skills.
Serve | Engineering | Santa Clara, CA +2 | Jan 9 | 8
Senior Software Architect, Advanced Development
Senior Software Architect focused on accelerating networking and building AI data centers, researching transport functions for AI workloads, and leading architectural efforts in distributed AI, deep learning, HPC, SDN, virtualization, and storage.
Serve | Engineering | Yokneam, Israel +1 | Dec '25 | 8
Senior Manager, Deep Learning Performance Architecture
NVIDIA is seeking an Engineering Manager to lead a Deep Learning Performance Architect Team. This role involves managing a team focused on analyzing deep learning networks and advancing deep learning computing systems through hardware/software co-design. Responsibilities include establishing team objectives, collaborating with software framework and hardware architecture teams, characterizing deep learning workloads, performance tuning, optimizing software stacks, and driving the evolution of next-generation hardware and software architectures.
Serve | Engineering | Shanghai, China +1 | Dec '25 | 8
Deep Learning Performance Architect
NVIDIA is seeking Software Engineers to join their Deep Learning Inference team, focusing on developing and optimizing GPU-accelerated deep learning kernels for inference. The role involves performance analysis, tuning, and collaboration with cross-functional teams on innovative solutions.
Serve | Engineering | Shanghai, China +1 | Dec '25 | 8
AI Developer Technology Engineer
NVIDIA is seeking an AI Developer Technology Engineer to work on optimizing AI techniques on GPU architectures and collaborate with customers and internal teams to influence future designs. The role involves studying and developing cutting-edge deep learning, graphs, and machine learning techniques, with a focus on performance analysis and optimization for GPUs. The engineer will also work with customers to understand their problems and provide AI solutions using GPUs, and collaborate with NVIDIA's internal teams to shape next-generation architectures and software platforms.
Serve | Engineering | Bangalore, India +1 | Nov '25 | 8
Director, Engineering – Software Engineering and AI Inferencing Platforms
NVIDIA is seeking an Engineering Director to lead and scale software engineering teams in Vietnam, focusing on AI Inferencing Platforms and AI data/factory initiatives. The role involves driving the design, architecture, and delivery of high-performance system software platforms, collaborating with global teams, and overseeing the development and optimization of AI delivery platforms like NIMs and Blueprints. Experience with cloud, data, accelerated computing, and managing large AI/ML product teams is required.
Serve, Data | Engineering | Hanoi, Vietnam +1 | Nov '25 | 8
Senior System Software Architect, HPC and AI Networking
NVIDIA is seeking a Senior System Software Architect to design and prototype scalable software systems for distributed AI training and inference, focusing on optimizing throughput, latency, and memory efficiency. The role involves developing and evaluating communication libraries, collaborating with AI framework teams, co-designing hardware features for AI acceleration, and contributing to runtime systems and protocol layers.
Serve, Post-train | Engineering | Beijing, China | Oct '25 | 8
Compute Architecture Software Engineer
NVIDIA is seeking an LLM Inference Software Engineer to accelerate LLM inference using GPU technology on the TRTLLM project. The role involves developing and optimizing software solutions, implementing GPU-based algorithms, and improving performance across diverse computing environments.
Serve | Engineering | Shanghai, China | Sep '25 | 8
Senior AI Storage Software Architect
NVIDIA is seeking a Senior AI Storage Software Architect to define and design the next generation of storage solutions for AI workloads, including training, inferencing, KV cache, and RAG. The role involves researching AI storage workloads, optimizing them, designing the storage software stack and APIs, leading POCs, and driving hardware features for DPUs and NICs. Requires 5+ years of storage experience and familiarity with AI applications and technologies.
Serve, Data | Engineering | Raanana, Israel +2 | May '25 | 8
Senior Manager, Engineering - AI Developer Tools
Senior Engineering Manager to lead a team building and evolving AI developer tools and technology for local and cloud GPUs, focusing on the developer experience for AI workflows and managing AI workloads on accelerated infrastructure.
Serve, Agent | Engineering | Santa Clara, CA +1 · Remote | 1w ago | 7
Senior Software Engineer, AI Developer Tools
Senior Software Engineer to craft intuitive AI developer tools that make advanced AI workflows accessible and scalable across diverse accelerated infrastructure.
Agent | Engineering | Santa Clara, CA +1 · Remote | 1w ago | 7
Senior DL Compiler Engineer - CUDA Tile
NVIDIA is hiring a Senior DL Compiler Engineer for the CUDA Tile team. This role involves designing and implementing compiler transformations, developing MLIR-based dialects and lowering passes, and optimizing performance for tile-based kernels on NVIDIA GPUs. The CUDA Tile programming model is a new addition to CUDA, shipped with CUDA 13.1.
Serve | Engineering | Santa Clara, CA +5 · Remote | 1w ago | 7
Senior Software Engineer - Storage
Software Engineer role focused on designing, building, and operating exascale infrastructure for AI research and development at NVIDIA. The role involves managing distributed systems, large-scale storage, compute orchestration, and automation to support AI workloads across thousands of GPUs and petabytes of storage.
Serve | Engineering | Santa Clara, CA +3 · Remote | 2w ago | 7
Principal Developer, AI Networking
This role focuses on optimizing AI workloads, specifically LLM training and inference, on large-scale GPU and CPU clusters. The core responsibility is to profile, analyze, and optimize the performance of distributed systems with a strong emphasis on high-performance networking and communication libraries. The engineer will develop tools for performance analysis and collaborate across hardware and software teams to identify and resolve bottlenecks.
Serve, Pretrain | Engineering | Santa Clara, CA +3 · Remote | 2w ago | 7
AI Automation Engineer, Security
NVIDIA is seeking an AI Automation Engineer to build an AI-native, agent-enabled security organization. The role involves developing AI agents for security programs, building and maintaining infrastructure for agent workflows, translating business needs into agent solutions, architecting integrations for agents to interact with data systems, owning ETL and agentic data pipelines, and ensuring data security and governance. The engineer will also monitor and optimize data infrastructure, pipelines, and agents, and mentor other engineers.
Agent, Data | Engineering | Santa Clara, CA +3 · Remote | 2w ago | 7
Software R&D Engineer, RTL Optimization Tools
Software R&D Engineer at NVIDIA focused on developing internal EDA tools for RTL optimization. The role involves fusing parallel computing, machine learning, and novel algorithms to improve hardware design productivity. It explores the use of LLMs, GNNs, GANs, and Reinforcement Learning for optimization tasks, and requires strong C++ development skills with a focus on graph-based algorithms and optimization.
Serve | Engineering | Santa Clara, CA +1 | 2w ago | 7
Senior Software Engineer, AI Speed Infrastructure
Senior Software Engineer to build AI speed infrastructure for Tegra, focusing on a fast build, test, and validation system. The role involves designing AI-native, self-healing CI workflows, integrating reasoning agents for failure triage and automation, and optimizing the entire code-to-merge pipeline for C/C++ codebases, with a strong emphasis on performance engineering and developer experience.
Agent | Engineering | Santa Clara, CA +1 | 2w ago | 7
Senior Systems Software Engineer, Kubernetes Scale - DGX Cloud
Senior Systems Software Engineer focused on scaling NVIDIA DGX Cloud's AI infrastructure, specifically optimizing Kubernetes and distributed inference serving for performance, cost, and reliability. The role involves end-to-end performance characterization, developing automated tests for AI workloads, debugging complex distributed systems, and contributing to open-source communities.
Serve, Agent | Engineering | Santa Clara, CA +1 | 2w ago | 7
Senior Software Engineer, Mapping - Autonomous Vehicles
NVIDIA is seeking a Senior Software Engineer for their Autonomous Vehicles mapping team. The role involves designing and developing algorithms for map-based driving products, including architecture design, efficient C++ development, and integrating algorithmic solutions. Key responsibilities include researching and developing transformer-based models for graphs, implementing evaluation frameworks for LLMs, fine-tuning pre-trained models, and building automated map content analysis and map-building workflows. The role requires a background in computer vision, 3D geometry, and machine learning, with heavy AI tool usage for development, and strong prompt-crafting skills. The position is focused on building AI-powered solutions for self-driving cars, with a primary focus on agentic systems for navigation and map content, and secondary involvement in model fine-tuning.
Agent, Post-train | Engineering | Santa Clara, CA +1 | 2w ago | 7
GPU Architect - New College Grad 2026
NVIDIA is seeking new college graduates for its GPU Architecture Group to design and validate GPU profiling and performance telemetry features. The role involves hardware modeling, test development, and infrastructure, with a focus on the world's leading AI platform. Responsibilities include building and maintaining hardware models, writing and executing test plans, contributing to development infrastructure, and collaborating with cross-functional teams.
Serve | Engineering | Santa Clara, CA | 2w ago | 7
GPU System Architect
NVIDIA is seeking a GPU System Architect to design multi-GPU scale-up and scale-out datacenter systems for AI and HPC. The role involves defining system architectures that tightly couple GPU compute, memory, and interconnects for optimal AI performance, scalability, and resilience. Responsibilities include architecting system topologies, defining high-speed interconnects, collaborating on RDMA hardware, using system models for analysis, and enabling hardware-software co-design.
Serve | Engineering | Bangalore, India | 2w ago | 7
NBU Manufacturing Test Engineer
NVIDIA is seeking a Manufacturing Test Engineer to design tools for product definition, data collection, test case execution, and results analysis. The role involves driving NBU diagnosis, analyzing test issues, qualifying equipment, and automating NBU product testing using AI and machine learning techniques. The engineer will also provide feedback on debug tools and support NBU product test setup.
Serve | Engineering | Hanoi, Vietnam | 2w ago | 7
Senior ML Platform Engineer
Senior ML Platform Engineer at NVIDIA responsible for architecting, building, and scaling high-performance ML infrastructure using Infrastructure-as-Code (IaC) practices. The role focuses on creating reliable, automated platforms for training and deploying advanced ML models on GPU systems, applying SRE principles, and developing internal automation for ML workflows. Requires strong software engineering skills in Python/Go, experience with Kubernetes/Docker, and a solid understanding of ML workflows.
Serve | Engineering | Santa Clara, CA +5 · Remote | 3w ago | 7
Senior Software Engineer - Developer Tools for Deep Learning
Senior Software Engineer to enhance NVIDIA's developer tools for deep learning, focusing on neural network design and performance efficiency. The role involves partnering with management and architects, staying updated on research, and working with SOTA computer vision and LLMs.
Serve, Post-train | Engineering | MA · Remote | 3w ago | 7
Senior Deep Learning Hardware Modeling Architect - LPU
NVIDIA is seeking a Senior Deep Learning Hardware Modeling Architect to optimize AI inference speed and efficiency. The role involves driving architectural specifications, developing written specifications for component-level and system-level designs, and embodying these specifications in an executable model. The candidate will ensure high performance using C++ software practices, solid algorithms, and parallelism, and resolve performance and correctness issues across chip and hardware subsystems.
Serve | Engineering | CA +2 · Remote | 3w ago | 7
Senior AI Infrastructure Engineer - DGX Cloud
Senior AI Infrastructure Engineer responsible for designing, building, and maintaining large-scale production systems for NVIDIA's DGX Cloud, focusing on AI training and inferencing platforms. This role involves infrastructure automation, distributed systems, performance characterization, and ensuring reliability and availability of GPU cloud services.
Serve | Engineering | Santa Clara, CA +1 · Remote | 3w ago | 7
Senior Compiler Engineer - DL
NVIDIA is seeking a Senior Compiler Engineer for its Deep Learning Compiler (DLC) team. This role involves analyzing deep learning networks, developing compiler optimization algorithms, and collaborating with framework and hardware teams to accelerate deep learning inference performance. The compiler is critical for data centers, personal devices, automotive, and robotics, aiming for leading inference performance, fast build times, and reduced memory footprints.
Serve | Engineering | Santa Clara, CA +5 · Remote | 3w ago | 7
Deep Learning Performance Architect, CUTLASS DSL
NVIDIA is seeking an engineer to develop and optimize CUTLASS DSL, a Python-native language for GPU kernel development, and its associated MLIR dialects and lowering passes. The role involves accelerating kernel compilation for NVIDIA's next-generation AI platforms, aiming for performance comparable to CUTLASS C++.
Serve | Engineering | Shanghai, China +1 | 3w ago | 7
Senior Software Architect, GPU Networking Research
NVIDIA is seeking a Senior Software Architect to focus on GPU Networking Research for accelerating AI workloads and building AI data centers. The role involves leading vision, architecture, design, and proof-of-concept development for future GPU Networking offerings, identifying new technologies, and working with the community. Requires M.Sc./Ph.D. or equivalent experience, 8+ years in systems architecture, and experience in virtualization, networking, storage, and OS drivers. Experience in performance profiling, optimization, and HW offloads is crucial. A research track record and knowledge of Deep Learning frameworks are desirable.
Serve | Engineering | Zurich, Switzerland +1 · Remote | 3w ago | 7
Principal Simulation Engineer, Industrial Physics and Robotics
NVIDIA is seeking a Principal Simulation Engineer to lead the development of advanced physically based simulation systems for robotics and industrial digital twins. This role requires deep expertise in multibody dynamics, contact, friction, and flexible bodies, with a focus on integrating simulation with robotics workflows and applying modern AI-assisted and agentic development. The ideal candidate has a track record of building production-level simulation software and experience validating simulators against physical systems.
Agent | Engineering | Switzerland +5 · Remote | 3w ago | 7
Senior Software Engineer, CUTLASS Kernels 
Senior Software Engineer to develop and optimize high-performance deep learning kernels (e.g., GEMM, attention, convolution) using CUTLASS CUDA C++ and Python DSL for NVIDIA GPUs and future architectures. The role involves optimizing kernels for peak throughput, collaborating with various NVIDIA teams (architecture, compiler, libraries, DL frameworks), and requires strong C++ and CUDA experience, understanding of computer architecture, and experience with parallel programming languages targeting accelerators.
Serve | Engineering | Santa Clara, CA +4 | 3w ago | 7
Senior Software Engineer, CUTLASS Performance
Senior Software Engineer role focused on optimizing the performance of CUTLASS, a high-performance linear algebra and Tensor Core primitive ecosystem for NVIDIA GPUs. The role involves benchmarking deep learning models, identifying performance gaps, developing tooling for optimization, and acting as a performance representative across NVIDIA teams.
Serve | Engineering | Santa Clara, CA +4 | 3w ago | 7
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to optimize deep learning hardware and software architecture, analyze performance of deep learning algorithms on different architectures, identify bottlenecks, and explore new features and hardware capabilities. Requires a strong background in computer architecture and experience with deep learning platforms and frameworks.
Serve | Engineering | Shanghai, China | 3w ago | 7
Principal Architect, System Software - Orbital Data Center
NVIDIA is seeking a Principal Architect to lead the system software architecture for their Orbital Data Center (ODC) modules, specifically Space-1. This role involves designing and implementing a resilient, production-ready inference platform for the harsh environment of low-Earth orbit, covering the full stack from firmware to AI workloads. The architect will collaborate with hardware teams, drive customer use cases, and ensure the platform operates reliably for 5-year missions, enabling AI adoption in space.
Serve | Engineering | Santa Clara, CA +1 · Remote | 4w ago | 7
Software Engineer, TensorRT Specialized Platforms - New College Grad 2025
Software Engineer role focused on developing and optimizing high-performance deep learning inference software (TensorRT) for specialized platforms. Requires strong C++ skills, familiarity with deep learning frameworks, and interest in performance optimization and systems programming.
Serve | Engineering | Santa Clara, CA | 4w ago | 7
Senior Software Engineer, Agentic Systems
Senior Software Engineer to build NeMo Platform, focusing on NeMo Evaluator for developing, evaluating, deploying, and operating AI systems at scale. The role involves designing and implementing Python APIs, SDK workflows, and plugin interfaces for building, measuring, and improving agents, with a strong emphasis on agentic development and automated improvement.
Eval Gate, Agent | Engineering | Santa Clara, CA +4 · Remote | 4w ago | 7
Infrastructure Software Engineer, Deep Learning Libraries
NVIDIA is seeking an Infrastructure Software Engineer to enable next-generation deep learning libraries by designing and developing scalable automation for build, test, integration, and release processes. The role involves developing and deploying AI agents to automate the software development cycle and configuring industry-standard tools, with a focus on open-source products like CUTLASS.
Agent | Engineering | Shanghai, China +1 | 4w ago | 7
Deep Learning Compiler Engineer - CUDA
NVIDIA is seeking a Deep Learning Compiler Engineer to design and implement DSLs and compiler cores for emerging GPU architectures, focusing on optimizing performance for AI/LLM workloads and integrating with AI/ML frameworks.
Serve | Engineering | Shanghai, China +1 | 4w ago | 7
Developer Technology Engineer, AI
NVIDIA Developer Technology Engineer focused on optimizing AI and deep learning applications on GPU architectures, working with customers to provide AI solutions, and collaborating with internal teams to influence future hardware and software design.
Serve | Engineering | Beijing, China +2 | 4w ago | 7
Lead Algorithm Engineer, Map-Perception Fusion
Lead engineer for autonomous vehicle systems, focusing on fusing map and perception data to create real-time 3D world models for navigation. This role involves architecting scalable systems, advancing mapless driving capabilities, and improving scenario understanding through techniques like static obstacle modeling and occupancy grids.
Agent | Engineering | Santa Clara, CA +3 · Remote | 4w ago | 7
Senior HPC AI Cluster Engineer
NVIDIA is seeking an experienced HPC-AI Engineer to join their Networking Clusters Solutions Infrastructure team. The role involves designing, implementing, and maintaining large-scale HPC/AI clusters, managing job schedulers, developing CI/CD pipelines, and automating infrastructure deployment and monitoring. The engineer will work with cutting-edge hardware and software, support R&D, and engage in POCs for future improvements.
Serve | Engineering | Germany +5 · Remote | 5w ago | 7
Senior Power Analysis and Optimization Engineer
This role focuses on applying AI, ML, and LLMs to optimize power efficiency in NVIDIA's GPUs and SoCs. The engineer will develop and productionize ML/RL-based models for power analysis and optimization, design and train custom LLMs for interpreting power data and recommending improvements, and apply AI to tune power-efficient configurations. The role involves analyzing power data, partnering with cross-functional teams, and automating flows.
Serve, Data | Engineering | Santa Clara, CA +1 | 5w ago | 7
Software Manager, Planning and Control - Autonomous Vehicles
Software Manager for Planning and Control in Autonomous Vehicles at NVIDIA, leading a team to productize and deliver ADAS and autonomy functions. Responsibilities include setting algorithmic direction, designing software architecture, building testing infrastructure, and managing a team of developers. Requires significant software product and management experience, C++/C proficiency, and Agile/Linux environment familiarity. Experience shipping ADAS/Autonomy functions, building from scratch, robust testing infrastructure, algorithm development for physical systems, and automotive systems are highly desirable.
Ship | Engineering | Shanghai, China +2 | 5w ago | 7