AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-366 -50%
360 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 5w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
60 new roles
22

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (723)

434 AI · 1824 total active
Show
Active onlyAI only (≥ 7)
Stage
AllData · 28Pretrain · 30Post-train · 51Serve · 356Agent · 192Eval Gate · 11Ship · 55
Function
AllEngineering · 627Research · 82Product · 14
Country
AllUnited States · 439China · 93Israel · 54Germany · 36Switzerland · 31India · 26United Kingdom · 24Poland · 17Vietnam · 13Canada · 12Singapore · 11France · 10Netherlands · 9Italy · 8Taiwan · 6Hong Kong · 4Japan · 4Spain · 3Australia · 2Czech Republic · 2Finland · 2Hungary · 2South Korea · 2Armenia · 1Brazil · 1Mexico · 1Romania · 1Saudi Arabia · 1Sweden · 1United Arab Emirates · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Senior Deep Learning Algorithm Engineer
Senior Deep Learning Algorithm Engineer at NVIDIA to design, develop, and optimize core AI frameworks (Megatron Core, NeMo Framework) for LLM and Multimodal foundation model pretraining and post-training. The role involves implementing distributed training algorithms, model parallel paradigms, performance tuning, and expanding toolkits, working across the full model lifecycle from orchestration to deployment on NVIDIA GPU architectures.
PretrainPost-trainEngineeringSanta Clara, CA +1 · RemoteApr 49
Senior DL Algorithms Engineer - Inference Performance
Senior DL Algorithms Engineer focused on optimizing inference performance for language and multimodal models using NVIDIA's inference stack (NIMs, TRT-LLM). Role involves profiling, analysis, and collaboration across hardware/software layers to maximize performance on GPUs.
151–200 of 723← Prev1…345…15Next →
Serve
Engineering
Santa Clara, CA +1 · Remote
Apr 4
9
High-Performance LLM Training Engineer - New College Grad 2026
NVIDIA is seeking an experienced engineer to optimize LLM training workloads on high-performance computing systems, focusing on software stack optimization for thousands of GPUs and influencing future hardware roadmaps. The role involves performance analysis, profiling, and implementation across various layers of the deep learning platform, including building tools for automation and contributing to MLPerf benchmarks.
DataEngineeringSanta Clara, CAApr 49
Senior Research Scientist, AI Accelerator Design and VLSI
Research Scientist focused on AI accelerator hardware design, VLSI, and AI HW/SW co-design, applying machine learning and generative AI to hardware design flows and optimization techniques like quantization.
ServeResearchSanta Clara, CAApr 49
Deep Learning Performance Software Engineer
Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks.
ServeEngineeringShanghai, ChinaApr 49
Senior Research Scientist, Electronic Design Automation
NVIDIA is seeking a Senior Research Scientist to conduct research at the intersection of AI, GPU computing, and Electronic Design Automation (EDA). The role involves defining and conducting original research in EDA algorithms, VLSI design methodology, and advanced machine learning techniques, with a focus on applying deep learning and GPU acceleration to improve chip design tools and flows. The scientist will collaborate with internal teams and the research community, publishing findings and potentially translating research into products.
Post-trainResearchSanta Clara, CAApr 49
AI Safety Scientist, Deep Learning
Research Scientist focused on AI safety for multilingual, multimodal LLMs, including content safety, ML fairness, bias detection, and hallucination mitigation. The role involves developing datasets, moderator models, and training techniques (SFT, RL), and contributing to safety tools.
Post-trainDataResearchHo Chi Minh City, Vietnam +1Apr 49
Research Scientist, AI Accelerator Design and VLSI - New College Grad 2026
Research Scientist role focused on AI Accelerator Design and VLSI, involving AI HW/SW Co-Design, quantization, and applying generative AI to hardware design. Requires a PhD and experience in VLSI, computer architecture, or numerical algorithms for AI. Collaborates on research prototypes and publishes findings.
ServeResearchSanta Clara, CAApr 49
Research Scientist, Quantum Computing and AI - New College Grad 2026
Research Scientist role at NVIDIA focusing on the intersection of AI and Quantum Computing. The role involves training AI models for quantum systems, advancing research in quantum simulation, and developing GPU-accelerated quantum tools. Requires a PhD, strong programming skills (Python, C++, PyTorch, JAX, CUDA), and a publication record in AI for quantum science or accelerated quantum simulations.
DataResearchSanta Clara, CAApr 49
Senior Applied Deep Learning Research Scientist, Efficiency
Research Scientist at NVIDIA focused on making deep learning models more efficient through techniques like quantization, sparsity, and optimized architectures. The role involves researching low-bit representations, pruning, and developing new algorithms for both training and inference, with a focus on understanding the root causes of efficiency gains and losses. The work directly influences next-generation hardware and state-of-the-art models, with opportunities for open-sourcing or publishing findings.
Post-trainServeResearchSanta Clara, CA +1Apr 49
Senior Research Scientist, Multi-Modal Language Models
Senior Research Scientist at NVIDIA focused on Multi-Modal Language Models, driving Nemotron technology. The role involves improving model abilities, generalization, and efficiency through data synthesis, retraining, and developing training recipes for mixed modalities (text, image, video, audio). It also includes translating research into production, exploring evaluation paradigms, and contributing to open-source communities.
PretrainPost-trainResearchSanta Clara, CAApr 49
Senior DGX Cloud AI Infrastructure Software Engineer
NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to develop and optimize infrastructure software and tools for large-scale AI training, post-training, and inference. The role focuses on improving efficiency and resiliency of AI workloads, co-designing APIs, and enhancing AI platforms, requiring strong debugging and distributed systems experience.
ServePost-trainEngineeringSanta Clara, CA +4 · RemoteApr 29
Principal Engineer, Autonomous Vehicles and Physical AI Solutions
Principal Engineer for Autonomous Vehicles and Physical AI Solutions at NVIDIA, focusing on strategic automotive and robotics partnerships in Japan. The role involves tailoring NVIDIA's full-stack AI technologies (DRIVE AGX Thor, Alpamayo, Cosmos) to meet production-grade requirements of OEMs, bridging AI, system optimization, and safety architecture. Responsibilities include innovating with reasoning VLA models, ensuring engineering alignment for partnerships, representing NVIDIA in industry forums, serving as technical authority for RFIs/RFQs, and establishing standards for physical AI production deployment.
AgentShipEngineeringTokyo, JapanApr 29
Senior GPU Networking Architect
This role focuses on building and optimizing GPU communication kernels for large-scale AI systems, linking GPU computing with networking. The Senior GPU Networking Architect will leverage deep knowledge of GPU architecture to improve kernel efficiency, minimize latency, and overlap computation with communication. Responsibilities include developing GPU-resident communication primitives, profiling and tuning kernels, and collaborating with various teams to co-design communication strategies. The role requires strong CUDA programming, GPU architecture fundamentals, and systems-level C/C++ development.
ServeEngineeringZurich, Switzerland +4 · RemoteMar 309
Solutions Architect, Pre-training and Post-training
NVIDIA is seeking a Solutions Architect to assist researchers and developers in accelerating their AI workloads using NVIDIA's platform. The role involves creating technical engagements, proposing state-of-the-art training and optimization frameworks, and promoting collaborative results. Requires 5+ years of experience in the full AI model lifecycle, including pre-training, fine-tuning, post-training, optimization, and evaluation, along with strong software engineering skills.
PretrainPost-trainEngineeringSeoul, South KoreaMar 309
Agent RL Infra Engineer
NVIDIA is seeking an engineer to develop and productionize reinforcement learning (RL) capabilities for agent teams within an enterprise context. The role involves evaluating and adapting RL approaches, designing reward environments, operationalizing training backends, and integrating with existing ML services. Responsibilities include leading data curation, designing RL training loops, integrating with GPU infrastructure, building observability, and collaborating with various platform and customer teams. The ideal candidate has extensive experience in operationalizing fine-tuning and RL techniques, familiarity with distributed training frameworks and MLOps, and proficiency in relevant programming languages.
Post-trainAgentEngineeringSanta Clara, CAMar 299
Senior AI ML Solution Engineer, AI-Native Development
Senior AI/ML Solution Engineer focused on designing and building AI-powered development pipelines, evaluating ML approaches for code generation and review, and driving adoption of AI-assisted software development. The role involves architecting feedback and evaluation systems, leading proof-of-concept development, and collaborating on risk-based development levels.
AgentEval GateEngineeringTel Aviv, IsraelMar 259
Director of Engineering, End to End Autonomous Driving
NVIDIA is seeking a Director of Engineering to lead the design and deployment of end-to-end autonomous driving systems. This role focuses on leveraging LLMs, VLMs, and VLAs for advanced planning and reasoning in vehicles and robotics, involving strategic leadership, team management, and technical oversight of ML model development and integration into safety-critical production environments.
ShipPost-trainEngineeringSanta Clara, CAMar 189
Director, Perception - Autonomous Vehicles
Director of Perception for Autonomous Vehicles at NVIDIA, leading teams to develop and deploy state-of-the-art deep learning models for real-time 3D world reconstruction and navigation. This role involves end-to-end ownership of the ML lifecycle, from data generation to deployment on NVIDIA DRIVE platforms, with a strong emphasis on safety-critical systems and cross-functional collaboration.
ShipDataEngineeringSanta Clara, CAMar 129
Robotics Research Intern - 2026
Robotics Research Intern at NVIDIA focusing on fundamental and applied research across the full robotics stack, including perception, planning, control, reinforcement learning, imitation learning, and simulation. The goal is to transform research paradigms, transfer into products, and create new markets.
AgentDataResearchZurich, SwitzerlandMar 119
Senior Manager, Engineering - Enterprise AI and Automation
Senior Engineering Manager to lead the strategy and execution for NVIDIA’s agentic developer platform, focusing on building, evaluating, and improving autonomous agents. The role involves identifying gaps, driving POCs, operationalizing approaches into reusable components, and establishing governance and safety mechanisms to scale autonomous systems within NVIDIA.
AgentServeEngineeringSanta Clara, CAFeb 239
PhD Research Intern, AI for Climate and Weather Simulation 2026
NVIDIA is seeking a PhD Research Intern to apply modern AI methods to climate and weather simulation. The role involves proposing, researching, and prototyping innovative ideas, publishing groundbreaking work, and contributing to technology transfer. The intern will utilize NVIDIA GPUs for cutting-edge research at the intersection of AI and climate science.
PretrainResearchUnited Kingdom · RemoteFeb 189
Senior High-Performance AI Training Engineer
Senior engineer focused on optimizing AI training workloads for performance on NVIDIA's hardware and software stack, from drivers to DL frameworks, impacting hardware/software roadmap and contributing to MLPerf benchmarks.
DataServeEngineeringSanta Clara, CAFeb 129
Senior Research Engineer Neural Reconstruction
Senior Research Engineer focused on neural reconstruction, developing and integrating neural rendering approaches for generative video, segmentation, and 3D reconstruction. The role involves adapting and fine-tuning generative models, collaborating on ML workflows, and contributing to core NVIDIA products. Requires strong Python and ML library skills, with experience in training and optimizing models.
Post-trainServeEngineeringSanta Clara, CAFeb 129
Senior Capability Development Engineer
NVIDIA is seeking a Senior Capability Development Engineer to develop and enhance internal RAG and Agent platforms for Ops Engineering productivity. The role involves developing, training, fine-tuning, and deploying multimodal LLMs, building LLM-based applications (RAG, TEXT2SQL, Agents), applying advanced tuning techniques, measuring performance, analyzing accuracy/bias, and driving dataset development. Requires strong Python skills, familiarity with ML/DL frameworks and LLMs, and practical experience with LLM training frameworks.
AgentPost-trainEngineeringShenzhen, ChinaFeb 119
Senior Software Engineer – AI and Autonomous Driving
Senior Software Engineer to build and deploy production AI for autonomous vehicles, focusing on training, fine-tuning, and optimizing deep learning models for real-time inference on NVIDIA GPUs. Requires strong C++/Python, deep learning training experience, and Linux development skills, with a preference for GPU programming, computer vision, or robotics.
Post-trainServeEngineeringMunich, GermanyFeb 59
Senior Software Architect, AI Networking
NVIDIA is looking for a Senior Software Architect to design and optimize inference infrastructure for large language models running on GPU clusters. The role involves working across software and hardware domains to define deployment and scaling strategies, optimize latency and throughput, and collaborate with various teams to ensure high-performance solutions.
ServeEngineeringTel Aviv, Israel +1Feb 49
Research Scientist, AI for Graphics and Gaming - New College Grad 2026
Research Scientist role focused on Generative AI for Graphics and Gaming, involving research, training, and prototyping of AI models for real-time graphics, world models, LLM-powered game experiences, and AI-driven characters. The role emphasizes step-change research and collaboration with product, driver, and hardware teams to ship features.
Post-trainPretrainResearchSanta Clara, CAJan 279
Research Scientist, Human‑AI Perception and Interaction Research - PhD New College Grad 2026
Research Scientist role focused on advancing AI in areas like gaming and robotics by understanding and shaping human perception, learning, and behavior through the lens of vision science, HCI, and HRI. The role involves proposing, researching, prototyping, and testing innovative ideas, publishing at top conferences, and collaborating with researchers and product engineers. Requires a PhD or equivalent research experience and a strong publication record.
Post-trainResearchSanta Clara, CAJan 219
Senior AI Algorithms Software Engineer
Senior AI Engineer at NVIDIA focused on developing and deploying foundation model applications (LLMs, VLMs, multi-modal) for manufacturing AI platforms, including computer vision, video understanding, and anomaly detection. The role involves technical leadership, co-development with customers, and driving research from concept to production.
ShipPost-trainEngineeringHsinchu, Taiwan +1Jan 219
Distinguished Engineer – High Performance AI
Distinguished Engineer role focused on building groundbreaking agentic AI systems for the CUDA ecosystem, encompassing multi-agent runtimes, orchestration, data/evaluation pipelines, training/inference stacks, and GPU-accelerated execution. The role involves defining technical strategy, co-designing solutions with hardware/software teams, developing evaluation frameworks, and driving architecture across the AI stack.
AgentServeEngineeringSanta Clara, CA +5 · RemoteJan 159
Research Scientist, Robotics Research - PhD New College Grad 2026
Research Scientist role focused on developing and integrating algorithms, models, and methods for robotic manipulation and loco-manipulation. The role involves contributing to multi-person research projects, publishing in top conferences, collaborating with product teams for research transfer, and working with real-world robotic systems and simulation. Requires a PhD and a strong research track record in robotics, ML, or related fields, with expertise in Python, deep learning frameworks, and robotics/simulation frameworks.
AgentDataResearchSeattle, WAJan 149
Senior Deep Learning Algorithm Engineer
Senior Deep Learning Algorithm Engineer at NVIDIA focused on optimizing deep learning training and inference workloads on state-of-the-art hardware and software platforms. The role involves performance analysis, profiling, and implementation of production-quality software, with a focus on squeezing performance from hardware and software stacks.
ServePost-trainEngineeringHo Chi Minh City, Vietnam +1 · RemoteJan 119
Research Scientist, AI-Mediated Reality and Interaction Research - PhD New College Grad 2026
Research Scientist role focused on fundamental research in AI-Mediated Reality and Interaction, involving interactive physical AIs, 4D world modeling, and human-AI interaction. The role requires proposing, researching, and prototyping innovative ideas, publishing at top conferences, and collaborating with engineers for technology transfer. Requires a Ph.D. and a strong research track record in AI and computer vision.
PretrainResearchSanta Clara, CA +2Jan 99
Research Scientist, ML Systems - PhD New College Grad 2026
Research Scientist role focused on ML Systems, contributing to hardware, software, and infrastructure for ML systems at various scales. The role involves understanding and developing solutions for efficiency, scaling, and resilience in ML systems, with a focus on co-design of algorithms and systems. Requires a PhD and expertise in areas like OS, distributed systems, inference/training systems, data management, cloud computing, or computer architecture.
ServePost-trainResearchSanta Clara, CA +3Jan 99
Senior Research Scientist, Efficient Deep Learning
Senior Research Scientist at NVIDIA focusing on efficient deep learning methods, including post-training optimization, architecture design, and resource-efficient training/fine-tuning. The role involves research, implementation, publication, collaboration, and technology transfer to products.
Post-trainServeResearchSanta Clara, CAJan 99
Senior GPU Architect, Deep Learning
NVIDIA is seeking a Senior GPU Architect to design and enhance GPU architecture features specifically for deep learning workloads, covering both training and inference. The role involves developing simulators, mapping deep learning algorithms to hardware, and advancing parallel computation. Requires strong C++, C++, Perl, Python programming, and a background in computer architecture and high-performance computing.
ServeEngineeringSanta Clara, CA +2Jan 99
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA is seeking a Senior Research Scientist to conduct fundamental LLM research, focusing on post-training, alignment, synthetic data, reasoning, novel learning paradigms, and multi-modalities. The role involves exploring new capabilities, enabling agency, acquiring commonsense knowledge, publishing research, and collaborating with product groups.
Post-trainPretrainResearchSanta Clara, CAJan 99
Senior Research Scientist - Autonomous Vehicles
Research Scientist role focused on AI for autonomous vehicles, involving designing and implementing techniques, publishing research, and collaborating with product teams for deployment. The role emphasizes agent behavior, foundation models, closed-loop training, and AI safety within the robotics domain.
AgentResearchSanta Clara, CAJan 99
Senior Deep Learning Computer Architect
NVIDIA is seeking a Senior Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics algorithms. The role involves analyzing deep learning methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and core deep learning kernels.
ServeEngineeringSanta Clara, CA +1Jan 99
Senior Deep Learning Performance Architect
Senior Deep Learning Performance Architect role at NVIDIA focused on developing and analyzing next-generation architectures for AI and HPC applications. This involves performance modeling, simulation, and understanding the interplay of hardware and software for deep learning training and inference.
ServePost-trainEngineeringSanta Clara, CA +1Jan 99
Senior Research Engineer - Autonomous Vehicles
Senior Research Engineer at NVIDIA focusing on AI for Autonomous Vehicles. The role involves developing large-scale training frameworks for multimodal foundation models, optimizing GPU utilization, implementing data loaders, building simulation infrastructure, integrating new architectures, developing sim-to-real pipelines, combining LLMs with policy learning, and applying RL for fine-tuning LLMs. Requires expertise in deep learning, reinforcement learning, generative modeling, distributed training systems, and GPU acceleration.
Post-trainAgentResearchSanta Clara, CAJan 99
Senior Robotics Research Scientist
NVIDIA's Seattle Robotics Lab is seeking a Senior Robotics Research Scientist to develop algorithms, models, and methods for robotic manipulation and loco-manipulation, integrating them into real-world systems and transferring research into NVIDIA products. The role involves fundamental and applied research across the robotics stack, with a focus on enabling companies to become robotics companies.
ShipDataResearchSeattle, WAJan 99
Senior Deep Learning Software Engineer, Inference
Senior Software Engineer specializing in Deep Learning Inference, focusing on optimizing GPU-accelerated software for large-scale model serving and inference using frameworks like SGLang and vLLM. The role involves performance tuning, implementing latest algorithms, and scaling performance across NVIDIA accelerators.
ServeEngineeringNetherlands +2 · RemoteJan 99
Senior Research Engineer, Foundation Model Training Infrastructure
Senior/Principal Engineer to build cutting-edge infrastructure for large-scale foundation model training in the Generalist Embodied Agent Research (GEAR) group, focusing on Project GR00T for humanoid robots. Responsibilities include designing and optimizing distributed training systems, data loaders, and monitoring tools for multimodal foundation models.
PretrainPost-trainEngineeringSanta Clara, CAJan 99
Research Scientist, ML Systems - PhD New College Grad 2026
Research Scientist role focusing on ML Systems, contributing to hardware, software, and infrastructure for training, fine-tuning, and serving ML models at scale. Requires a PhD and expertise in systems research areas.
ServePost-trainResearchSingapore, Singapore · RemoteDec '259
Senior Software Architect, AI Networking
Senior Software Architect role focused on designing and optimizing large-scale LLM inference infrastructure on GPU clusters, involving system-level optimizations for latency, throughput, and cost-efficiency.
ServeEngineeringTel Aviv, IsraelDec '259
Research Scientist, Deep Learning and Computer Vision - New College Graduate
Research Scientist role focused on deep learning and computer vision, with an emphasis on novel methods, generative and multimodal AI, and explainable AI. The role involves research, design, implementation, and publication of original work, with potential for technology transfer to products. Requires a Ph.D. and a strong publication record in top-tier conferences.
PretrainResearchTaipei, TaiwanNov '259
Senior Software Research Architect, AI Networking
NVIDIA is seeking a Senior Software Research Architect to improve the framework for large-scale LLM learning and prediction. This role focuses on designing and optimizing systems for generative AI workloads on advanced GPU clusters, specifically leveraging the NVIDIA Spectrum-X Networking Platform to define deployment and scaling strategies. The architect will work on inter-node communication, compute scheduling, and system-level optimization, collaborating with engineers and researchers to enable generative AI technologies in real-world applications.
ServePretrainResearchTel Aviv, IsraelNov '259
Senior LLM Train Framework Engineer
NVIDIA is seeking a Senior LLM Train Framework Engineer to contribute to the Megatron Core team, focusing on building and developing open-source frameworks for LLM and Multimodal foundation model pretraining and post-training. The role involves addressing AI training and inference challenges across the model lifecycle, enhancing distributed training strategies, and optimizing performance on NVIDIA GPUs.
PretrainPost-trainEngineeringShanghai, ChinaOct '259