AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 440 active AI roles, down 50% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-366 -50%
360 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 5w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
60 new roles
22

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (3,149)

434 AI · 1824 total active
Show
Active onlyAI only (≥ 7)
Stage
AllData · 40Pretrain · 30Post-train · 51Serve · 412Agent · 205Eval Gate · 15Ship · 71
Function
AllEngineering · 2752Product · 184Research · 98
Country
AllUnited States · 1651Israel · 590India · 249China · 225Taiwan · 150Germany · 72United Kingdom · 60Switzerland · 46Vietnam · 41Poland · 30France · 29Canada · 25Singapore · 15South Korea · 14Netherlands · 12Italy · 11Japan · 10Sweden · 8Denmark · 6Hong Kong · 6Spain · 6Finland · 5Hungary · 5Australia · 4Brazil · 4Mexico · 4Saudi Arabia · 4Thailand · 4Ukraine · 4United Arab Emirates · 4Czech Republic · 3Palestine · 3Romania · 3Armenia · 1Belgium · 1Greece · 1Norway · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
AI Computing Software Development Engineer, TensorRT-LLM
NVIDIA is seeking a Software Development Engineer for its TensorRT-LLM team to develop and optimize LLM inference software for various platforms. The role involves performance analysis, tuning, and contributing to the architecture and hardware design, with a focus on scaling inference capabilities.
ServeEngineeringTaipei, Taiwan +1Sep '259
Senior AI Training Performance Engineer
NVIDIA is seeking a Senior AI Training Performance Engineer to optimize AI training workloads on state-of-the-art hardware and software platforms. The role involves analyzing, profiling, and optimizing performance across the hardware/software stack, implementing production-quality software, and building automation tools. Requires a strong background in deep learning training, computer architecture (especially GPU), performance tuning, and programming in C++, Python, and CUDA.
Data
201–250 of 3,149← Prev1…456…63Next →
Engineering
Shanghai, China
Aug '25
9
Senior Deep Learning Researcher, Diffusion
Senior Deep Learning Researcher at NVIDIA focusing on diffusion-based technologies and multi-modality. The role involves inventing and building new techniques, publishing findings, and contributing to NVIDIA's AI enterprise software. Requires a PhD, research experience, and publications in top-tier conferences. Experience with image/video understanding and LLMs is essential.
PretrainPost-trainResearchTel Aviv, IsraelMay '259
Senior Product Engineer, Agentic AI
Product leader and builder to define and drive Agentic AI products and platforms, bridging product vision with deep technical execution. Focus on translating early-stage innovations into scalable, production-ready systems for agentic AI workflows.
AgentProductHo Chi Minh City, Vietnam +11w ago8
Senior Manager, Interactive World Model Platforms
Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams) into an industry standard, focusing on production engineering, performance, and developer/researcher success across AV, robotics, rendering, and simulation.
ShipServeEngineeringMunich, Germany +21w ago8
Senior Manager, AlpaSim and AlpaDreams Production
Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams, AlpaSim) into an industry standard, focusing on production engineering, performance, and developer ecosystem growth for applications in AV, robotics, rendering, and simulation.
ShipServeEngineeringSanta Clara, CA +21w ago8
Senior Systems Software Engineer, Semiconductor Systems Inspection
Senior Software Engineer to develop AI products for semiconductor inspection, focusing on computer vision, multimodal AI, anomaly detection, model compression, and deployment optimization. The role involves building models, adaptation workflows, and inference pipelines for production environments, with a focus on advancing roadmap progress and delivering practical systems.
ShipServeEngineeringSanta Clara, CA1w ago8
AI Computing Software Development Engineer, LLM Inference
Software Development Engineer focused on LLM inference software (TensorRT LLM and TensorRT Edge LLM) at NVIDIA, involving crafting, scaling, performance analysis, optimization, and tuning of inferencing software for GPUs. The role requires strong C/C++ skills, experience with deep learning frameworks, and collaboration across teams.
ServeEngineeringShanghai, China +11w ago8
Senior Software Engineer, AIOps
NVIDIA is seeking a Senior Software Engineer for their AIOps platform team to build core distributed systems for ingesting telemetry from GPU clusters and operationalizing predictive AI models. The role involves architecting an agentic AIOps system, handling high-scale data engineering, and building model-serving infrastructure for SaaS and on-premises deployments.
AgentServeEngineeringRaanana, Israel +11w ago8
Senior Applied AI Engineer
NVIDIA is seeking a Senior Applied AI Engineer to build AI solutions that unify data across engineering systems, enabling advanced analytics through AI agents, copilots, and workflow automation for ASIC networking product engineering. The role involves end-to-end ownership from architecture to deployment and maintenance, aiming to scale engineering productivity.
AgentEngineeringYokneam, Israel1w ago8
Senior Software Engineer, Applied AI
Senior Software Engineer, Applied AI Systems role focused on building production AI/ML and agentic solutions. Responsibilities include developing agents, workflow services, APIs, data pipelines, tool integrations, evaluation harnesses, and operational tooling. Requires strong Python skills, experience with LLMs, RAG, agentic AI, distributed systems, and system design. The role emphasizes turning ambiguous problems into durable software systems and shaping how production applied AI systems are built and measured.
AgentEngineeringMunich, Germany1w ago8
Senior Inference Engineer, AIConfigurator for Dynamo
Senior Inference Engineer role focused on optimizing LLM inference deployment configurations using AIConfigurator, integrating GPU systems, model serving, and performance modeling for NVIDIA platforms.
ServeEngineeringSanta Clara, CA +1 · Remote2w ago8
Distinguished Engineer - Wireless Infrastructure
NVIDIA is seeking a Distinguished Engineer to lead the technology strategy for next-generation wireless infrastructure, focusing on AI-RAN and Agentic Core. The role involves applying AI/ML to 6G RAN functions, transforming the wireless core into an agentic AI-based architecture, and driving rapid prototyping of GPU-accelerated platforms. Responsibilities include system architecture, design, development, and performance optimization for AI-for-RAN software stacks, as well as driving new applications in Integrated Sensing and Communications (ISAC) and Physical AI at the Edge. The position requires deep expertise in AI/ML, communication systems, and significant industry experience.
AgentDataEngineeringSanta Clara, CA +2 · Remote2w ago8
Senior System Security Architect
NVIDIA is seeking a Senior Security Architect to design, build, and deploy AI agent systems for security workflows, integrating LLMs, RAG, and automation with security data. The role involves owning the full agentic system lifecycle and partnering with product teams.
AgentEngineeringTel Aviv, Israel +22w ago8
Senior Software Engineer - Autonomous Driving Simulation
Senior Software Engineer role focused on building and scaling realistic virtual environments for autonomous vehicle (AV) training, testing, and validation. The role involves developing simulation platforms, domain adaptation technologies (Real2Sim, Sim2Real), and optimizing large-scale simulation workflows. It requires strong programming skills in Python, C/C++, PyTorch, and experience with modern software engineering and infrastructure tools, as well as a background in computer vision, deep learning, or simulation systems.
DataAgentEngineeringSanta Clara, CA2w ago8
AI Computing Software Development Engineer, TensorRT
NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust, scalable inferencing software for GPUs. The role involves performance analysis, optimization, tuning, and collaborating with various teams to guide the direction of machine learning inferencing. Requires a Masters or higher degree, 2+ years of software development experience, strong C/C++ skills, and familiarity with deep learning frameworks.
ServeEngineeringShanghai, China2w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM AIGV
NVIDIA is seeking software engineers to develop and optimize inferencing software (TensorRT/TensorRT-LLM) for AI computing. The role involves performance analysis, tuning, integrating AI advancements, and collaborating across teams to shape machine learning inferencing on NVIDIA platforms. Requires strong programming skills, experience with deep learning frameworks, and a proactive approach.
ServeEngineeringShanghai, China +22w ago8
DL System Software Engineer - AI Platform
NVIDIA is seeking a DL System Software Engineer to join their AI Platform team. The role involves developing and building solutions for scheduling large-scale AI training and inference workloads on GPU clusters, optimizing performance and efficiency for large models. The engineer will work on core infrastructure, resource management, and GPU scheduling, contributing to NVIDIA's AI platform.
ServePost-trainEngineeringToronto, ON2w ago8
Senior Applied AI Engineer, Product Simulation
Senior Applied AI Engineer at NVIDIA to lead the rebuild of a silicon productization toolchain around AI. The role involves building agentic systems to demystify chip feature interactions, integrating AI tools into an agent harness, and leading eval-driven development for applied AI in production.
AgentEngineeringSanta Clara, CA2w ago8
Software Engineer, AI Networking Architect
NVIDIA is seeking an AI Networking Architect to optimize AI workload performance by analyzing AI models, distributed training, and inference workloads, and translating research insights into software, hardware, and networking architecture requirements. The role involves building platforms and simulations to evaluate trade-offs and influence future NVIDIA product roadmaps.
ServeAgentEngineeringTel Aviv, Israel +12w ago8
Senior Software Engineer, Agentic Engineering
Senior Software Engineer to build agentic workflows for code generation, testing, and tuning within NVIDIA's frameworks and compilers. The role involves partnering with internal teams to develop and integrate AI agents into engineering processes, focusing on multi-agent orchestration and autonomous loops.
AgentEngineeringSanta Clara, CA +1 · Remote2w ago8
GPU Performance Engineer - Neural Reconstruction
GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads, involving PyTorch, CUDA, and GPU profiling to improve training and rendering performance.
ServePost-trainEngineeringCanada · Remote3w ago8
Developer Technology Engineer - AI
NVIDIA is seeking an AI Developer Technology Engineer to study and develop cutting-edge deep learning techniques, analyze and optimize performance on GPU architectures, and work with customers to provide AI solutions using GPUs. The role involves close collaboration with internal NVIDIA teams to influence future architectures and software platforms.
ServeEngineeringShanghai, China +23w ago8
Systems Performance Engineer, Agentic AI Workloads – New College Grad 2026
This role focuses on modeling, simulating, and analyzing the system-level performance of agentic AI workloads in datacenter environments. The engineer will develop simulators, characterize LLM serving traffic, identify performance bottlenecks, and provide architectural recommendations for next-generation AI systems. The role requires strong programming skills in C++ and Python, a solid understanding of queueing theory, traffic modeling, and statistics, as well as fundamentals of deep learning and LLM inference serving.
ServeAgentEngineeringSanta Clara, CA +23w ago8
Software Engineering Manager, Robotics Neural Reconstruction and Real2Sim Applications
NVIDIA is seeking an Engineering Manager to lead a team focused on robotics Neural Reconstruction & Real2Sim Applications, advancing technologies for creating digital twins and workflows at scale for physical AI.
ShipDataEngineeringSanta Clara, CA3w ago8
Senior Applied AI and AI Infrastructure Engineer - Chip Design and DFX
Senior Engineer focused on Applied AI and AI Infrastructure for Chip Design and DFX at NVIDIA. The role involves building and managing deployment cycles for ML & Gen AI projects, establishing robust AI infrastructure, and applying AI methods to solve complex problems in Design For Test. Requires expertise in agents, multi-agentic ecosystems, SQL, ETL, data modeling, cloud platforms, and strong programming skills in Python/C++.
AgentServeEngineeringSanta Clara, CA3w ago8
Applied AI Engineer - VLSI Design
NVIDIA is seeking an Applied AI Engineer to develop and deploy AI agents leveraging LLMs to solve complex problems in VLSI design. The role involves designing and building infrastructure for LLM-powered engineering assistants and multi-turn dialogue systems, fine-tuning models, and integrating them with CAD flows.
AgentEngineeringSanta Clara, CA3w ago8
Senior ASIC AI Engineer
Develop AI powered methodologies and Agents to generate micro-architecture, RTL, and physical design starting with specification, using AI agents to process large data and existing codebase to generate skills that can be widely used. Evaluate latest Multi-agent collaboration frameworks and apply them to generate area/power/timing/functionally accurate designs for memory system units in the GPU.
AgentEngineeringSanta Clara, CA3w ago8
Senior System Software Engineer, Robotics
NVIDIA is seeking a Senior System Software Engineer for their Robotics Platform Team, focusing on humanoid robots and embodied intelligence. The role involves integrating robotics software stacks, enabling deployment of foundation models and RL policies, developing validation workflows, and optimizing system metrics. The engineer will work with AI, simulation, and hardware teams to bring up and harden robotic systems.
ShipAgentEngineeringShanghai, China3w ago8
Deep Learning Computer Architect - New College Grad 2026
NVIDIA is seeking a Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics. The role involves analyzing DL methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and deep learning kernels.
ServeEngineeringSanta Clara, CA +13w ago8
Senior Manager, Artificial Intelligence - Machine Learning Platform
Senior Manager for AI/ML Platform at NVIDIA, leading the development and management of tools and services for the entire AI/ML project lifecycle, focusing on large-scale model training and deployment efficiency. Requires extensive experience in AI/ML infrastructure, team leadership, and strategic vision for AI platforms.
ServePost-trainEngineeringSanta Clara, CA +2 · Remote4w ago8
Manager, Deep Learning Algorithms
Manager to lead engineering activities for productizing Deep Learning models, focusing on implementing and optimizing state-of-the-art algorithms for GPU-accelerated platforms. The role involves leading a team, collaborating with internal partners on roadmap development, and deploying training and inference workloads.
ServeDataEngineeringWarsaw, Poland +1 · Remote4w ago8
Engineering Manager, Inference Benchmarking — AI Perf
Engineering Manager for NVIDIA's AIPerf platform, a standard for assessing LLM serving performance. The role involves leading a team to build and advance the platform, focusing on core infrastructure, accuracy of benchmark results, and advising on upstream engine integrations for various AI workloads (LLM, multimodal, diffusion, computer vision). Requires strong systems engineering, inference infrastructure, and open-source community experience.
ServeEngineeringSanta Clara, CA +5 · Remote4w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM
NVIDIA is seeking software engineers to develop and optimize AI inference software (TensorRT/TensorRT-LLM) for GPUs. The role involves performance analysis, tuning, integrating new advancements, and collaborating across teams to shape the future of machine learning inferencing.
ServeEngineeringShanghai, China4w ago8
Senior Software Engineer, Generative AI Systems
Senior Software Engineer role focused on building and scaling Generative AI systems, including LLMs, agentic AI, and RAG pipelines. Responsibilities include designing and developing infrastructure for ML training and inference, creating evaluation frameworks, optimizing RAG pipelines, and building backend services and APIs. Requires strong software engineering fundamentals, experience with ML systems, distributed infrastructure, and GenAI workflows.
AgentServeEngineeringSanta Clara, CA4w ago8
GPU Performance Engineer - Neural Reconstruction
GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads. This role involves profiling, identifying bottlenecks, and improving performance in CUDA, PyTorch, and C++ for training and rendering, while ensuring reconstruction quality is maintained. It requires strong programming, GPU optimization, and performance analysis skills, with collaboration across research and engineering teams.
ServeDataEngineeringCA +5 · Remote4w ago8
Senior AI Tools Engineer, SRE Operations - GeForce NOW
This role focuses on building and deploying AI/ML tools, specifically LLM- and Agent-based systems, to analyze production data for a global service (GeForce Now). The goal is to automate root cause analysis for incidents and predict future service trends, requiring strong data pipeline management and expertise in AI frameworks.
AgentDataEngineeringCA +1 · Remote4w ago8
Perception Engineer - Autonomous Driving
NVIDIA is hiring a Perception Engineer for their Autonomous Driving team in China. The role involves research, design, and implementation of software features for autonomous driving perception, including DNN improvement, evaluation, and deployment. Requires strong C++/PyTorch, ML/DL techniques for Computer Vision, and experience with perception stacks. Familiarity with DNN development, network acceleration, and GPU computing is a plus.
ShipPost-trainEngineeringShanghai, China +15w ago8
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to develop and optimize GPU-accelerated deep learning inference software, focusing on highly optimized kernels, performance analysis, and tuning. The role involves collaboration across various domains like automotive, image, and speech understanding, and requires strong C/C++ skills and GPU programming experience.
ServeEngineeringShanghai, China +15w ago8
Principal Machine Learning Engineer, Accelerated Apache Spark
This role focuses on applying ML/AI to optimize and accelerate Apache Spark workloads on NVIDIA GPUs, involving performance prediction, adaptive systems, and developing AI agents for system issue resolution and optimization. The role requires significant experience in ML/DL solution design, productionization, and large-scale data processing platforms like Spark, with a focus on LLM/GenAI, reinforcement learning, and adaptive ML systems.
AgentServeEngineeringSanta Clara, CA5w ago8
Senior DGX Cloud AI Infrastructure Software Engineer
NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to design, build, and maintain AI infrastructure for large-scale AI training and inferencing. The role involves optimizing efficiency and resiliency of AI workloads, developing scalable AI and Data infrastructure tools, and ensuring high availability of AI systems.
ServeDataEngineeringShanghai, China5w ago8
Director, AI Enablement
Director of AI Enablement at NVIDIA, responsible for developing and implementing an AI enablement roadmap to accelerate AI adoption and agentic developments across various internal workflows. The role focuses on building tools, services, and blueprints for NVIDIA AI developers, optimizing AI development, and transforming NVIDIA into an AI-native company.
AgentEngineeringSanta Clara, CA5w ago8
AI Software Engineer, Kernel Libraries - New College Grad 2026
AI Software Engineer focused on developing inference systems software stack, including libraries, code generators, and GPU kernels for NVIDIA's hardware. The role involves innovating for efficient AI inference, optimizing kernels, designing abstractions for LLM serving engines, and building JIT compilers and runtimes. Collaboration with internal teams and contributions to open-source projects like FlashInfer, vLLM, and SGLang are expected.
ServeEngineeringSanta Clara, CA5w ago8
Senior AI Infrastructure Software Engineer - DGX Cloud
NVIDIA is seeking a Senior AI Infrastructure Software Engineer to design, build, and maintain AI platforms for large-scale AI training, inferencing, fine-tuning, and Agentic AI in production. The role involves developing platform and tools for AI/ML workload efficiency, resiliency, and observability, with a focus on distributed systems and Kubernetes.
ServeEngineeringSanta Clara, CA +3 · Remote6w ago8
Software Engineer - AI Research Clusters
Software Engineer to build and maintain GPU clusters for internal AI researchers, focusing on reliability, performance, and self-service. The role involves applying AIOps and Agentic AI to reduce operational toil and support the training, fine-tuning, and deployment of advanced ML models.
ServeEngineeringSanta Clara, CA +5 · Remote6w ago8
Senior Engineer - AI Agents and Systems
Senior Engineer role focused on deploying advanced AI agent frameworks and local runtimes to Windows and NVIDIA GeForce RTX GPUs, ensuring open-source AI agents run locally, safely, and efficiently on consumer PCs, and creating the foundation of the desktop AI operating system.
AgentEngineeringSanta Clara, CA +16w ago8
Senior Engineer - AI Agents and Systems
Senior Engineer role focused on deploying advanced AI agent frameworks and local runtimes to Windows and NVIDIA GeForce RTX GPUs, ensuring open-source AI agents run locally, safely, and efficiently on consumer PCs. The role involves leading development for the foundation of the desktop AI operating system by combining local inference with robust privacy routers and sandboxed execution.
AgentEngineeringSanta Clara, CA +16w ago8
Manager, Test and Tools Development Engineering
Manager for a test and tools development engineering team focused on building autonomous systems and AI-powered quality infrastructure for Omniverse. The role involves leading a team to design agentic test pipelines, multi-agent orchestration for test generation, failure triage, and establishing evaluation frameworks for AI-generated outputs.
AgentEngineeringPune, India6w ago8
Senior AI Engineer, Agents and Developer Workflows
Senior AI Engineer role focused on developing and deploying AI agents and LLM-based solutions to automate software engineering workflows within NVIDIA. The role involves creating tools to improve developer efficiency, accelerate feedback loops, and enhance release reliability, with a focus on predictive modeling for risk identification and leveraging RAG and fine-tuning techniques.
AgentEngineeringBeijing, China +17w ago8
Senior Performance Compiler Engineer - Triton
Senior Performance Compiler Engineer to work on the open-source Triton compiler project, focusing on using compilers to improve AI performance on NVIDIA GPUs for large language models, agents, and other AI applications. The role involves investigating GPU hardware, designing and implementing compiler technology using MLIR to optimize kernel descriptions for efficient GPU code generation, and collaborating with internal teams.
ServeEngineeringRedmond, WA +5 · Remote7w ago8