AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

© 2026 AI Hire Signal · Not affiliated with companies shown

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring: 440 / 623
Momentum (4w): ↓ -386 (-53%) · 340 opens last 4w · 726 prior 4w
Salary range: $100k–$575k · avg $262k · USD, disclosed roles only
Tracked since: May '25 · last role 4w ago
Hiring velocity (new roles per week)
Dec 30: 1 new role
Mar 10: 1 new role
Mar 24: 1 new role
Apr 28: 1 new role
May 12: 4 new roles
May 19: 5 new roles
May 26: 3 new roles
Jun 2: 3 new roles
Jun 9: 2 new roles
Jun 16: 1 new role
Jun 23: 2 new roles
Jun 30: 3 new roles
Jul 7: 4 new roles
Jul 14: 1 new role
Jul 28: 2 new roles
Aug 11: 4 new roles
Aug 18: 6 new roles
Aug 25: 2 new roles
Sep 1: 3 new roles
Sep 15: 8 new roles
Sep 22: 3 new roles
Sep 29: 6 new roles
Oct 6: 2 new roles
Oct 13: 2 new roles
Oct 20: 3 new roles
Oct 27: 6 new roles
Nov 3: 9 new roles
Nov 10: 8 new roles
Nov 17: 8 new roles
Nov 24: 4 new roles
Dec 1: 11 new roles
Dec 8: 9 new roles
Dec 15: 14 new roles
Dec 22: 10 new roles
Dec 29: 8 new roles
Jan 5: 107 new roles
Jan 12: 22 new roles
Jan 19: 45 new roles
Jan 26: 32 new roles
Feb 2: 59 new roles
Feb 9: 64 new roles
Feb 16: 63 new roles
Feb 23: 83 new roles
Mar 2: 83 new roles
Mar 9: 88 new roles
Mar 16: 97 new roles
Mar 23: 72 new roles
Mar 30: 215 new roles
Apr 6: 158 new roles
Apr 13: 250 new roles
Apr 20: 199 new roles
Apr 27: 332 new roles
May 4: 304 new roles
May 11: 189 new roles
May 18: 131 new roles
May 25: 102 new roles
Jun 1: 129 new roles
Jun 8: 122 new roles
Jun 15: 49 new roles
Jun 22: 40 new roles

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).
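
The period-over-period figures on this page (218 → 110 reported as -50%, and 726 → 340 opens reported as -386, -53%) are consistent with a plain percent change against the prior period. A minimal Python sketch of that arithmetic follows. The helper name `period_change` is hypothetical, and rounding to the nearest whole percent is an assumption, since the page does not state its rounding rule.

```python
def period_change(current: int, prior: int) -> tuple[int, int]:
    """Return (absolute change, percent change rounded to a whole number).

    Assumption: the percentage is computed against the prior period and
    rounded to the nearest integer; the page does not document this.
    """
    if prior <= 0:
        raise ValueError("prior period count must be positive")
    delta = current - prior
    percent = round(100 * delta / prior)
    return delta, percent


# 30-day postings from the FAQ: 218 prior, 110 current
print(period_change(110, 218))

# 4-week opens from the momentum card: 726 prior, 340 current
print(period_change(340, 726))
```

Both calls reproduce the deltas and percentages quoted on the page under that rounding assumption.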

Jobs (224)

434 AI · 1824 total active
Filtered by: Function: Engineering · Country: United States
Show: Active only · AI only (≥ 7)
Stage: Data 17 · Pretrain 20 · Post-train 28 · Serve 236 · Agent 95 · Eval Gate 5 · Ship 33
Function: Engineering 375 · Research 57 · Product 2
Country: United States 259 · China 55 · Israel 43 · Germany 21 · Switzerland 18 · United Kingdom 14 · India 13 · Poland 12 · Vietnam 12 · Canada 10 · Italy 7 · Netherlands 6 · Singapore 6 · France 5 · Taiwan 4 · Finland 2 · Spain 2 · Armenia 1 · Czech Republic 1 · Hungary 1 · Japan 1 · Romania 1 · South Korea 1 · Sweden 1
Columns: Title | Stage | Function | Location | First seen | AI score
Senior Software Engineer - Simulation
Senior Software Engineer role focused on building scalable 3D simulation software for Digital Twin and Synthetic Data Generation applications, collaborating on backend services and AI Agents for end-to-end SDG solutions. Requires strong C/C++/Python skills, 3D simulation experience, and proficiency in physics game engines and containerization tools.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: 5w ago | AI score: 7
Senior Software Engineer - Simulation
Senior Software Engineer role focused on building scalable 3D simulation software for Digital Twin and Synthetic Data Generation applications, collaborating with teams to build backend services and AI Agents for end-to-end SDG solutions. Requires strong programming skills in C/C++, Python, and experience with 3D simulation and physics engines.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: 5w ago | AI score: 7
Senior Software Engineer, AI Resiliency
Senior Software Engineer to lead the development of AI software resiliency for large-scale AI supercomputers (100,000+ GPUs), focusing on features like fast checkpoint-recovery, error detection/isolation, and straggler/hang detection to minimize cluster downtime. The role involves hands-on C++ and Python coding, debugging, fault tolerance, and collaboration with AI researchers and hardware/software teams, integrating resiliency into AI frameworks like PyTorch and JAX/XLA. Experience with distributed systems, fault tolerance, AI frameworks, and debugging tools is required, with a preference for experience in training models, CUDA/NCCL/MPI, checkpointing strategies, and large-scale AI clusters/HPC.
Stage: Serve | Function: Engineering | Location: Redmond, WA +1 | First seen: 6w ago | AI score: 7
Systems Software Engineer - New College Grad 2026
Systems Software Engineer role focused on applying AI and computational methods to accelerate semiconductor manufacturing and design using GPUs. The role involves developing and optimizing complex software solutions, with a strong emphasis on performance and parallel programming.
Stage: Serve | Function: Engineering | Location: Hillsboro, OR | First seen: 6w ago | AI score: 7
Senior Manager, Robotics Quality Assurance
Senior Manager, Robotics Quality Assurance to lead test engineering strategy, automation, and execution for NVIDIA's robotics products. This includes validating hardware/software on Jetson, testing AI foundation models (Isaac GR00T), building simulation infrastructure (Isaac Sim/Lab), and ensuring sim-to-real transferability for AI-driven robot policies.
Stage: Ship, Data | Function: Engineering | Location: Santa Clara, CA | First seen: 6w ago | AI score: 7
Lead Engineer, Healthcare Data Operations and Strategy
Lead engineer responsible for defining strategy and architecting/building the MLOps platform for NVIDIA's healthcare data programs, ensuring data quality, governance, and serving for model training and evaluation.
Stage: Data | Function: Engineering | Location: Santa Clara, CA | First seen: 7w ago | AI score: 7
Senior Deep Learning Systems Engineer, Datacenters
Senior Deep Learning Systems Engineer focused on analyzing and optimizing the performance and power consumption of deep learning applications on datacenter hardware, influencing the design of future AI systems and software stacks. This role involves developing software infrastructure, analysis tools, and profiling methodologies for DL workloads, with a strong emphasis on system architecture and performance analysis.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 | First seen: 7w ago | AI score: 7
Senior System Software Engineer - AI Performance and Efficiency Tools
Develops internal profiling, analysis, debugging, benchmarking, and simulation tools for AI workloads running on GPU clusters, supporting AI researchers and SW/HW teams to improve performance and efficiency.
Stage: Serve, Data | Function: Engineering | Location: Santa Clara, CA | First seen: 7w ago | AI score: 7
Senior Developer Technology Engineer - Windows AI Platform
Senior Developer Technology Engineer focused on optimizing and deploying AI/GenAI applications on NVIDIA RTX platforms, particularly LLMs on Windows. This role involves working with internal teams and external developers, analyzing performance, conducting training, and improving user experience with OSS software like Llama.cpp and Ollama. Collaboration with driver and architecture teams is key to influencing future GPU features.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA | First seen: 7w ago | AI score: 7
Senior Deep Learning Tools Engineer – CUDA Tile
Senior Deep Learning Tools Engineer at NVIDIA focused on performance validation, analysis, and tracking for AI workloads accelerated by CUDA Tile compiler technologies and GPU systems. The role involves designing and developing performance testing frameworks, building automated CI/CD pipelines, implementing benchmarking systems, analyzing performance trends, and collaborating with compiler and architecture teams to resolve performance issues. Requires strong programming skills in Python, experience with CI/CD, deep learning frameworks, and hardware-aware performance analysis.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: 7w ago | AI score: 7
Senior Systems Software Engineer - GPU Performance at Scale
Senior Systems Software Engineer focused on GPU performance at scale for AI workloads, involving collaboration with various hardware and software teams to optimize large-scale computing platforms and deliver insights into AI workload performance.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +4 · Remote | First seen: 7w ago | AI score: 7
Senior Staff Software Engineer - AI Agent Platform
Senior Staff Software Engineer to build and scale the infrastructure for NVIDIA's AI agent ecosystem, focusing on platform services for the full agent lifecycle, Kubernetes execution environments, CI/CD pipelines, and AI data platform components.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: 7w ago | AI score: 7
Senior Compiler Engineer - AI
NVIDIA is seeking a Senior Compiler Engineer with expertise in machine learning and compiler technologies to focus on applied AI and ML within compilers and development tools. The role involves working with Python, C/C++, Julia, and Lisp/Scheme, with a strong foundation in compilers, code generation, and GPU architecture. Experience with LLVM is a plus.
Stage: Serve | Function: Engineering | Location: Austin, TX +5 · Remote | First seen: 7w ago | AI score: 7
Senior Sensor Fusion Engineer - Autonomous Vehicles
NVIDIA is seeking a Senior Sensor Fusion Engineer for their Autonomous Vehicles team. The role involves developing core functionality for autonomous driving by fusing perception DNN and map signals to generate a real-time 3D world model. Responsibilities include enabling L3 autonomy, building fused obstacle and occupancy grids, and providing technical leadership. The position requires significant experience in AV or robotics, with a focus on sensor fusion and real-time systems.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA +3 · Remote | First seen: 7w ago | AI score: 7
Distinguished Software Architect - Deep Learning and HPC Communications
Distinguished Software Architect role focused on designing and researching next-generation communication libraries and platforms for Deep Learning and High Performance Computing at NVIDIA. The role involves co-designing HW/SW solutions with GPU, Networking, and SW architects, driving adoption of new communication technologies, and keeping up with DL research. Requires deep expertise in HPC, parallel programming, communication runtimes, system/GPU architecture, and networking, with strong programming skills in C/C++.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: 7w ago | AI score: 7
Manager, Next-Gen AI Cluster Validation
Manager to lead a team developing and validating next-generation NVIDIA AI supercomputing systems, integrating new compute, networking, storage, and software. Focus on building a platform for software development, automation, and performance engineering, and supporting large-scale deployments for AI and HPC.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: 8w ago | AI score: 7
GPU Power Architect - New College Grad 2026
NVIDIA is seeking a New College Grad Datacenter GPU Power Architect to contribute to the research and development of energy-efficient GPU and SOC architectures. The role involves developing power estimation models and tools, exploring energy efficiency at GPU and Datacenter levels, and deploying machine learning techniques to model GPU, CPU, Switch, and platform performance and power. The candidate will understand GenAI/HPC workload characteristics to drive HW/SW features for Perf@Watt improvements.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: 8w ago | AI score: 7
Senior Deep Learning Compiler Engineer
NVIDIA is seeking a Senior Deep Learning Compiler Engineer to develop compiler optimization algorithms for deep learning networks. This role involves collaborating with deep learning software framework and hardware architecture teams to accelerate next-generation deep learning software, focusing on public APIs, performance, and compiler infrastructure for neural networks.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: 8w ago | AI score: 7
Lead Software Engineer – Robotics Platform
Lead Software Engineer for NVIDIA's Robotics Platform, focusing on end-to-end solutions for training, simulation, and deployment of Physical AI on robots. The role involves leading development of core platform features, building reference integrations, and optimizing for NVIDIA hardware, with an emphasis on performance, reliability, and leveraging AI for development acceleration.
Stage: Ship, Agent | Function: Engineering | Location: Santa Clara, CA | First seen: 8w ago | AI score: 7
Senior Software Engineer - Deep Learning Compiler CI Infrastructure
Senior Software Engineer to own and evolve CI/CD infrastructure for NVIDIA's deep learning compiler stacks. Responsibilities include designing and operating scalable CI systems for ML workloads, delivering performance signals, and applying AI/agent-based workflows to improve developer efficiency and triage.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +3 | First seen: 8w ago | AI score: 7
Principal Software Engineer - DGX Cloud
Principal Software Engineer for NVIDIA's DGX Cloud team, focusing on building and scaling foundational systems for high-performance GPU infrastructure. The role involves leading the development of next-generation APIs, state management, and workflow orchestration systems to automate fleet lifecycle operations at scale, integrating AI schedulers and observability tools.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA +1 | First seen: 8w ago | AI score: 7
Senior ASIC Infrastructure Engineer
Senior ASIC Infrastructure Engineer at NVIDIA, focusing on defining and deploying AI/ML applications to enhance chip design, debug, and verification processes. The role involves collaborating with hardware design teams, architecting AI/LLM solutions, and staying updated on AI/ML advancements.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: 8w ago | AI score: 7
SoC Product Architect, Telecom AI RAN
NVIDIA is seeking a Lead SoC Product Architect for their Telecom AI RAN platform, focusing on defining the architecture and roadmap for radio and distributed unit products. The role involves analyzing workloads, driving competitive analysis, synthesizing customer requirements, and collaborating with engineering teams to ensure efficient implementation of AI-native RAN applications. The ideal candidate will have extensive experience in wireless RAN/baseband architecture or SoC product definition, with a strong understanding of 3GPP RAN standards and L1/PHY algorithms.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: 8w ago | AI score: 7
Senior System Software Engineer - Neural Graphics Performance
Senior System Software Engineer focused on optimizing neural graphics performance, specifically Gaussian Splatting and neural reconstruction algorithms, for applications in robotics, healthcare, and AV development. The role involves implementing and optimizing reconstruction/rendering algorithms using CUDA and Slang, optimizing data processing pipelines, and influencing software architecture for performance.
Stage: Serve, Data | Function: Engineering | Location: Santa Clara, CA | First seen: 8w ago | AI score: 7
Senior Manager, AlpaSim and AlpaDreams Production
Engineering leader to productize neural simulation technologies (AlpaDreams, AlpaSim) for AV and robotics, building a team and scaling an open-source platform by integrating research findings into a modular, production-ready system.
Stage: Ship, Serve | Function: Engineering | Location: Santa Clara, CA +2 | First seen: 8w ago | AI score: 7
Senior System Software Engineer - Dynamo-Triton Inference Server
Senior System Software Engineer to work on Dynamo-Triton Inference Server, a GPU-accelerated AI inference serving platform. The role involves developing high-performance inference software, contributing to feature development, driving customer adoption, and optimizing throughput and latency for both LLM and non-LLM workloads.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +2 · Remote | First seen: 8w ago | AI score: 7
Senior Software Performance Engineer - AV Platform
Senior Software Performance Engineer for Autonomous Vehicles platform, focusing on optimizing latency and throughput of L2/L3/L4 autonomous driving solutions on NVIDIA's heterogeneous hardware architectures. Requires strong C++ skills, parallel programming, performance analysis, and experience with GPGPU/CUDA.
Stage: Serve, Agent | Function: Engineering | Location: Santa Clara, CA | First seen: 8w ago | AI score: 7
Senior Solutions Architect - Autonomous Vehicles
Senior Solutions Architect role focused on integrating and deploying scalable solutions for autonomous vehicles using NVIDIA's platform. Responsibilities include leading vehicle software integration, driving use case analysis, defining requirements, and optimizing processing pipelines. Requires strong C/C++, Python, system performance evaluation, and experience with automotive design processes and standards.
Stage: Ship | Function: Engineering | Location: Santa Clara, CA | First seen: 8w ago | AI score: 7
Senior AI and ML HPC Cluster Engineer
This role focuses on designing, implementing, and managing large-scale GPU compute clusters for AI/ML and HPC workloads. It involves infrastructure engineering, automation, and supporting researchers with performance analysis and optimization. The role requires expertise in cluster management, Linux administration, container technologies, scripting, and MPI workflows.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: Apr 24 | AI score: 7
Manager, Software Architecture
Manager for a systems and networking engineering team focused on building distributed AI communication systems (libraries, frameworks, system integrations) for GPUs, nodes, and storage. The role involves setting technical direction, leading execution, and fostering technical excellence within the team, with a focus on AI infrastructure problems.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 23 | AI score: 7
Senior Formal Verification Engineer, GPU Kernels
Senior Formal Verification Engineer for GPU Kernels at NVIDIA. The role focuses on developing verification tools that combine formal methods and AI to ensure the correctness of performance-critical GPU kernels, enabling their use in safety-critical systems. Responsibilities include designing and implementing new verification approaches, integrating AI into formal verification workflows, and building agents for task automation.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 23 | AI score: 7
Principal Cyber Security Engineer - Agentic Identity and Security
This Principal Cyber Security Engineer role focuses on building core agentic identity and security capabilities for AI agents within NVIDIA's internal ecosystem. The role involves architecting and prototyping solutions for agent use cases across various environments, developing reusable tools and APIs, and collaborating with multiple teams to ensure secure and reliable agent operations. It requires strong software engineering, security, and identity expertise, with an emphasis on practical, production-ready systems.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 22 | AI score: 7
Principal Software Engineering Lead — Enterprise Data Platforms
Principal Software Engineer to lead engineering efforts in transforming enterprise systems with AI-infused automated solutions. This role involves architecting and delivering enterprise-grade AI applications and workflow platforms, building resilient end-to-end systems, and developing data integration capabilities for AI applications. Key responsibilities include building agentic workflow automation, defining agent interactions, and supporting orchestration patterns using frameworks like LangChain and LangGraph, while also operationalizing NVIDIA AI technologies.
Stage: Agent, Data | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 22 | AI score: 7
Senior Software Architect, AI Systems and Networking
This role focuses on building and optimizing systems-level software for high-performance communication and memory management libraries essential for distributed AI workloads. It involves hardware-software co-optimization, profiling data movement, and integrating networking capabilities into AI serving stacks, bridging applied research and production engineering.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +2 · Remote | First seen: Apr 19 | AI score: 7
Senior Software Engineer, Humanoid Robotics
Senior Software Engineer role focused on shaping the future of autonomous machines and building/deploying scalable robotic solutions, with a focus on humanoid robotics. Responsibilities include crafting application software architecture, scaling deployment of new technologies, integrating hardware/software, and providing technical guidance. Requires experience in robotics, ML/RL, and software development, with a strong plus for GPU programming/CUDA and sim-to-real transfer.
Stage: Ship, Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 17 | AI score: 7
Senior Data Analysis Engineer, AD Metrics - Autonomous Vehicles
Senior Data Analysis Engineer role focused on critical metrics for autonomous driving (AD) products. The role involves defining performance, supervising test events, analyzing trends, improving triage efficiency, and automating metric generation for dashboards. It requires strong technical ability, scripting experience, and leadership to support AV software development and release.
Stage: Ship | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 17 | AI score: 7
Deep Learning Kernel Software Performance Architect - New College Grad 2026
NVIDIA is seeking a Deep Learning Kernel Software Performance Architect to develop and analyze processor and system architectures that accelerate machine learning and data analytics applications. The role involves debugging deep learning software, developing analysis tools, and collaborating with various NVIDIA teams to optimize performance.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 16 | AI score: 7
Senior Software Triage Engineer - Autonomous Vehicles
Senior Software Triage Engineer for Autonomous Vehicles at NVIDIA, focusing on debugging, root cause analysis, and improving the quality of AV software through triage and evaluation.
Stage: Ship | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 7
Senior Manager, Site Reliability Engineering
Senior Manager of Site Reliability Engineering to lead and reshape IT operations at scale, building AI-powered systems for reliability, speed, and employee experience. Focuses on transforming Incident, Problem, and Change Management using observability, AI insights, and orchestration to move towards predictive and autonomous operations.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 7
Senior Software Engineer - Autonomous Vehicles
Senior Software Engineer for Autonomous Vehicles at NVIDIA, focusing on integrating ML and classical trajectory planners within a safety-oriented framework for SAE Level 3/4 autonomy. The role involves architectural work, establishing safety frameworks, building minimum-risk planning, and designing scalable architectures for self-driving products.
Stage: Ship, Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 7
Senior Software Systems Engineer, L3 and L4 - Autonomous Driving
Senior Software Systems Engineer role focused on L3 and L4 autonomous driving products at NVIDIA. Responsibilities include developing use cases, system requirements, performance analysis, formulating test cases, and defining test strategies. Requires strong experience in safety-critical systems engineering, SOTIF, functional safety, and understanding trade-offs between deep learning and classical approaches.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 13 | AI score: 7
Senior Software Engineer, Parking - Autonomous Vehicles
Senior Software Engineer for NVIDIA's Autonomous Vehicles team, focusing on building groundbreaking technology at the intersection of automotive and robotics for parking and driving features. The role involves designing and implementing planning algorithms in C++ and supporting deployment on test vehicles.
Stage: Agent | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 13 | AI score: 7
Lead Safety Architect - Autonomous Vehicles
Lead Safety Architect for NVIDIA's autonomous vehicle technology, focusing on integrating safety measures into DRIVE products and ensuring safety at scale. This role involves defining safety architecture, requirements, and metrics, and improving processes for safety analysis and data-driven evaluation.
Stage: Ship, Eval Gate | Function: Engineering | Location: Santa Clara, CA +5 · Remote | First seen: Apr 13 | AI score: 7
Senior Software Engineer, Machine Learning Inference
Senior Software Engineer role focused on designing and implementing inference software optimizations for NVIDIA TensorRT and TensorRT-LLM to accelerate AI applications on NVIDIA GPUs. Involves C++, Python, and CUDA development, collaboration with AI experts, and optimization of deep learning frameworks and compilers.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 10 | AI score: 7
Senior Math Libraries Engineer - Sparsity in AI
Software engineer to design and develop C++ libraries and tools for unstructured sparsity in Deep Learning (DL) and High-Performance Computing (HPC) on NVIDIA GPUs. This involves DSL specifications, on-demand code generation, and enabling the system in Python/PyTorch. The role focuses on performance evaluation, library quality, and collaboration with product management.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 10 | AI score: 7
Senior Software Engineer, JAX
Senior Software Engineer focused on performance optimizations for JAX, a deep learning framework, to build a scalable platform for data, training, and analysis. The role involves developing core JAX components, working with AI researchers, and building tools to improve AI system development efficiency.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 10 | AI score: 7
Senior AI and FSI Developer Technology Engineer
Senior AI and FSI Developer Technology Engineer at NVIDIA focused on optimizing AI and HPC workloads on NVIDIA CPUs and GPUs for the financial services industry. The role involves researching, designing, and developing techniques to accelerate these workloads, profiling and eliminating performance bottlenecks, and collaborating with internal and external experts to influence future hardware and software designs. The engineer will also publish and present their work.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +2 | First seen: Apr 10 | AI score: 7
Senior Architecture Energy Modeling Engineer
NVIDIA is seeking a Senior Architecture Energy Modeling Engineer to develop and deploy methodologies for energy-efficient products, focusing on building Machine Learning based power models for GPUs, CPUs, and Tegra SOCs. The role involves collaborating with various engineering teams to analyze and reduce power consumption, improve model accuracy, and integrate power models into simulation platforms.
Stage: Data | Function: Engineering | Location: Santa Clara, CA +1 | First seen: Apr 10 | AI score: 7
Autonomous Agent Engineer
NVIDIA is seeking an Autonomous Agent Engineer to build the infrastructure for secure and autonomous AI agent execution. This role involves designing and shipping SDKs, CLIs, and developer tooling for sandboxed compute environments, state management, and security boundaries for AI agents. The position requires strong systems engineering skills, experience with distributed systems and developer platforms, and proficiency in languages like Python or Go. Experience with agentic AI systems, sandboxing technologies, and security fundamentals is highly desirable.
Stage: Agent, Serve | Function: Engineering | Location: Santa Clara, CA | First seen: Apr 9 | AI score: 7
Senior Software Engineer - NIM Factory Container and Cloud Infrastructure
Senior Software Engineer role focused on container and cloud infrastructure for NVIDIA Inference Microservices (NIMs) and hosted services. The role involves designing and implementing container strategies, building enterprise-grade software for container build, packaging, and deployment, and improving reliability, performance, and scale across thousands of GPUs, with a focus on disaggregated LLM inference.
Stage: Serve | Function: Engineering | Location: Santa Clara, CA +1 · Remote | First seen: Apr 9 | AI score: 7