AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-386 -53%
340 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
40 new roles
22

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (356)

434 AI · 1824 total active
FilteredStageServe×
Show
Active onlyAI only (≥ 7)
Stage
AllData · 28Pretrain · 30Post-train · 51Serve · 356Agent · 192Eval Gate · 11Ship · 55
Function
AllEngineering · 627Research · 82Product · 14
Country
AllUnited States · 439China · 93Israel · 54Germany · 36Switzerland · 31India · 26United Kingdom · 24Poland · 17Vietnam · 13Canada · 12Singapore · 11France · 10Netherlands · 9Italy · 8Taiwan · 6Hong Kong · 4Japan · 4Spain · 3Australia · 2Czech Republic · 2Finland · 2Hungary · 2South Korea · 2Armenia · 1Brazil · 1Mexico · 1Romania · 1Saudi Arabia · 1Sweden · 1United Arab Emirates · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Deep Learning Performance Software Engineer
Develops GPU-accelerated deep learning software, including compilers, DSLs, and optimized kernels, for current and next-generation chips, focusing on performance analysis of AI workloads and integration with AI frameworks.
ServeEngineeringShanghai, ChinaApr 49
Research Scientist, AI Accelerator Design and VLSI - New College Grad 2026
Research Scientist role focused on AI Accelerator Design and VLSI, involving AI HW/SW Co-Design, quantization, and applying generative AI to hardware design. Requires a PhD and experience in VLSI, computer architecture, or numerical algorithms for AI. Collaborates on research prototypes and publishes findings.
ServeResearchSanta Clara, CAApr 49
Senior DGX Cloud AI Infrastructure Software Engineer
51–100 of 356← Prev123…8Next →
NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to develop and optimize infrastructure software and tools for large-scale AI training, post-training, and inference. The role focuses on improving efficiency and resiliency of AI workloads, co-designing APIs, and enhancing AI platforms, requiring strong debugging and distributed systems experience.
ServePost-train
Engineering
Santa Clara, CA +4 · Remote
Apr 2
9
Senior GPU Networking Architect
This role focuses on building and optimizing GPU communication kernels for large-scale AI systems, linking GPU computing with networking. The Senior GPU Networking Architect will leverage deep knowledge of GPU architecture to improve kernel efficiency, minimize latency, and overlap computation with communication. Responsibilities include developing GPU-resident communication primitives, profiling and tuning kernels, and collaborating with various teams to co-design communication strategies. The role requires strong CUDA programming, GPU architecture fundamentals, and systems-level C/C++ development.
ServeEngineeringZurich, Switzerland +4 · RemoteMar 309
Senior Software Architect, AI Networking
NVIDIA is looking for a Senior Software Architect to design and optimize inference infrastructure for large language models running on GPU clusters. The role involves working across software and hardware domains to define deployment and scaling strategies, optimize latency and throughput, and collaborate with various teams to ensure high-performance solutions.
ServeEngineeringTel Aviv, Israel +1Feb 49
Senior Deep Learning Algorithm Engineer
Senior Deep Learning Algorithm Engineer at NVIDIA focused on optimizing deep learning training and inference workloads on state-of-the-art hardware and software platforms. The role involves performance analysis, profiling, and implementation of production-quality software, with a focus on squeezing performance from hardware and software stacks.
ServePost-trainEngineeringHo Chi Minh City, Vietnam +1 · RemoteJan 119
Research Scientist, ML Systems - PhD New College Grad 2026
Research Scientist role focused on ML Systems, contributing to hardware, software, and infrastructure for ML systems at various scales. The role involves understanding and developing solutions for efficiency, scaling, and resilience in ML systems, with a focus on co-design of algorithms and systems. Requires a PhD and expertise in areas like OS, distributed systems, inference/training systems, data management, cloud computing, or computer architecture.
ServePost-trainResearchSanta Clara, CA +3Jan 99
Senior GPU Architect, Deep Learning
NVIDIA is seeking a Senior GPU Architect to design and enhance GPU architecture features specifically for deep learning workloads, covering both training and inference. The role involves developing simulators, mapping deep learning algorithms to hardware, and advancing parallel computation. Requires strong C++, C++, Perl, Python programming, and a background in computer architecture and high-performance computing.
ServeEngineeringSanta Clara, CA +2Jan 99
Senior Deep Learning Computer Architect
NVIDIA is seeking a Senior Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics algorithms. The role involves analyzing deep learning methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and core deep learning kernels.
ServeEngineeringSanta Clara, CA +1Jan 99
Senior Deep Learning Performance Architect
Senior Deep Learning Performance Architect role at NVIDIA focused on developing and analyzing next-generation architectures for AI and HPC applications. This involves performance modeling, simulation, and understanding the interplay of hardware and software for deep learning training and inference.
ServePost-trainEngineeringSanta Clara, CA +1Jan 99
Senior Deep Learning Software Engineer, Inference
Senior Software Engineer specializing in Deep Learning Inference, focusing on optimizing GPU-accelerated software for large-scale model serving and inference using frameworks like SGLang and vLLM. The role involves performance tuning, implementing latest algorithms, and scaling performance across NVIDIA accelerators.
ServeEngineeringNetherlands +2 · RemoteJan 99
Research Scientist, ML Systems - PhD New College Grad 2026
Research Scientist role focusing on ML Systems, contributing to hardware, software, and infrastructure for training, fine-tuning, and serving ML models at scale. Requires a PhD and expertise in systems research areas.
ServePost-trainResearchSingapore, Singapore · RemoteDec '259
Senior Software Architect, AI Networking
Senior Software Architect role focused on designing and optimizing large-scale LLM inference infrastructure on GPU clusters, involving system-level optimizations for latency, throughput, and cost-efficiency.
ServeEngineeringTel Aviv, IsraelDec '259
Senior Software Research Architect, AI Networking
NVIDIA is seeking a Senior Software Research Architect to improve the framework for large-scale LLM learning and prediction. This role focuses on designing and optimizing systems for generative AI workloads on advanced GPU clusters, specifically leveraging the NVIDIA Spectrum-X Networking Platform to define deployment and scaling strategies. The architect will work on inter-node communication, compute scheduling, and system-level optimization, collaborating with engineers and researchers to enable generative AI technologies in real-world applications.
ServePretrainResearchTel Aviv, IsraelNov '259
AI Computing Software Development Engineer, TensorRT-LLM
NVIDIA is seeking a Software Development Engineer for its TensorRT-LLM team to develop and optimize LLM inference software for various platforms. The role involves performance analysis, tuning, and contributing to the architecture and hardware design, with a focus on scaling inference capabilities.
ServeEngineeringTaipei, Taiwan +1Sep '259
AI Computing Software Development Engineer, LLM Inference
Software Development Engineer focused on LLM inference software (TensorRT LLM and TensorRT Edge LLM) at NVIDIA, involving crafting, scaling, performance analysis, optimization, and tuning of inferencing software for GPUs. The role requires strong C/C++ skills, experience with deep learning frameworks, and collaboration across teams.
ServeEngineeringShanghai, China +11w ago8
Senior Inference Engineer, AIConfigurator for Dynamo
Senior Inference Engineer role focused on optimizing LLM inference deployment configurations using AIConfigurator, integrating GPU systems, model serving, and performance modeling for NVIDIA platforms.
ServeEngineeringSanta Clara, CA +1 · Remote2w ago8
AI Computing Software Development Engineer, TensorRT
NVIDIA is seeking an AI Computing Software Development Engineer for its TensorRT team to craft and develop robust, scalable inferencing software for GPUs. The role involves performance analysis, optimization, tuning, and collaborating with various teams to guide the direction of machine learning inferencing. Requires a Masters or higher degree, 2+ years of software development experience, strong C/C++ skills, and familiarity with deep learning frameworks.
ServeEngineeringShanghai, China2w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM AIGV
NVIDIA is seeking software engineers to develop and optimize inferencing software (TensorRT/TensorRT-LLM) for AI computing. The role involves performance analysis, tuning, integrating AI advancements, and collaborating across teams to shape machine learning inferencing on NVIDIA platforms. Requires strong programming skills, experience with deep learning frameworks, and a proactive approach.
ServeEngineeringShanghai, China +22w ago8
DL System Software Engineer - AI Platform
NVIDIA is seeking a DL System Software Engineer to join their AI Platform team. The role involves developing and building solutions for scheduling large-scale AI training and inference workloads on GPU clusters, optimizing performance and efficiency for large models. The engineer will work on core infrastructure, resource management, and GPU scheduling, contributing to NVIDIA's AI platform.
ServePost-trainEngineeringToronto, ON2w ago8
Software Engineer, AI Networking Architect
NVIDIA is seeking an AI Networking Architect to optimize AI workload performance by analyzing AI models, distributed training, and inference workloads, and translating research insights into software, hardware, and networking architecture requirements. The role involves building platforms and simulations to evaluate trade-offs and influence future NVIDIA product roadmaps.
ServeAgentEngineeringTel Aviv, Israel +12w ago8
GPU Performance Engineer - Neural Reconstruction
GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads, involving PyTorch, CUDA, and GPU profiling to improve training and rendering performance.
ServePost-trainEngineeringCanada · Remote3w ago8
Developer Technology Engineer - AI
NVIDIA is seeking an AI Developer Technology Engineer to study and develop cutting-edge deep learning techniques, analyze and optimize performance on GPU architectures, and work with customers to provide AI solutions using GPUs. The role involves close collaboration with internal NVIDIA teams to influence future architectures and software platforms.
ServeEngineeringShanghai, China +23w ago8
Systems Performance Engineer, Agentic AI Workloads – New College Grad 2026
This role focuses on modeling, simulating, and analyzing the system-level performance of agentic AI workloads in datacenter environments. The engineer will develop simulators, characterize LLM serving traffic, identify performance bottlenecks, and provide architectural recommendations for next-generation AI systems. The role requires strong programming skills in C++ and Python, a solid understanding of queueing theory, traffic modeling, and statistics, as well as fundamentals of deep learning and LLM inference serving.
ServeAgentEngineeringSanta Clara, CA +23w ago8
Deep Learning Computer Architect - New College Grad 2026
NVIDIA is seeking a Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics. The role involves analyzing DL methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and deep learning kernels.
ServeEngineeringSanta Clara, CA +13w ago8
Senior Manager, Artificial Intelligence - Machine Learning Platform
Senior Manager for AI/ML Platform at NVIDIA, leading the development and management of tools and services for the entire AI/ML project lifecycle, focusing on large-scale model training and deployment efficiency. Requires extensive experience in AI/ML infrastructure, team leadership, and strategic vision for AI platforms.
ServePost-trainEngineeringSanta Clara, CA +2 · Remote4w ago8
Manager, Deep Learning Algorithms
Manager to lead engineering activities for productizing Deep Learning models, focusing on implementing and optimizing state-of-the-art algorithms for GPU-accelerated platforms. The role involves leading a team, collaborating with internal partners on roadmap development, and deploying training and inference workloads.
ServeDataEngineeringWarsaw, Poland +1 · Remote4w ago8
Engineering Manager, Inference Benchmarking — AI Perf
Engineering Manager for NVIDIA's AIPerf platform, a standard for assessing LLM serving performance. The role involves leading a team to build and advance the platform, focusing on core infrastructure, accuracy of benchmark results, and advising on upstream engine integrations for various AI workloads (LLM, multimodal, diffusion, computer vision). Requires strong systems engineering, inference infrastructure, and open-source community experience.
ServeEngineeringSanta Clara, CA +5 · Remote4w ago8
AI Computing Development Engineer, TensorRT and TensorRT-LLM
NVIDIA is seeking software engineers to develop and optimize AI inference software (TensorRT/TensorRT-LLM) for GPUs. The role involves performance analysis, tuning, integrating new advancements, and collaborating across teams to shape the future of machine learning inferencing.
ServeEngineeringShanghai, China4w ago8
GPU Performance Engineer - Neural Reconstruction
GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads. This role involves profiling, identifying bottlenecks, and improving performance in CUDA, PyTorch, and C++ for training and rendering, while ensuring reconstruction quality is maintained. It requires strong programming, GPU optimization, and performance analysis skills, with collaboration across research and engineering teams.
ServeDataEngineeringCA +5 · Remote4w ago8
Deep Learning Performance Architect
NVIDIA is seeking a Deep Learning Performance Architect to develop and optimize GPU-accelerated deep learning inference software, focusing on highly optimized kernels, performance analysis, and tuning. The role involves collaboration across various domains like automotive, image, and speech understanding, and requires strong C/C++ skills and GPU programming experience.
ServeEngineeringShanghai, China +15w ago8
Senior DGX Cloud AI Infrastructure Software Engineer
NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to design, build, and maintain AI infrastructure for large-scale AI training and inferencing. The role involves optimizing efficiency and resiliency of AI workloads, developing scalable AI and Data infrastructure tools, and ensuring high availability of AI systems.
ServeDataEngineeringShanghai, China5w ago8
AI Software Engineer, Kernel Libraries - New College Grad 2026
AI Software Engineer focused on developing inference systems software stack, including libraries, code generators, and GPU kernels for NVIDIA's hardware. The role involves innovating for efficient AI inference, optimizing kernels, designing abstractions for LLM serving engines, and building JIT compilers and runtimes. Collaboration with internal teams and contributions to open-source projects like FlashInfer, vLLM, and SGLang are expected.
ServeEngineeringSanta Clara, CA5w ago8
Senior AI Infrastructure Software Engineer - DGX Cloud
NVIDIA is seeking a Senior AI Infrastructure Software Engineer to design, build, and maintain AI platforms for large-scale AI training, inferencing, fine-tuning, and Agentic AI in production. The role involves developing platform and tools for AI/ML workload efficiency, resiliency, and observability, with a focus on distributed systems and Kubernetes.
ServeEngineeringSanta Clara, CA +3 · Remote6w ago8
Software Engineer - AI Research Clusters
Software Engineer to build and maintain GPU clusters for internal AI researchers, focusing on reliability, performance, and self-service. The role involves applying AIOps and Agentic AI to reduce operational toil and support the training, fine-tuning, and deployment of advanced ML models.
ServeEngineeringSanta Clara, CA +5 · Remote6w ago8
Senior Performance Compiler Engineer - Triton
Senior Performance Compiler Engineer to work on the open-source Triton compiler project, focusing on using compilers to improve AI performance on NVIDIA GPUs for large language models, agents, and other AI applications. The role involves investigating GPU hardware, designing and implementing compiler technology using MLIR to optimize kernel descriptions for efficient GPU code generation, and collaborating with internal teams.
ServeEngineeringRedmond, WA +5 · Remote7w ago8
Machine Learning Intern - 2026
NVIDIA is seeking a Machine Learning Intern to assist with developing demonstrations using NVIDIA SDKs, algorithmic development, and AI software development. The role involves keeping up with the latest NVIDIA technology, building demos, and engaging the AI community through workshops.
ServeEngineeringSTP, Hong Kong7w ago8
Solutions Architect - AI for Drug Discovery
NVIDIA seeks a Solutions Architect for their EMEA team to drive AI adoption in drug discovery within the biopharma industry. The role involves acting as a technical advisor to pharmaceutical companies, biotechs, and research organizations, leveraging NVIDIA's computing platform. Responsibilities include building proof-of-concept demonstrations, scaling AI deployments, and supporting business development by guiding customers on production-grade inference, model training, RL, and post-training algorithms. The role also involves exploring foundation models, agentic LLM applications, and physical AI in biopharma, providing feedback to internal teams, and documenting/teaching NVIDIA solutions.
ServePost-trainEngineeringUnited Kingdom +5 · Remote7w ago8
Senior GPU System Architect
Seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out systems for AI and HPC datacenters. The role involves defining system architectures that integrate GPU compute, memory, and interconnects for optimal AI performance and scalability. Requires deep experience in system-level fabric/networking architecture and hardware-software co-design.
ServeEngineeringSanta Clara, CA7w ago8
Solution Architect, Generative AI
NVIDIA is seeking a Solution Architect to promote adoption and provide technical support for their GPU-accelerated computing solutions, focusing on generative AI, machine learning, and deep learning for enterprise clients in Japan. The role involves pre-sales activities, technical support for model training and deployment, and developing solutions for inference and agent-based systems.
ServeAgentEngineeringTokyo, Japan7w ago8
Senior Deep Learning Performance Architect
Senior Deep Learning Performance Architect at NVIDIA to design and evaluate hardware architectures for AI/HPC applications, focusing on LLM inference and training performance, and optimizing system bottlenecks.
ServePost-trainEngineeringSanta Clara, CA +17w ago8
Senior Data Center Performance Engineer - Benchmarking and Optimization
Senior Data Center Performance Engineer at NVIDIA focused on benchmarking and optimizing data center platforms for AI training, inference, and HPC workloads. Responsibilities include designing benchmarks, characterizing workloads, identifying bottlenecks, and driving performance improvements through system tuning and architectural recommendations.
ServeEngineeringSanta Clara, CA +1 · Remote7w ago8
NCX Engineer, AI Accelerator
This role focuses on engineering and deploying AI infrastructure and solutions for strategic customers, optimizing large-scale training and inference workloads on NVIDIA's AI platform. It involves MLOps, Kubernetes, GPU scheduling, and performance tuning, with a strong emphasis on customer-facing technical support and collaboration.
ServePost-trainEngineeringSanta Clara, CA +17w ago8
Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026
NVIDIA is seeking engineers to develop algorithms and optimizations for their LPX inference and compiler stack, working at the intersection of large-scale systems, compilers, and deep learning to optimize neural network workloads on future NVIDIA platforms. The role involves building and maintaining high-performance runtime and compiler components, defining workload mappings, integrating with the SW ecosystem, benchmarking, profiling, and collaborating with hardware teams. It also includes prototyping new compilation techniques and publishing technical work.
ServeEngineeringToronto, ON +1 · Remote7w ago8
Senior AI Solutions Architect
NVIDIA is seeking an AI Solutions Architect with deep expertise in AI solutions and scalable data center infrastructure. The role involves embedding NVIDIA software into customer architectures, improving application performance, and establishing technical foundations for next-generation AI systems. Responsibilities include supporting business development, working directly with developers and customers, analyzing architectures for acceleration opportunities, and delivering trainings.
ServeAgentEngineeringSanta Clara, CA +1 · Remote7w ago8
Senior Deep Learning Framework Communications Engineer
Senior Deep Learning Framework Communications Engineer at NVIDIA, focusing on integrating and optimizing communication libraries (NCCL, NVSHMEM) within AI frameworks (PyTorch, TRT-LLM, vLLM, JAX) to enhance performance for large-scale AI training and inference. The role involves deep analysis of AI workloads, compiler improvements, and kernel authoring for multi-GPU systems.
ServeEngineeringSanta Clara, CA +4 · Remote7w ago8
Director, System Software Engineering - Metropolis Accelerated and Inferencing Software
NVIDIA is seeking a Director of System Software Engineering to lead teams responsible for the full lifecycle of Vision AI strategy, from model onboarding to production deployment. The role focuses on transforming foundation models into real-time, GPU-accelerated video intelligence systems, scaling multimodal reasoning, and enabling agentic development workflows. Key responsibilities include architecting and operationalizing inference acceleration, driving implementations of frameworks like TensorRT and VLLM, collaborating with partners on custom models, and ensuring performance benchmarking. The ideal candidate has extensive experience in deep learning, GPU optimization, and leading engineering teams in embedded and enterprise platforms.
ServeAgentEngineeringSanta Clara, CA8w ago8
Senior Software Architect - Deep Learning and HPC Communications
Senior Software Architect role at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software for AI and HPC workloads. The role involves investigating performance bottlenecks, developing new communication technologies, exploring hardware/software co-design, and building proofs-of-concept to drive innovation in large-scale GPU clusters.
ServeEngineeringSanta Clara, CA +4 · Remote8w ago8
Senior Solutions Architect - Deep Learning
Senior Solutions Architect focused on Deep Learning and Agentic AI tools, collaborating with customers to build solutions using NVIDIA technology. Responsibilities include technical sales support, integrating NVIDIA tech into HPC, championing Deep Learning internally, and developing demo solutions.
ServeAgentEngineeringTel Aviv, Israel8w ago8
Senior Solutions Architect - AI Factory Deployment
Senior Solutions Architect focused on deploying and validating AI factories, specifically running and debugging AI/LLM workloads on GPU clusters. Responsibilities include setting up environments, executing benchmarks, resolving performance issues, building observability, and recommending optimizations.
ServeEngineeringCA +3 · Remote8w ago8