AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Currently tracking 440 active AI roles, down 53% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $100k–$575k (avg $262k).

Hiring
440 / 623
Momentum (4w)
↓-386 -53%
340 opens last 4w · 726 prior 4w
Salary range · avg $262k
$100k–$575k
USD · disclosed roles only
Tracked since
May '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
Dec 30
1 new role
Mar 10
1 new role
24
1 new role
Apr 28
4 new roles
May 12
5 new roles
19
3 new roles
26
3 new roles
Jun 2
2 new roles
9
1 new role
16
2 new roles
23
3 new roles
30
4 new roles
Jul 7
1 new role
14
2 new roles
28
4 new roles
Aug 11
6 new roles
18
2 new roles
25
3 new roles
Sep 1
8 new roles
15
3 new roles
22
6 new roles
29
2 new roles
Oct 6
2 new roles
13
3 new roles
20
6 new roles
27
9 new roles
Nov 3
8 new roles
10
8 new roles
17
4 new roles
24
11 new roles
Dec 1
9 new roles
8
14 new roles
15
10 new roles
22
8 new roles
29
107 new roles
Jan 5
22 new roles
12
45 new roles
19
32 new roles
26
59 new roles
Feb 2
64 new roles
9
63 new roles
16
83 new roles
23
83 new roles
Mar 2
88 new roles
9
97 new roles
16
72 new roles
23
215 new roles
30
158 new roles
Apr 6
250 new roles
13
199 new roles
20
332 new roles
27
304 new roles
May 4
189 new roles
11
131 new roles
18
102 new roles
25
129 new roles
Jun 1
122 new roles
8
49 new roles
15
40 new roles
22

NVIDIA currently has 496 active AI-related job listings. The majority of these roles, 52%, are focused on serving infrastructure, with agents representing another significant segment at 23%. Engineering is the dominant function, with 441 positions. The United States leads hiring geographies with 287 roles, followed by China with 64. Frequent tech tags include model_serving, inference_infra, and agent_orchestration, suggesting a focus on deployment and management of AI models. Over the last 30 days, NVIDIA posted 214 new AI roles, a 27% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is NVIDIA hiring for?

    NVIDIA currently has 487 active AI-related roles in our index. The most common open titles are: Deep Learning Performance Architect (4), Senior Deep Learning Performance Architect (4), AI Research Scientist (3), Developer Technology Engineer - AI (3), Manager, Deep Learning Algorithms (3). Most positions are in Engineering and Research.

  • What stage of AI development does NVIDIA focus on?

    NVIDIA's active AI hiring is concentrated in: serving infrastructure (54%), agents (21%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is NVIDIA hiring AI talent?

    NVIDIA is hiring AI talent in: United States (286 roles), China (59 roles), Israel (50 roles), Germany (21 roles).

  • What technologies does NVIDIA's AI team work with?

    Job postings at NVIDIA most frequently reference: model serving, inference infra, agent orchestration, llm observability, multimodal.

  • How many AI roles has NVIDIA posted recently?

    In the past 30 days, NVIDIA has posted 110 new AI-related roles. That is a -50% change versus the prior 30 days (218 → 110).

Jobs (224)

434 AI · 1824 total active
FilteredFunctionEngineering×CountryUnited States×Clear all
Show
Active onlyAI only (≥ 7)
Stage
AllData · 17Pretrain · 20Post-train · 28Serve · 236Agent · 95Eval Gate · 5Ship · 33
Function
AllEngineering · 375Research · 57Product · 2
Country
AllUnited States · 259China · 55Israel · 43Germany · 21Switzerland · 18United Kingdom · 14India · 13Poland · 12Vietnam · 12Canada · 10Italy · 7Netherlands · 6Singapore · 6France · 5Taiwan · 4Finland · 2Spain · 2Armenia · 1Czech Republic · 1Hungary · 1Japan · 1Romania · 1South Korea · 1Sweden · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Senior Manager, Engineering - Enterprise AI and Automation
Senior Engineering Manager to lead the strategy and execution for NVIDIA’s agentic developer platform, focusing on building, evaluating, and improving autonomous agents. The role involves identifying gaps, driving POCs, operationalizing approaches into reusable components, and establishing governance and safety mechanisms to scale autonomous systems within NVIDIA.
AgentServeEngineeringSanta Clara, CAFeb 239
Senior High-Performance AI Training Engineer
Senior engineer focused on optimizing AI training workloads for performance on NVIDIA's hardware and software stack, from drivers to DL frameworks, impacting hardware/software roadmap and contributing to MLPerf benchmarks.
Data
51–100 of 224← Prev12345Next →
Serve
Engineering
Santa Clara, CA
Feb 12
9
Senior Research Engineer Neural Reconstruction
Senior Research Engineer focused on neural reconstruction, developing and integrating neural rendering approaches for generative video, segmentation, and 3D reconstruction. The role involves adapting and fine-tuning generative models, collaborating on ML workflows, and contributing to core NVIDIA products. Requires strong Python and ML library skills, with experience in training and optimizing models.
Post-trainServeEngineeringSanta Clara, CAFeb 129
Distinguished Engineer – High Performance AI
Distinguished Engineer role focused on building groundbreaking agentic AI systems for the CUDA ecosystem, encompassing multi-agent runtimes, orchestration, data/evaluation pipelines, training/inference stacks, and GPU-accelerated execution. The role involves defining technical strategy, co-designing solutions with hardware/software teams, developing evaluation frameworks, and driving architecture across the AI stack.
AgentServeEngineeringSanta Clara, CA +5 · RemoteJan 159
Senior GPU Architect, Deep Learning
NVIDIA is seeking a Senior GPU Architect to design and enhance GPU architecture features specifically for deep learning workloads, covering both training and inference. The role involves developing simulators, mapping deep learning algorithms to hardware, and advancing parallel computation. Requires strong C++, C++, Perl, Python programming, and a background in computer architecture and high-performance computing.
ServeEngineeringSanta Clara, CA +2Jan 99
Senior Deep Learning Computer Architect
NVIDIA is seeking a Senior Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics algorithms. The role involves analyzing deep learning methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and core deep learning kernels.
ServeEngineeringSanta Clara, CA +1Jan 99
Senior Deep Learning Performance Architect
Senior Deep Learning Performance Architect role at NVIDIA focused on developing and analyzing next-generation architectures for AI and HPC applications. This involves performance modeling, simulation, and understanding the interplay of hardware and software for deep learning training and inference.
ServePost-trainEngineeringSanta Clara, CA +1Jan 99
Senior Research Engineer, Foundation Model Training Infrastructure
Senior/Principal Engineer to build cutting-edge infrastructure for large-scale foundation model training in the Generalist Embodied Agent Research (GEAR) group, focusing on Project GR00T for humanoid robots. Responsibilities include designing and optimizing distributed training systems, data loaders, and monitoring tools for multimodal foundation models.
PretrainPost-trainEngineeringSanta Clara, CAJan 99
Senior Manager, AlpaSim and AlpaDreams Production
Engineering leader to scale NVIDIA's interactive world-model platform (OmniDreams, FlashDreams, AlpaSim) into an industry standard, focusing on production engineering, performance, and developer ecosystem growth for applications in AV, robotics, rendering, and simulation.
ShipServeEngineeringSanta Clara, CA +21w ago8
Senior Systems Software Engineer, Semiconductor Systems Inspection
Senior Software Engineer to develop AI products for semiconductor inspection, focusing on computer vision, multimodal AI, anomaly detection, model compression, and deployment optimization. The role involves building models, adaptation workflows, and inference pipelines for production environments, with a focus on advancing roadmap progress and delivering practical systems.
ShipServeEngineeringSanta Clara, CA1w ago8
Senior Inference Engineer, AIConfigurator for Dynamo
Senior Inference Engineer role focused on optimizing LLM inference deployment configurations using AIConfigurator, integrating GPU systems, model serving, and performance modeling for NVIDIA platforms.
ServeEngineeringSanta Clara, CA +1 · Remote2w ago8
Distinguished Engineer - Wireless Infrastructure
NVIDIA is seeking a Distinguished Engineer to lead the technology strategy for next-generation wireless infrastructure, focusing on AI-RAN and Agentic Core. The role involves applying AI/ML to 6G RAN functions, transforming the wireless core into an agentic AI-based architecture, and driving rapid prototyping of GPU-accelerated platforms. Responsibilities include system architecture, design, development, and performance optimization for AI-for-RAN software stacks, as well as driving new applications in Integrated Sensing and Communications (ISAC) and Physical AI at the Edge. The position requires deep expertise in AI/ML, communication systems, and significant industry experience.
AgentDataEngineeringSanta Clara, CA +2 · Remote2w ago8
Senior Software Engineer - Autonomous Driving Simulation
Senior Software Engineer role focused on building and scaling realistic virtual environments for autonomous vehicle (AV) training, testing, and validation. The role involves developing simulation platforms, domain adaptation technologies (Real2Sim, Sim2Real), and optimizing large-scale simulation workflows. It requires strong programming skills in Python, C/C++, PyTorch, and experience with modern software engineering and infrastructure tools, as well as a background in computer vision, deep learning, or simulation systems.
DataAgentEngineeringSanta Clara, CA2w ago8
Senior Applied AI Engineer, Product Simulation
Senior Applied AI Engineer at NVIDIA to lead the rebuild of a silicon productization toolchain around AI. The role involves building agentic systems to demystify chip feature interactions, integrating AI tools into an agent harness, and leading eval-driven development for applied AI in production.
AgentEngineeringSanta Clara, CA2w ago8
Senior Software Engineer, Agentic Engineering
Senior Software Engineer to build agentic workflows for code generation, testing, and tuning within NVIDIA's frameworks and compilers. The role involves partnering with internal teams to develop and integrate AI agents into engineering processes, focusing on multi-agent orchestration and autonomous loops.
AgentEngineeringSanta Clara, CA +1 · Remote2w ago8
Systems Performance Engineer, Agentic AI Workloads – New College Grad 2026
This role focuses on modeling, simulating, and analyzing the system-level performance of agentic AI workloads in datacenter environments. The engineer will develop simulators, characterize LLM serving traffic, identify performance bottlenecks, and provide architectural recommendations for next-generation AI systems. The role requires strong programming skills in C++ and Python, a solid understanding of queueing theory, traffic modeling, and statistics, as well as fundamentals of deep learning and LLM inference serving.
ServeAgentEngineeringSanta Clara, CA +23w ago8
Software Engineering Manager, Robotics Neural Reconstruction and Real2Sim Applications
NVIDIA is seeking an Engineering Manager to lead a team focused on robotics Neural Reconstruction & Real2Sim Applications, advancing technologies for creating digital twins and workflows at scale for physical AI.
ShipDataEngineeringSanta Clara, CA3w ago8
Senior Applied AI and AI Infrastructure Engineer - Chip Design and DFX
Senior Engineer focused on Applied AI and AI Infrastructure for Chip Design and DFX at NVIDIA. The role involves building and managing deployment cycles for ML & Gen AI projects, establishing robust AI infrastructure, and applying AI methods to solve complex problems in Design For Test. Requires expertise in agents, multi-agentic ecosystems, SQL, ETL, data modeling, cloud platforms, and strong programming skills in Python/C++.
AgentServeEngineeringSanta Clara, CA3w ago8
Applied AI Engineer - VLSI Design
NVIDIA is seeking an Applied AI Engineer to develop and deploy AI agents leveraging LLMs to solve complex problems in VLSI design. The role involves designing and building infrastructure for LLM-powered engineering assistants and multi-turn dialogue systems, fine-tuning models, and integrating them with CAD flows.
AgentEngineeringSanta Clara, CA3w ago8
Senior ASIC AI Engineer
Develop AI powered methodologies and Agents to generate micro-architecture, RTL, and physical design starting with specification, using AI agents to process large data and existing codebase to generate skills that can be widely used. Evaluate latest Multi-agent collaboration frameworks and apply them to generate area/power/timing/functionally accurate designs for memory system units in the GPU.
AgentEngineeringSanta Clara, CA3w ago8
Deep Learning Computer Architect - New College Grad 2026
NVIDIA is seeking a Deep Learning Computer Architect to design hardware accelerator and processor architectures for next-generation platforms, enabling state-of-the-art machine learning and data analytics. The role involves analyzing DL methods, proposing new features for acceleration, and studying their benefits, with a focus on LLM workloads and deep learning kernels.
ServeEngineeringSanta Clara, CA +13w ago8
Engineering Manager, Inference Benchmarking — AI Perf
Engineering Manager for NVIDIA's AIPerf platform, a standard for assessing LLM serving performance. The role involves leading a team to build and advance the platform, focusing on core infrastructure, accuracy of benchmark results, and advising on upstream engine integrations for various AI workloads (LLM, multimodal, diffusion, computer vision). Requires strong systems engineering, inference infrastructure, and open-source community experience.
ServeEngineeringSanta Clara, CA +5 · Remote4w ago8
GPU Performance Engineer - Neural Reconstruction
GPU Performance Engineer focused on optimizing neural reconstruction and Gaussian Splatting workloads. This role involves profiling, identifying bottlenecks, and improving performance in CUDA, PyTorch, and C++ for training and rendering, while ensuring reconstruction quality is maintained. It requires strong programming, GPU optimization, and performance analysis skills, with collaboration across research and engineering teams.
ServeDataEngineeringCA +5 · Remote4w ago8
Principal Machine Learning Engineer, Accelerated Apache Spark
This role focuses on applying ML/AI to optimize and accelerate Apache Spark workloads on NVIDIA GPUs, involving performance prediction, adaptive systems, and developing AI agents for system issue resolution and optimization. The role requires significant experience in ML/DL solution design, productionization, and large-scale data processing platforms like Spark, with a focus on LLM/GenAI, reinforcement learning, and adaptive ML systems.
AgentServeEngineeringSanta Clara, CA5w ago8
Senior AI Infrastructure Software Engineer - DGX Cloud
NVIDIA is seeking a Senior AI Infrastructure Software Engineer to design, build, and maintain AI platforms for large-scale AI training, inferencing, fine-tuning, and Agentic AI in production. The role involves developing platform and tools for AI/ML workload efficiency, resiliency, and observability, with a focus on distributed systems and Kubernetes.
ServeEngineeringSanta Clara, CA +3 · Remote6w ago8
Senior Engineer - AI Agents and Systems
Senior Engineer role focused on deploying advanced AI agent frameworks and local runtimes to Windows and NVIDIA GeForce RTX GPUs, ensuring open-source AI agents run locally, safely, and efficiently on consumer PCs, and creating the foundation of the desktop AI operating system.
AgentEngineeringSanta Clara, CA +16w ago8
Senior Engineer - AI Agents and Systems
Senior Engineer role focused on deploying advanced AI agent frameworks and local runtimes to Windows and NVIDIA GeForce RTX GPUs, ensuring open-source AI agents run locally, safely, and efficiently on consumer PCs. The role involves leading development for the foundation of the desktop AI operating system by combining local inference with robust privacy routers and sandboxed execution.
AgentEngineeringSanta Clara, CA +16w ago8
Senior Performance Compiler Engineer - Triton
Senior Performance Compiler Engineer to work on the open-source Triton compiler project, focusing on using compilers to improve AI performance on NVIDIA GPUs for large language models, agents, and other AI applications. The role involves investigating GPU hardware, designing and implementing compiler technology using MLIR to optimize kernel descriptions for efficient GPU code generation, and collaborating with internal teams.
ServeEngineeringRedmond, WA +5 · Remote7w ago8
Senior Systems Engineer, Neural Graphics
Senior Systems Engineer role focused on integrating AI and traditional rendering techniques for real-time visual experiences. The role involves taking innovative techniques like AlpaDreams and driving them into production-ready, real-time pipelines, owning the end-to-end path from prototype to shipping product, and solving complex systems challenges related to latency, memory, and throughput. Requires deep expertise in graphics and AI, with a strong track record of shipping impactful products and experience with systems-level thinking.
ShipAgentEngineeringSanta Clara, CA7w ago8
Senior GPU System Architect
Seeking a Senior GPU System Architect to design multi-GPU scale-up and scale-out systems for AI and HPC datacenters. The role involves defining system architectures that integrate GPU compute, memory, and interconnects for optimal AI performance and scalability. Requires deep experience in system-level fabric/networking architecture and hardware-software co-design.
ServeEngineeringSanta Clara, CA7w ago8
Senior Research Engineer, Robotics Systems
Senior/Principal Engineer in robotics systems, focusing on foundation models and full-stack technology for humanoid robots. Responsibilities include designing teleoperation software, optimizing control stacks, deploying neural network models on hardware, and collaborating on the MLOps lifecycle. Requires strong robotics and software engineering background, with experience in real-time control and deploying ML models on robotic hardware.
ShipDataEngineeringSanta Clara, CA7w ago8
Senior Perception Engineer - Autonomous Vehicles
Senior Perception Engineer at NVIDIA focused on developing and productizing autonomous driving solutions using deep learning and multi-sensor fusion. The role involves applied research, algorithm development, and ensuring solutions meet production requirements for safety, latency, and robustness.
ShipPost-trainEngineeringSanta Clara, CA7w ago8
Senior Deep Learning Performance Architect
Senior Deep Learning Performance Architect at NVIDIA to design and evaluate hardware architectures for AI/HPC applications, focusing on LLM inference and training performance, and optimizing system bottlenecks.
ServePost-trainEngineeringSanta Clara, CA +17w ago8
Senior Data Center Performance Engineer - Benchmarking and Optimization
Senior Data Center Performance Engineer at NVIDIA focused on benchmarking and optimizing data center platforms for AI training, inference, and HPC workloads. Responsibilities include designing benchmarks, characterizing workloads, identifying bottlenecks, and driving performance improvements through system tuning and architectural recommendations.
ServeEngineeringSanta Clara, CA +1 · Remote7w ago8
NCX Engineer, AI Accelerator
This role focuses on engineering and deploying AI infrastructure and solutions for strategic customers, optimizing large-scale training and inference workloads on NVIDIA's AI platform. It involves MLOps, Kubernetes, GPU scheduling, and performance tuning, with a strong emphasis on customer-facing technical support and collaboration.
ServePost-trainEngineeringSanta Clara, CA +17w ago8
Senior Deep Learning Framework Communications Engineer
Senior Deep Learning Framework Communications Engineer at NVIDIA, focusing on integrating and optimizing communication libraries (NCCL, NVSHMEM) within AI frameworks (PyTorch, TRT-LLM, vLLM, JAX) to enhance performance for large-scale AI training and inference. The role involves deep analysis of AI workloads, compiler improvements, and kernel authoring for multi-GPU systems.
ServeEngineeringSanta Clara, CA +4 · Remote7w ago8
Senior Scientific Machine Learning Engineer – Earth-2
Develops and enhances machine learning frameworks (NVIDIA PhysicsNeMo, NVIDIA Earth2Studio) for scientific ML technology in weather, climate, and earth system modeling. Focuses on implementing new deep learning techniques and enhancing Earth-2 technologies.
Post-trainEngineeringSanta Clara, CA +1 · Remote7w ago8
Director, System Software Engineering - Metropolis Accelerated and Inferencing Software
NVIDIA is seeking a Director of System Software Engineering to lead teams responsible for the full lifecycle of Vision AI strategy, from model onboarding to production deployment. The role focuses on transforming foundation models into real-time, GPU-accelerated video intelligence systems, scaling multimodal reasoning, and enabling agentic development workflows. Key responsibilities include architecting and operationalizing inference acceleration, driving implementations of frameworks like TensorRT and VLLM, collaborating with partners on custom models, and ensuring performance benchmarking. The ideal candidate has extensive experience in deep learning, GPU optimization, and leading engineering teams in embedded and enterprise platforms.
ServeAgentEngineeringSanta Clara, CA8w ago8
Director, Isaac for Healthcare Engineering
Director of Engineering for NVIDIA's Isaac for Healthcare initiative, focusing on building a platform for healthcare robotics companies to develop, simulate, train, and deploy physical AI systems. The role involves platform leadership, team building, partner enablement, technical strategy, and cross-functional collaboration, with a strong emphasis on shipping sophisticated software platforms at scale.
ShipDataEngineeringSanta Clara, CA8w ago8
Manager, Solutions Architecture - Global Partner Team
Manager of Solutions Architecture for NVIDIA's Global Partner Team, focusing on leading technical engagements with GSIs and AI consulting firms. The role involves building and scaling Agentic AI services, providing architectural oversight for complex AI workflows, and collaborating with product and engineering teams. Requires deep technical expertise in Generative/Agentic AI, RAG, LLM orchestration, and AI infrastructure.
AgentEngineeringSanta Clara, CA8w ago8
Senior Software Architect - Deep Learning and HPC Communications
Senior Software Architect role at NVIDIA focused on designing and implementing next-generation data center platforms and scalable communication software for AI and HPC workloads. The role involves investigating performance bottlenecks, developing new communication technologies, exploring hardware/software co-design, and building proofs-of-concept to drive innovation in large-scale GPU clusters.
ServeEngineeringSanta Clara, CA +4 · Remote8w ago8
Senior Software Engineer - VLM Microservices for Neural Reconstruction
Senior Software Engineer to design, build, and optimize containerized inference execution for 3D Vision Language Models (VLMs) for neural reconstruction, turning research into production-grade software (NIMs). The role involves developing benchmarks, releasing and maintaining models, contributing to open-source projects like vLLM, and collaborating with research and product teams. Requires experience with AI distributed systems, inference platforms, Python/C++, and software engineering fundamentals.
ServePost-trainEngineeringSanta Clara, CA +18w ago8
Senior Applied Machine Learning Engineer - VLSI Design
NVIDIA is seeking a Senior Applied Machine Learning Engineer to build AI-driven software systems for circuit design, combining automation algorithms, DL models, and agentic workflows. The role involves working on pre-silicon and post-silicon hardware design data, circuit optimization, and AI systems for EDA/design automation, translating requirements into AI/ML and agentic system problems, and testing/releasing models and AI systems.
AgentDataEngineeringSanta Clara, CA8w ago8
Applied Machine Learning Engineer, Circuit Design - New College Grad 2026
NVIDIA is seeking an Applied Machine Learning Engineer for their Circuit Design team, focusing on building AI-driven software systems that combine automation algorithms, DL models, and agentic workflows to accelerate end-to-end circuit design. The role involves working with hardware design data, circuit optimization, and developing AI/ML solutions for EDA, with a focus on agent-driven design exploration and optimization.
AgentEngineeringSanta Clara, CA +1 · Remote8w ago8
Principal AI and ML Infra Software Engineer, GPU Clusters
This role focuses on enhancing the efficiency of AI and ML research on GPU clusters by collaborating with researchers to identify and address infrastructure deficiencies. The engineer will optimize performance, monitor resource utilization, and contribute to the AI/ML infrastructure ecosystem, keeping up-to-date with the latest AI/ML technologies.
ServeEngineeringSanta Clara, CA +18w ago8
Senior Deep Learning Software Engineer - Autonomous Vehicles
Senior Deep Learning Software Engineer focused on developing and productizing deep learning solutions for autonomous vehicles. The role involves training, fine-tuning, optimizing perception DNNs, applying quantization, improving DNN architectures, and enhancing inference speed and power consumption. It requires strong programming skills, experience with deep learning frameworks, computer vision tasks, and familiarity with CNNs and Transformer architectures. Experience with low precision inference, quantization, and NVIDIA software libraries is a plus.
ServePost-trainEngineeringSanta Clara, CA +3 · RemoteApr 248
Compiler Engineer - AI Inference
NVIDIA is seeking an AI Compiler Engineer to optimize kernel generation and computational graph optimizations for AI inference and training workloads on next-generation GPUs. The role involves hands-on development, collaboration on hardware/software co-design, and scaling AI deployments in datacenters.
ServePost-trainEngineeringSanta Clara, CAApr 248
Senior Software Engineer, Metropolis Vision AI
Senior Software Engineer to develop and optimize high-performance Vision AI pipelines and large-scale distributed services for processing video, image, and 3D data. The role involves crafting real-time systems, developing multi-modal perception, using simulation/synthetic data, and profiling/tuning GPU-accelerated inference pipelines. Collaboration with research and platform teams is key, with an emphasis on bringing research into production at scale.
ServePost-trainEngineeringSanta Clara, CAApr 248
Senior Software Engineer, AI Networking
Senior Software Engineer role focused on building and productizing ML tools for optimizing AI workloads (LLM training/inference) across GPU/CPU clusters, with a focus on networking and system resource utilization. Involves distributed deep learning, ML-based optimization techniques, and performance analysis.
ServeAgentEngineeringSanta Clara, CA +1Apr 248
Senior AI-Native Systems Software Engineer, TensorRT
Senior engineer to architect and build an AI-native framework using AI agents for software development, focusing on scaling, performance optimization, and integrating SOTA models for inference.
AgentServeEngineeringSanta Clara, CAApr 218