Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
The companies with the most active Inference infra listings are Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), and Microsoft (104 roles).
95 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Wayve | Tech Lead, Autonomy Performance - Robotaxi | Robotics | 9 | Embodied AI · Evals · Model serving |
| Apptronik | Staff MLOps Engineer | Robotics | 9 | Model serving · Evals |
| Waabi | Distillation Lead | Robotics | 9 | Fine-tuning · Model serving · Quantization · Multimodal · Embodied AI |
| Agility Robotics | Senior AI Research Engineer | Robotics | 9 | Embodied AI · Model serving |
| Carbon Robotics | Deep Learning Engineer | Robotics | 9 | Model serving |
| Figure AI | Helix AI Engineer, Reinforcement Learning | Robotics | 9 | RL robotics · Embodied AI · Model serving |
| Waabi | Research Engineer, World Models | Robotics | 9 | Multimodal · Vision · Fine-tuning · Model serving |
| Wayve | Principal Machine Learning Engineer, App SW | Robotics | 9 | Embodied AI · Model serving · Evals · Synthetic data · Fine-tuning |
| Wayve | Staff ML Performance Engineer (Training Efficiency) | Robotics | 9 | Embodied AI |
| Agility Robotics | Director of AI | Robotics | 9 | Embodied AI · LLM observability · Model serving · Synthetic data · Frontier research |
| Waabi | Research Engineer, Neural Rendering | Robotics | 9 | Model serving · Synthetic data |
| Wayve | Machine Learning Engineer | Robotics | 9 | Embodied AI · Model serving · Evals · Synthetic data · Fine-tuning |
| Wayve | Tech Lead, Wayve Labs | Robotics | 9 | Embodied AI · Model serving · Multimodal |
| Figure AI | Senior Reinforcement Learning Engineer, Helix | Robotics | 9 | RL robotics · Embodied AI · Model serving |
| Nuro | Lead ML Research Scientist | Robotics | 9 | Multimodal · Model serving |
| Wayve | Principal Engineer, Model Development Platform | Robotics | 8 | Embodied AI · Model serving |
| Nuro | Software Engineer, ML Infrastructure, Optimization | Robotics | 8 | Model serving · Quantization |
| Wayve | Machine Learning Engineering Manager, App SW | Robotics | 8 | Embodied AI · Model serving |
| Wayve | Machine Learning Engineering Manager, App SW | Robotics | 8 | Embodied AI · Model serving |
| Wayve | Senior ML Performance Engineer (Inference Optimisation) | Robotics | 8 | Model serving · Quantization |
| Applied Intuition | Engineering Manager - ML, Self-Driving Systems | Robotics | 8 | Model serving · Fine-tuning · Evals |
| Applied Intuition | Technical Lead Manager - Perception, Self-Driving Systems | Robotics | 8 | Vision · Multimodal · Fine-tuning · Model serving |
| Waabi | Senior / Staff ML Training Optimization Engineer | Robotics | 8 | Quantization |
| Wayve | Staff Cloud SRE – AI/ML Platform & GPU Compute | Robotics | 8 | Model serving |
| Agility Robotics | Staff AI Engineer, Perception | Robotics | 8 | Vision · Model serving · Evals |
| Nuro | Senior Software Engineer – GenAI Infrastructure & Agent Systems for Engineering Efficiency | Robotics | 8 | Agent orchestration · Tool use · Model serving · LLM observability |
| Applied Intuition | Software Engineer - AI Engineering | Robotics | 8 | Agent orchestration · Tool use · Evals · Model serving · LLM observability |
| Agility Robotics | Senior Manager, AI Innovation | Robotics | 8 | Embodied AI · LLM observability · Model serving · Synthetic data |
| Applied Intuition | Engineering Manager - ML Platform and Infrastructure | Robotics | 8 | Model serving |
| Apptronik | Senior Autonomy Software Engineer | Robotics | 8 | Embodied AI · Agent orchestration · Multimodal · Model serving · Guardrails |