Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
195 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Roblox | Distinguished Machine Learning Engineer - Safety | Consumer | 9 | Model serving · Vision |
| Airbnb | Senior Staff Machine Learning Engineer, Post Training | Consumer | 9 | Fine-tuning · Model serving · LLM observability · Guardrails · Multimodal |
| Master's Fall Machine Learning Internship (ATG - Visual Search) | Consumer | 9 | Agent orchestration · Tool use · Model serving · Multimodal · LLM observability | |
| Airbnb | Senior Staff Machine Learning Engineer, Growth Platform Engineering | Consumer | 9 | Agent orchestration · Model serving |
| DoorDash | Senior/Staff Deep Reinforcement Learning Engineer | Consumer | 9 | RL robotics · Embodied AI · Agent orchestration · Model serving |
| Roblox | Principal/Senior Machine Learning Scientist - Search and Discovery | Consumer | 9 | Agent orchestration · Recommender systems · Multimodal · Vision · Model serving |
| Roblox | Principal Machine Learning Engineer, Embodied AI and Smart NPCs | Consumer | 9 | Embodied AI · Agent orchestration · Agent research · RL robotics · Model serving |
| Zillow | Principal Machine Learning Engineer, Agentic AI | Consumer | 9 | Agent orchestration · Multimodal · Agent research · Model serving · Audio & speech |
| Instacart | Machine Learning Engineer, PhD Intern | Consumer | 9 | LLM observability · RAG · Fine-tuning · Model serving · Recommender systems · Search & ranking · Agent research · Evals |
| Staff Machine Learning Engineer, ML Efficiency | Consumer | 8 | Model serving · Training infra | |
| Senior Machine Learning Engineer, Ads Foundational Representations | Consumer | 8 | Fine-tuning · Multimodal · LLM observability · Recommender systems · Search & ranking · Model serving | |
| DoorDash | Software Engineer, Machine Learning Infrastructure - Gen AI | Consumer | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving |
| Principal Engineer, AI Platform | Consumer | 8 | Model serving · Recommender systems | |
| Staff Machine Learning Systems Engineer, Embeddings Platform | Consumer | 8 | Recommender systems · Search & ranking · Model serving | |
| Instacart | Senior Machine Learning Engineer, Search & Recommendations | Consumer | 8 | Recommender systems · Search & ranking · Model serving |
| Spotify | Senior Machine Learning Engineer - Content Intelligence | Consumer | 8 | Model serving · LLM observability · RAG · Vector DB · Fine-tuning · Agent orchestration · Multimodal · Evals |
| Snap | Machine Learning Engineer, Generative ML , Level 5 | Consumer | 8 | Audio & speech · Multimodal · Model serving |
| Chegg | Machine Learning/ AI Engineer | Consumer | 8 | Agent orchestration · Tool use · RAG · LLM observability · Model serving · Recommender systems |
| Instacart | Senior Engineering Manager, Search | Consumer | 8 | Search & ranking · Recommender systems · LLM observability · Model serving · RAG · Vector DB |
| Zillow | Machine Learning Engineer | Consumer | 8 | Model serving · Fine-tuning · Vision |
| Staff Machine Learning Engineer | Consumer | 8 | Agent orchestration · Recommender systems · Search & ranking · Vector DB · Model serving | |
| Roblox | Director of Engineering, Economy ML | Consumer | 8 | Recommender systems · Search & ranking · Model serving · Multimodal |
| DoorDash | Staff Machine Learning Engineer, Fulfillment Planning | Consumer | 8 | Model serving · Recommender systems |
| Airbnb | Principal Machine Learning Engineer- LLM Fine-tuning and Optimization | Consumer | 8 | Fine-tuning · Model serving · LLM observability · Multimodal · Agent orchestration · Evals · Guardrails |
| Roblox | Senior Machine Learning Engineer, GenAI Data | Consumer | 8 | Synthetic data · Multimodal · Evals · Model serving |
| Zillow | Principal Machine Learning Engineer | Consumer | 8 | Agent orchestration · Tool use · Evals · RAG · Vector DB · Model serving |
| Superhuman | Engineering Manager, Go | Consumer | 8 | Agent orchestration · Model serving · LLM observability |
| Airbnb | Senior Staff Machine Learning Engineer, Infrastructure | Consumer | 8 | Model serving · RAG · Agent orchestration · LLM observability |
| Zillow | Principal Software Engineer, Applied AI Services | Consumer | 8 | Model serving · Evals · Guardrails · Agent orchestration |
| Lime | Senior MLOps & Data Systems Engineer | Consumer | 8 | Model serving · Vision · Evals · Fine-tuning |