Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.
Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.
Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.
413 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Airbnb | Senior Machine Learning Engineer, Customer Support Engineering | Consumer | 9 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Fine-tuning · RLHF · Agent research |
| Roblox | Distinguished Machine Learning Engineer - Safety | Consumer | 9 | Inference infra · Vision |
| Director, Machine Learning Engineering – Content & User Understanding | Consumer | 9 | Vision · Multimodal | |
| Whoop | Senior AI Researcher (Foundation AI) | Consumer | 9 | Frontier research · Multimodal · Pretraining · Fine-tuning |
| Spotify | Machine Learning Engineer - Personalization, Horizon | Consumer | 9 | Agent orchestration · LLM observability · Fine-tuning · Recommender systems |
| Airbnb | Senior Staff Machine Learning Engineer, Post Training | Consumer | 9 | Fine-tuning · Inference infra · LLM observability · Guardrails · Multimodal |
| Senior Machine Learning Engineer, GenAI Security | Consumer | 9 | Agent orchestration · Tool use · Evals · Guardrails · Fine-tuning | |
| Zillow | Senior Machine Learning Engineer | Consumer | 9 | Agent orchestration · Multimodal · Evals · Guardrails · LLM observability |
| Master's Fall Machine Learning Internship (ATG - Visual Search) | Consumer | 9 | Agent orchestration · Tool use · Inference infra · Multimodal · LLM observability | |
| Airbnb | Machine Learning Engineer, Customer Support Engineering | Consumer | 9 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Fine-tuning · RL post-training · Agent research |
| Airbnb | Senior Staff Machine Learning Engineer, Growth Platform Engineering | Consumer | 9 | Agent orchestration · Inference infra |
| DoorDash | Senior/Staff Deep Reinforcement Learning Engineer | Consumer | 9 | RL robotics · Embodied AI · Agent orchestration · Inference infra |
| Zillow | Principal Machine Learning Engineer, Agentic AI | Consumer | 9 | Agent orchestration · Multimodal · Evals · Guardrails · LLM observability · Agent research |
| Machine Learning Engineer II, Computer Vision Applied Science | Consumer | 9 | Vision · Multimodal · Fine-tuning · RLHF · Evals | |
| Roblox | Senior Machine Learning Engineering Manager | Consumer | 9 | Multimodal · Vision · LLM observability · Fine-tuning |
| Uber | Sr Staff Agentic Systems Engineer | Consumer | 9 | Agent orchestration · Agent research · Tool use · LLM observability |
| Roblox | Principal Machine Learning Engineer, Engineering Acceleration | Consumer | 9 | Agent orchestration · Agent research · Synthetic data · Evals · Fine-tuning · Code gen |
| Roblox | Principal/Senior Machine Learning Scientist - Search and Discovery | Consumer | 9 | Agent orchestration · Recommender systems · Multimodal · Vision · Inference infra |
| Roblox | Principal Machine Learning Engineer, Embodied AI and Smart NPCs | Consumer | 9 | Embodied AI · Agent orchestration · Agent research · RL robotics · Inference infra |
| Uber | Senior Staff Machine Learning Engineer – Moonshot AI | Consumer | 9 | Multimodal · Vision · Audio & speech · LLM observability · Evals · Fine-tuning · RAG · Recommender systems |
| Zillow | Principal Machine Learning Engineer, Agentic AI | Consumer | 9 | Agent orchestration · Multimodal · Agent research · Inference infra · Audio & speech |
| Roblox | Director of Engineering - AI for Roblox Studio | Consumer | 9 | Agent orchestration · Agent research |
| Uber | Principal Machine Learning Engineer - AV Labs | Consumer | 9 | Multimodal · Evals |
| Uber | Staff ML Engineer, Generative AI | Consumer | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Fine-tuning · Multimodal · Audio & speech |
| Roblox | [2026] Senior Machine Learning Engineer, Natural Language Processing - PhD Early Career | Consumer | 9 | Fine-tuning · Multimodal |
| Roblox | [2026] Senior Machine Learning Engineer, Multimodal AI, Computer Vision and Graphics - PhD Early Career | Consumer | 9 | Vision · Multimodal · Fine-tuning |
| Roblox | [2026] Applied Scientist - PhD Intern | Consumer | 9 | Multimodal · Agent research |
| Instacart | Machine Learning Engineer, PhD Intern | Consumer | 9 | LLM observability · RAG · Fine-tuning · Inference infra · Recommender systems · Search & ranking · Agent research · Evals |
| Staff Machine Learning Engineer, ML Efficiency | Consumer | 8 | Inference infra · Training infra | |
| Senior Machine Learning Engineer, Ads Foundational Representations | Consumer | 8 | Fine-tuning · Multimodal · LLM observability · Recommender systems · Search & ranking · Inference infra |