Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.
Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.
Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.
96 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Disney | Lead Machine Learning Engineer | Media | 9 | Agent orchestration · Agent research · Multimodal · RAG · LLM observability · Evals · Guardrails · Inference infra |
| Disney | Sr Staff R&D Engineer | Media | 9 | Audio & speech · Fine-tuning · Multimodal |
| Warner Bros Discovery | Manager, Machine Learning Engineering | Media | 8 | Evals |
| Comcast | Principal Machine Learning Engineer | Media | 8 | Recommender systems · Search & ranking · Fine-tuning · Agent orchestration · Tool use · Inference infra |
| Disney | Lead Product Manager, AI Platform | Media | 8 | Agent orchestration · RAG · Evals · LLM observability |
| Warner Bros Discovery | Principal Data Scientist | Media | 8 | Recommender systems · Vision |
| Disney | Director, Decision Science AI/ML Engineering & Ops | Media | 8 | Inference infra · LLM observability · Guardrails · Evals |
| Comcast | Engineer 4 - Machine Learning | Media | 8 | Agent orchestration · LLM observability · Fine-tuning · Evals · Guardrails |
| Disney | Manager - Applied AI | Media | 8 | Agent orchestration · Tool use · Evals · RAG · LLM observability |
| Disney | Director, Decision Science Technology | Media | 8 | Agent orchestration |
| Disney | Sr Software Engineer | Media | 8 | Agent orchestration · Tool use · LLM observability · RAG · Fine-tuning · Inference infra |
| Disney | Staff GenAI/ML Engineer (Emerging Tech & AI Automation) Project Hire | Media | 8 | Agent orchestration · RAG · Fine-tuning · Vector DB · LLM observability · Evals |
| Comcast | Engineer 3 - Machine Learning | Media | 8 | Agent orchestration · LLM observability · Inference infra · Guardrails |
| Comcast | Engineer 3 - Machine Learning | Media | 8 | Agent orchestration · LLM observability · Inference infra · Guardrails |
| Comcast | Engineer 2 - Machine Learning | Media | 8 | Agent orchestration · Tool use · LLM observability · Inference infra · Agent research |
| Disney | Software Engineer II | Media | 8 | Agent orchestration · Tool use · LLM observability · Fine-tuning |
| Disney | Sr Data Scientist | Media | 8 | Multimodal · Fine-tuning · RAG · Inference infra · Evals · Vector DB |
| Comcast | Software Engineering Manager, AI Agents | Media | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability |
| Disney | Staff GenAI/ML Engineer (Emerging Tech & AI Automation) Project Hire | Media | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability · Evals |
| Comcast | Machine Learning Engineer 4 | Media | 8 | Agent orchestration · LLM observability · Evals · Guardrails · Inference infra |
| Warner Bros Discovery | Sr. Staff, Data Science & Applied AI | Media | 8 | Agent orchestration · RAG · Evals · Guardrails · LLM observability |
| Disney | Sr Machine Learning Engineer | Media | 8 | Inference infra · Forecasting · LLM observability |
| Warner Bros Discovery | Sr. Staff, Data Science & Applied AI | Media | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Inference infra |
| Comcast | Machine Learning Engineer (GoLang) | Media | 8 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Multimodal · Vision · Audio & speech |
| Disney | Senior Machine Learning Engineer, Ad Platforms | Media | 8 | Agent orchestration · Multimodal · Fine-tuning · Evals · Audio & speech |
| Disney | Lead Machine Learning Engineer, Ads Research | Media | 8 | Agent orchestration · Multimodal · Fine-tuning · Audio & speech |
| Disney | Senior Product Manager II- Commerce and Personalization | Media | 7 | Recommender systems |
| Disney | Sr Software Engineer | Media | 7 | RAG · Vector DB · Fine-tuning · LLM observability |
| The Trade Desk | Staff Applied Scientist | Media | 7 | Forecasting |
| The Trade Desk | Staff Applied Scientist | Media | 7 | Forecasting |