Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), and JPMorgan Chase (156 roles).
In current hiring, Model serving roles concentrate in two AI lifecycle stages: agents (40%) and serving infrastructure (35%).
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.
56 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| MetLife | VP, AI Innovation | Insurance | 9 | Agent orchestration · Multimodal · Forecasting · Recommender systems · Search & ranking |
| State Farm | REMOTE - AI Engineering Manager (Databricks) | Insurance | 8 | Agent orchestration · Tool use · Evals · LLM observability |
| GEICO | Staff Machine Learning Engineer | Insurance | 8 | Agent orchestration · Agent research · LLM observability · RAG · Fine-tuning · Inference infra |
| GEICO | Staff Applied Research Scientist | Insurance | 8 | Recommender systems · LLM observability · Guardrails · RAG · Fine-tuning |
| State Farm | Lead Data Scientist - Gen AI | Insurance | 8 | Agent orchestration · Fine-tuning · Vision |
| GEICO | Distinguished Engineer, Applied AI | Insurance | 8 | Agent orchestration · LLM observability · Inference infra |
| Allstate | AI Engineer Lead | Insurance | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Fine-tuning · Guardrails · Agent research |
| MetLife | Principal Architect (AI, Cloud & Azure) | Insurance | 8 | Agent orchestration · RAG · Fine-tuning · LLM observability · Vector DB |
| GEICO | Senior Staff Machine Learning Engineer | Insurance | 8 | Agent orchestration · Tool use · Fine-tuning |
| GEICO | Staff Machine Learning Engineer | Insurance | 8 | Agent orchestration · Tool use · Inference infra · LLM observability |
| GEICO | Senior Staff Machine Learning Engineer | Insurance | 8 | Agent orchestration · Tool use · Fine-tuning |
| GEICO | Distinguished Engineer, AI Applications | Insurance | 8 | Agent orchestration · RAG · LLM observability · Evals |
| GEICO | Senior Staff Machine Learning Engineer, AI Agent Platform | Insurance | 8 | Agent orchestration · Agent research · Fine-tuning · Inference infra · RAG · Guardrails · LLM observability · Evals · Tool use |
| Allstate | Senior Data Scientist - Agentic AI | Insurance | 8 | Agent orchestration · LLM observability |
| GEICO | Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · RAG · Vector DB · LLM observability |
| Premera Blue Cross | AI Engineer III | Insurance | 8 | Inference infra · RAG · Guardrails · LLM observability |
| MetLife | Lead Data Scientist | Insurance | 8 | Inference infra · Fine-tuning · RAG · Vector DB |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Inference infra · Guardrails · LLM observability |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Inference infra · Guardrails · LLM observability |
| Premera Blue Cross | AI Engineer IV | Insurance | 8 | Agent orchestration · RAG · Evals · Multi-agent |
| GEICO | Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability · Inference infra |
| GEICO | Senior Staff Engineer – Agentic AI & Enterprise Productivity | Insurance | 7 | Agent orchestration · Tool use |
| GEICO | Director, Machine Learning Engineering | Insurance | 7 | RAG · Agent orchestration · Inference infra · LLM observability |
| GEICO | Senior Machine Learning Engineer | Insurance | 7 | Inference infra · RAG · LLM observability · Guardrails · Evals |
| MetLife | Senior AI Scientist | Insurance | 7 | LLM observability · RAG · Fine-tuning |
| GEICO | Machine Learning Engineer II | Insurance | 7 | Inference infra · Fine-tuning |
| GEICO | Senior Director of Product Management, Enterprise Experience | Insurance | 7 | Agent orchestration · LLM observability · RAG · Vector DB · Search & ranking · Copilot |
| Allstate | Senior AI Engineer | Insurance | 7 | Agent orchestration · LLM observability · RAG · Fine-tuning |
| Allstate | Senior AI Software Engineer | Insurance | 7 | Fine-tuning · RAG · Vector DB |
| Allstate | Applied Machine Learning Engineer (All Levels) | Insurance | 7 | Inference infra · Evals · Interpretability |