Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.
Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.
Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.
41 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| AT&T | Director Cybersecurity - AI/ML/Automation (Cyber Threat Analytics) | Telecom | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · Fine-tuning · Inference infra |
| Verizon | Engr III Cslt-AI Science | Telecom | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · Inference infra · LLM observability |
| Verizon | Princ Engr-AI Science | Telecom | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · Inference infra · LLM observability · Multimodal |
| Verizon | Engr III Cslt-AI Science | Telecom | 8 | Inference infra · Multimodal |
| Verizon | Director - AI/ML Engineering | Telecom | 8 | LLM observability · Agent orchestration · Tool use · Vector DB · Inference infra · Guardrails |
| Verizon | Sr Engr Cslt-Data Science | Telecom | 8 | Agent orchestration · Tool use · RAG · LLM observability · Fine-tuning |
| Verizon | Senior Data Scientist | Telecom | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning |
| Verizon | Principal Data Scientist | Telecom | 8 | Agent orchestration · LLM observability · RAG · Fine-tuning · Recommender systems · Search & ranking |
| Verizon | Sr Engr Cslt-AI/ML Engineering | Telecom | 8 | Inference infra · Agent orchestration |
| Verizon | Engr III Cslt-AI Science | Telecom | 8 | Agent orchestration · Inference infra |
| Verizon | Sr Engr Cslt-AI Science | Telecom | 8 | Agent orchestration · Agent research · Fine-tuning · Inference infra · LLM observability · RAG |
| T-Mobile | Sr Engineer, Machine Learning Engineering | Telecom | 8 | Agent orchestration · RAG · Fine-tuning · LLM observability · Multimodal · Evals |
| AT&T | Senior Data/AI Engineering | Telecom | 8 | RAG · Vector DB · Agent orchestration · LLM observability · Audio & speech |
| Verizon | Senior Engineering Consultant-Cloud & AI | Telecom | 8 | Agent orchestration · RAG · Tool use · LLM observability · Inference infra |
| AT&T | Lead Cybersecurity - Application Security Architect – AI Models, Frameworks & Implementation | Telecom | 8 | Agent orchestration · RAG · LLM observability · Guardrails |
| T-Mobile | Principal GenAI Software Engineer | Telecom | 8 | Agent orchestration · RAG · Guardrails · LLM observability |
| Verizon | Associate Director-AI Science | Telecom | 7 | |
| AT&T | Lead Data/AI Engineering | Telecom | 7 | |
| AT&T | Sr Specialist Cybersecurity - IAM Operations AIOps | Telecom | 7 | |
| T-Mobile | Sr Data Scientist | Telecom | 7 | Fine-tuning |
| AT&T | Director-Technology | Telecom | 7 | Agent orchestration |
| Verizon | Assoc Dir-AI Science | Telecom | 7 | Inference infra |
| T-Mobile | Senior Data Science Engineer | Telecom | 7 | Inference infra |
| T-Mobile | Sr Data Scientist | Telecom | 7 | |
| AT&T | Principal Data/AI Engineering | Telecom | 7 | Inference infra |
| AT&T | Lead Software Engineer | Telecom | 7 | RAG · LLM observability |
| Verizon | AI Go To Market Leader | Telecom | 7 | Agent orchestration · RAG · LLM observability · Guardrails |
| Verizon | Director of Digital Customer Experience & AI Innovation | Telecom | 7 | LLM observability · Agent orchestration · Guardrails · RAG · Vector DB · Fine-tuning · Recommender systems · Search & ranking · Interpretability · Synthetic data · Agent research · RL post-training · RLHF · Reward modeling · RL robotics · Embodied AI |
| T-Mobile | Sr Engineer, Enterprise AI | Telecom | 7 | Agent orchestration · RAG · LLM observability · Vector DB |
| AT&T | Lead System Engineer (AI Automation Engineer SRE Focus) | Telecom | 7 | Agent orchestration · LLM observability · Inference infra |