Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving.
The companies with the most active Model serving listings are Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), and JPMorgan Chase (156 roles).
Model serving belongs primarily to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate in two stages: agents (40%) and serving infrastructure (35%).
The sectors with the most active Model serving hiring are Big Tech, Enterprise, and Semiconductors.
1 AI role tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| UPS | UPS Digital Senior Machine Learning Engineer | Logistics | 8 | Agent orchestration · Tool use · Inference infra · Fine-tuning |