Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), and JPMorgan Chase (156 roles).
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, however, Model serving roles concentrate in the agents stage (40%), ahead of serving infrastructure (35%).
The sectors with the most active Model serving hiring are Big Tech, Enterprise, and Semiconductors.
97 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Honeywell | Lead AI Engr | Industrial | 9 | Agent orchestration · Agent research · Fine-tuning · Inference infra · RAG · Vector DB · Multimodal · Guardrails · LLM observability |
| Cognite | Principal ML Engineer | Industrial | 8 | Inference infra · Vision · Agent orchestration · RAG · Multimodal |
| Caterpillar | Agentic AI / AI Ops Engineer – Platform Engineering | Industrial | 8 | Agent orchestration · Tool use · LLM observability · Inference infra |
| Cognite | Senior Machine Learning Engineer | Industrial | 8 | Vision · Multimodal · Inference infra · RAG · Vector DB · Fine-tuning · LLM observability · Agent orchestration |
| Honeywell | Sr IT Engineer | Industrial | 8 | Agent orchestration · Agent research · RAG · LLM observability |
| Caterpillar | Lead Architect – Digital Twin & AI Factory | Industrial | 8 | Synthetic data · Inference infra |
| Honeywell | Software Engr II | Industrial | 8 | Agent orchestration · RAG · Inference infra · LLM observability |
| Caterpillar | Lead Data Scientist - Gen AI & Digital Twin | Industrial | 8 | RAG · Fine-tuning · Inference infra |
| Honeywell | Sr. Director Data & AI Platforms | Industrial | 8 | Agent orchestration · Inference infra · RAG · Vector DB · Guardrails |
| Honeywell | Principal AI Engr | Industrial | 8 | Fine-tuning · RAG · Vector DB |
| Honeywell | Sr Advanced AI Platform Engineer | Industrial | 8 | Agent orchestration · RAG · Inference infra · LLM observability |
| Honeywell | Advanced AI Engineer | Industrial | 8 | Agent orchestration · Agent research · RAG · LLM observability · Inference infra · Fine-tuning |
| Caterpillar | Principal AI Engineer | Industrial | 8 | Agent orchestration · Fine-tuning |
| Honeywell | Sr Advanced Software Engr | Industrial | 8 | Inference infra |
| Honeywell | Sr. Advanced AI Software Engineer | Industrial | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning |
| Honeywell | Sr Advanced AI Engr | Industrial | 8 | Agent orchestration · Tool use · Multimodal · Fine-tuning · LLM observability |
| Honeywell | Sr Advanced Software Engr | Industrial | 8 | Inference infra · RAG |
| Honeywell | Software Engr II | Industrial | 8 | Agent orchestration · RAG · Inference infra · LLM observability |
| Cognite | Senior AI Platform Engineer, Atlas AI | Industrial | 8 | Agent orchestration · Tool use · LLM observability · Inference infra · Evals |
| John Deere | SR Data Scientist - Global Team - Indaiatuba/SP | Industrial | 7 | |
| Cognite | Machine Learning Engineer | Industrial | 7 | Fine-tuning · Inference infra · RAG · Vector DB · Agent orchestration |
| Honeywell | Advanced Data Scientist | Industrial | 7 | Inference infra |
| Caterpillar | Analyst Applications – ServiceNow Conversational & GenAI | Industrial | 7 | Agent orchestration · LLM observability · Guardrails · RAG · Fine-tuning · Evals |
| Caterpillar | Back Office Engineering Manager | Industrial | 7 | Agent orchestration |
| Honeywell | Senior Advanced Application Engineer - APM | Industrial | 7 | |
| Caterpillar | Senior Manager, Internal Enterprise Analytics & AI Experience Gateway | Industrial | 7 | Inference infra · Agent orchestration · Tool use · RAG · LLM observability |
| Caterpillar | Principal Digital Product Manager, Applied AI | Industrial | 7 | Agent orchestration · LLM observability |
| Caterpillar | Senior Manager - Connectivity Data Analytics | Industrial | 7 | Agent orchestration · Evals · Guardrails · LLM observability · Fine-tuning · Recommender systems · Multimodal |
| Caterpillar | Principal Digital Architect (Autonomy) | Industrial | 7 | Vision · Multimodal · Agent orchestration · Tool use · Inference infra |
| Caterpillar | Senior Analytics Manager - AI Model & Prompt Engineering | Industrial | 7 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Fine-tuning · Multimodal · Agent research |