Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.
Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.
Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.
499 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| JPMorgan Chase | Applied AI ML Researcher Lead | Banking | 9 | Agent orchestration · Multi-agent · Agent research · Evals |
| JPMorgan Chase | Applied Machine Learning Scientist - Vice President | Banking | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Fine-tuning · Recommender systems · Multimodal · Agent research · RL post-training |
| JPMorgan Chase | Sr Director of Software Engineering - AI/ML platforms (Intelligent Agentic Systems, RAG, LLM Architectures) | Banking | 9 | Agent orchestration · RAG · Search & ranking · Fine-tuning |
| JPMorgan Chase | Applied AI ML Researcher Director | Banking | 9 | Agent orchestration · Agent research · Inference infra |
| Capital One | Applied Researcher II | Banking | 9 | Pretraining · Fine-tuning · Inference infra · Vector DB |
| Capital One | Distinguished Engineer | Banking | 9 | Inference infra · Quantization |
| Capital One | Principal Associate, Data Science - AI Foundations | Banking | 9 | Fine-tuning · Inference infra · Agent orchestration · RAG · Vector DB · LLM observability |
| Capital One | Sr. Distinguished AI Engineer (Agentic AI Platform) | Banking | 9 | Agent orchestration · RAG · Guardrails · LLM observability · Tool use · Vector DB |
| Capital One | Applied Researcher II (AI Foundations, LLM Core and Agentic AI) | Banking | 9 | Pretraining · Fine-tuning · RL post-training · Frontier research · Agent research · Agent orchestration · Vector DB · Recommender systems · Multimodal |
| Capital One | Applied Researcher II | Banking | 9 | Fine-tuning · Frontier research · Pretraining · RL post-training · Vector DB |
| JPMorgan Chase | Generative AI Executive Director | Banking | 9 | Agent orchestration · Multimodal · Fine-tuning · Inference infra |
| JPMorgan Chase | Applied AI ML Researcher Director | Banking | 9 | Agent orchestration · Agent research · Inference infra |
| JPMorgan Chase | Senior Lead Software Engineer- Java/Python/ AI Solutions | Banking | 9 | Agent orchestration · Tool use · LLM observability · RAG · Vector DB · Fine-tuning · Agent research |
| JPMorgan Chase | Generative AI - Vice President | Banking | 9 | Agent orchestration · LLM observability · Inference infra · Fine-tuning · Multimodal |
| JPMorgan Chase | AI Agents Applied Engineer - Senior Associate | Banking | 9 | Agent orchestration · Tool use · Fine-tuning · Inference infra · Guardrails · LLM observability · Recommender systems · Search & ranking · RL post-training |
| JPMorgan Chase | AI Agents Applied Research/Engineering Lead - Vice President | Banking | 9 | Agent orchestration · Tool use · Guardrails · Fine-tuning · Inference infra · Recommender systems · Search & ranking · RL post-training |
| Capital One | Applied Researcher I (AI Foundations, LLM Core and Agentic AI) | Banking | 9 | Fine-tuning · RL post-training · Frontier research · Vector DB |
| Capital One | Senior Manager, Data Scientist - Applied AI | Banking | 9 | Fine-tuning · Vector DB |
| Capital One | Applied Researcher II (AI Foundations, LLM Core and Agentic AI) | Banking | 9 | Fine-tuning · Frontier research · Inference infra · Pretraining · RL post-training · Vector DB |
| JPMorgan Chase | Machine Learning Scientist - Vice President | Banking | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Fine-tuning · Recommender systems · RL post-training · Multimodal |
| JPMorgan Chase | Applied AI ML Lead Researcher - Commercial and Investment Bank | Banking | 9 | Agent orchestration · Agent research · Frontier research · Inference infra |
| JPMorgan Chase | Applied AI/ML Director Researcher | Banking | 9 | Agent orchestration · Agent research · Frontier research · Inference infra |
| JPMorgan Chase | Generative AI Director | Banking | 9 | LLM observability · Agent orchestration · Tool use · Fine-tuning · Inference infra · Multimodal · Vision · Audio & speech |
| Capital One | Sr. Distinguished Applied Researcher | Banking | 9 | Pretraining · Fine-tuning · Inference infra · Vector DB · Frontier research |
| JPMorgan Chase | Applied AI ML Vice President | Banking | 8 | Agent orchestration · Evals · RAG |
| JPMorgan Chase | Applied AI ML Lead [Multiple Positions Available] | Banking | 8 | Agent orchestration · RAG · LLM observability · Evals · Fine-tuning |
| JPMorgan Chase | Senior AI Application Engineer - Vice President | Banking | 8 | LLM observability · Evals · Guardrails · Inference infra |
| Capital One | Senior Director, Software Engineering - AI | Banking | 8 | Agent orchestration · Inference infra · LLM observability |
| JPMorgan Chase | Risk Management & Compliance - Data Scientist Lead, Executive Director | Banking | 8 | Agent orchestration |
| Capital One | Senior Lead AI Engineer (GenAI Platform Services) | Banking | 8 | Fine-tuning · Inference infra · Guardrails · Vector DB · LLM observability · Evals |