Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.
Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.
The companies with the most active Model serving listings are Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), and JPMorgan Chase (156 roles).
108 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Johnson & Johnson | JJT Intern | Pharma | 9 | Fine-tuning · Evals |
| Iambic | Machine Learning Scientist — Large multimodal models | Pharma | 9 | Multimodal · Fine-tuning · Inference infra · Frontier research |
| Eli Lilly | Advisor - Agent Research | Pharma | 9 | Agent orchestration · Tool use · Inference infra · RAG · Fine-tuning |
| Iambic | Machine Learning Scientist – Clinical Prediction | Pharma | 8 | Fine-tuning · Multimodal · Evals |
| Johnson & Johnson | Sr Director Head of Data Science Services | Pharma | 8 | LLM observability |
| Johnson & Johnson | Sr. Manager - AI Solutions Lead | Pharma | 8 | Agent orchestration · RAG · LLM observability · Evals · Guardrails · Tool use |
| Johnson & Johnson | Dir Tech Prod Mgmt | Pharma | 8 | Agent orchestration |
| Pfizer | Director, AI Solutions Expert—Agent Developer, AIA | Pharma | 8 | Agent orchestration · RAG · Vector DB · LLM observability |
| Pfizer | PharmSci Agentic and Generative AI Architect | Pharma | 8 | Agent orchestration · Tool use · RAG · Evals |
| Johnson & Johnson | Director, Data Science - DDSAI - Therapeutics Discovery | Pharma | 8 | Agent orchestration · Fine-tuning |
| Johnson & Johnson | Data Scientist- Clinical Decision Support- LLMs | Pharma | 8 | LLM observability · RAG · Vector DB · Fine-tuning · Agent orchestration |
| Johnson & Johnson | Director, MedTech Technology AI Orchestration and Enablement | Pharma | 8 | Agent orchestration · Inference infra · Guardrails · LLM observability |
| Johnson & Johnson | VP, Data & Digital Quality | Pharma | 8 | Agent orchestration · Guardrails · Fine-tuning |
| Johnson & Johnson | Lead, Technology Product Management - AI, Data & Insights | Pharma | 8 | Agent orchestration · RAG · LLM observability · Fine-tuning |
| Merck | Senior Scientist, Agentic AI and Machine Learning (PDMB) | Pharma | 8 | Agent orchestration · Tool use · LLM observability · Fine-tuning · RAG · Multimodal |
| Pfizer | Director, AI Engineering--Clinical Development and Operations (CD&O) | Pharma | 8 | Agent orchestration · LLM observability · RAG · Fine-tuning · Inference infra |
| Johnson & Johnson | Senior Machine Learning Engineer - Robotics | Pharma | 8 | Embodied AI · Multimodal · Inference infra · Fine-tuning |
| Pfizer | Sr. Manager, AI Solutions Expert—Agent Developer | Pharma | 8 | Agent orchestration · Vector DB · RAG · LLM observability |
| Pfizer | Senior Director, Applied Intelligence | Pharma | 8 | RAG · Fine-tuning · Guardrails · LLM observability |
| Merck | Senior Specialist, Digital Reg Docs Tech Lead | Pharma | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability |
| Eli Lilly | Associate Director - AI Engineering | Pharma | 8 | Agent orchestration · Tool use · RAG · LLM observability · Inference infra · Guardrails |
| Merck | Senior AI Developer | Pharma | 8 | RAG · LLM observability · Fine-tuning · Vector DB |
| Eli Lilly | AI Architect – Lilly Medicine Foundry (R5-6) | Pharma | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability · Guardrails |
| Johnson & Johnson | Senior Scientist, Computer Vision | Pharma | 8 | Vision · Multimodal · Inference infra |
| Johnson & Johnson | Principal AI Lead – Surgical AI | Pharma | 8 | Agent orchestration |
| Eli Lilly | Sr. Principal or Engineering Advisor - Agentic Lab Automation Integration | Pharma | 8 | Agent orchestration · Tool use |
| Eli Lilly | Associate Vice President - Applied Intelligence for Discovery (AI4D) | Pharma | 8 | |
| Eli Lilly | Director, Discovery Bioinformatics Oncology | Pharma | 8 | Multimodal · Agent orchestration · Fine-tuning · RAG · Vector DB |
| Eli Lilly | Scientific Lead - Forward Deployed AI Engineer, Applied Intelligence for Discovery | Pharma | 8 | Agent orchestration · RAG · LLM observability · Fine-tuning |
| Johnson & Johnson | Design Leader, Polyphonic AI Lab | Pharma | 7 | Inference infra · Multimodal |