Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.
Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.
Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.
267 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Plaid | Machine Learning Engineer (Research Scientist) - DFAI | Fintech | 9 | Pretraining · Fine-tuning · Inference infra · LLM observability |
| Plaid | Senior Machine Learning Engineer (Research Scientist) - DFAI | Fintech | 9 | Pretraining · Fine-tuning · Inference infra · LLM observability |
| Plaid | Staff Machine Learning Engineer (Research Scientist) - DFAI | Fintech | 9 | Pretraining · Fine-tuning |
| Visa | Senior AI Engineer | Fintech | 9 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Fine-tuning · Inference infra · Frontier research · Interpretability · RL post-training · Agent research · Multimodal |
| Visa | Data Science Manager | Fintech | 9 | Agent orchestration · Fine-tuning · Guardrails · Tool use |
| Visa | Head of Generative AI Research | Fintech | 9 | Frontier research · Pretraining · Multimodal · Agent research · Agent orchestration · LLM observability |
| Upstart | Principal Engineer, LLM | Fintech | 9 | Inference infra · RAG · Vector DB · Evals · LLM observability |
| Plaid | Senior Machine Learning Engineer (Research Scientist) - Data Foundation & AI | Fintech | 9 | Pretraining · Fine-tuning · Inference infra · Frontier research |
| PayPal | Principal, Agentic AI | Fintech | 9 | Agent orchestration · Tool use |
| Gusto | Sr. Staff AI/ML Engineer | Fintech | 9 | Agent orchestration · RAG · Evals · LLM observability · Guardrails |
| PitchBook | Manager, Engineering, AI & ML | Fintech | 8 | LLM observability · Fine-tuning · RAG · Vector DB |
| Mastercard | Senior Software Engineer - Backend/Platform Agentic AI | Fintech | 8 | Agent orchestration · Tool use · RAG · LLM observability · Guardrails · Inference infra |
| Mastercard | Software Engineer II - Backend/Platform Agentic AI | Fintech | 8 | Agent orchestration · Inference infra · RAG |
| Plaid | Staff Software Engineer - Instant Access | Fintech | 8 | Agent orchestration · LLM observability · Evals · Guardrails · Code gen |
| Visa | Manager, Visa Consulting and Analytics (VCA) — Senior AI Engineer, Tech Practice | Fintech | 8 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability |
| Stripe | Engineering Manager, AI Conversation Platform | Fintech | 8 | RAG · Fine-tuning · Agent orchestration · LLM observability |
| PitchBook | Sr. Machine Learning Engineer | Fintech | 8 | Semantic search |
| PitchBook | Machine Learning Engineer | Fintech | 8 | LLM observability · RAG · Fine-tuning |
| Gusto | Staff Software Engineer, AI Developer Tools | Fintech | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Fine-tuning |
| Block | Staff Applied Machine Learning Engineer - Fraud & Abuse | Fintech | 8 | Inference infra · Agent orchestration · Evals |
| Block | Senior ML/AI Modeler, Risk Automation Machine Learning | Fintech | 8 | Agent orchestration · Guardrails · Fine-tuning · LLM observability |
| PayPal | Lead Product Manager | Fintech | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability · Guardrails |
| Block | Staff Machine Learning Engineer (Modeling), Support | Fintech | 8 | Agent orchestration · RAG · Fine-tuning · Inference infra · Recommender systems |
| Stripe | Staff Software Engineer, Machine Learning Platform | Fintech | 8 | Inference infra · Agent orchestration · LLM observability |
| Robinhood | Senior Software Engineer | Fintech | 8 | Inference infra · Fine-tuning · Evals |
| Visa | Lead Solutions Architect - GenAI | Fintech | 8 | Vector DB |
| Gusto | Head of ML/AI Engineering | Fintech | 8 | LLM observability · Guardrails |
| Visa | Manager, Intelligence & Data Solutions (IDS), Data Science | Fintech | 8 | Agent orchestration · Tool use · RAG · LLM observability |
| Visa | Software Engineer, Sr. Consultant Level (11-15 years exp, Java-Python-AWS-GenAI) | Fintech | 8 | Agent orchestration · RAG · Vector DB · LLM observability · Guardrails · Inference infra |
| Ripple | Staff Software Engineer, GenAI Platform | Fintech | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Inference infra |