Which companies are hiring for Model serving roles?

The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).

What AI lifecycle stage does Model serving belong to?

Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).

What sectors invest most in Model serving?

The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.

← Tag co-occurrence network

Model serving

Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.

Primary AI lifecycle stage: serving infrastructure.

As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.

Top hiring:

Function

All Engineering · 7314 Research · 444 Product · 285

Status

All Active only

Sort

AI score Recently posted Company A–Z

FilteredsectorFintech×

267 AI roles tagged model_serving.

Company	Title	Sector	AI score	Other tags
Plaid	Machine Learning Engineer (Research Scientist) - DFAI	Fintech	9	Pretraining · Fine-tuning · Inference infra · LLM observability
Plaid	Senior Machine Learning Engineer (Research Scientist) - DFAI	Fintech	9	Pretraining · Fine-tuning · Inference infra · LLM observability
Plaid	Staff Machine Learning Engineer (Research Scientist) - DFAI	Fintech	9	Pretraining · Fine-tuning
Visa	Senior AI Engineer	Fintech	9	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Fine-tuning · Inference infra · Frontier research · Interpretability · RL post-training · Agent research · Multimodal
Visa	Data Science Manager	Fintech	9	Agent orchestration · Fine-tuning · Guardrails · Tool use
Visa	Head of Generative AI Research	Fintech	9	Frontier research · Pretraining · Multimodal · Agent research · Agent orchestration · LLM observability
Upstart	Principal Engineer, LLM	Fintech	9	Inference infra · RAG · Vector DB · Evals · LLM observability
Plaid	Senior Machine Learning Engineer (Research Scientist) - Data Foundation & AI	Fintech	9	Pretraining · Fine-tuning · Inference infra · Frontier research
PayPal	Principal, Agentic AI	Fintech	9	Agent orchestration · Tool use
Gusto	Sr. Staff AI/ML Engineer	Fintech	9	Agent orchestration · RAG · Evals · LLM observability · Guardrails
PitchBook	Manager, Engineering, AI & ML	Fintech	8	LLM observability · Fine-tuning · RAG · Vector DB
Mastercard	Senior Software Engineer - Backend/Platform Agentic AI	Fintech	8	Agent orchestration · Tool use · RAG · LLM observability · Guardrails · Inference infra
Mastercard	Software Engineer II - Backend/Platform Agentic AI	Fintech	8	Agent orchestration · Inference infra · RAG
Plaid	Staff Software Engineer - Instant Access	Fintech	8	Agent orchestration · LLM observability · Evals · Guardrails · Code gen
Visa	Manager, Visa Consulting and Analytics (VCA) — Senior AI Engineer, Tech Practice	Fintech	8	Agent orchestration · Tool use · RAG · Vector DB · LLM observability
Stripe	Engineering Manager, AI Conversation Platform	Fintech	8	RAG · Fine-tuning · Agent orchestration · LLM observability
PitchBook	Sr. Machine Learning Engineer	Fintech	8	Semantic search
PitchBook	Machine Learning Engineer	Fintech	8	LLM observability · RAG · Fine-tuning
Gusto	Staff Software Engineer, AI Developer Tools	Fintech	8	Agent orchestration · RAG · LLM observability · Guardrails · Fine-tuning
Block	Staff Applied Machine Learning Engineer - Fraud & Abuse	Fintech	8	Inference infra · Agent orchestration · Evals
Block	Senior ML/AI Modeler, Risk Automation Machine Learning	Fintech	8	Agent orchestration · Guardrails · Fine-tuning · LLM observability
PayPal	Lead Product Manager	Fintech	8	Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability · Guardrails
Block	Staff Machine Learning Engineer (Modeling), Support	Fintech	8	Agent orchestration · RAG · Fine-tuning · Inference infra · Recommender systems
Stripe	Staff Software Engineer, Machine Learning Platform	Fintech	8	Inference infra · Agent orchestration · LLM observability
Robinhood	Senior Software Engineer	Fintech	8	Inference infra · Fine-tuning · Evals
Visa	Lead Solutions Architect - GenAI	Fintech	8	Vector DB
Gusto	Head of ML/AI Engineering	Fintech	8	LLM observability · Guardrails
Visa	Manager, Intelligence & Data Solutions (IDS), Data Science	Fintech	8	Agent orchestration · Tool use · RAG · LLM observability
Visa	Software Engineer, Sr. Consultant Level (11-15 years exp, Java-Python-AWS-GenAI)	Fintech	8	Agent orchestration · RAG · Vector DB · LLM observability · Guardrails · Inference infra
Ripple	Staff Software Engineer, GenAI Platform	Fintech	8	Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Inference infra

Frequently asked questions

What is Model serving in AI?
Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
How many AI roles reference Model serving right now?
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
Which companies are hiring for Model serving roles?
The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).
What AI lifecycle stage does Model serving belong to?
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).
What sectors invest most in Model serving?
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.