Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale. Primary AI lifecycle stage: serving infrastructure.
4,890 active AI roles across 249 companies in our index reference Model serving as of today.
The companies with the most active Model serving listings are: Amazon (625 roles), NVIDIA (356 roles), Google (323 roles), Capital One (161 roles), JPMorgan Chase (156 roles).
Model serving primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Model serving roles concentrate at: agents (40%), serving infrastructure (35%).
The sectors with the most active Model serving hiring are: Big Tech, Enterprise, Semiconductors.
Infrastructure for deploying trained models in production: request batching, autoscaling, KV-cache management, and low-latency inference at scale.
Primary AI lifecycle stage: serving infrastructure.
As of today, 4,890 active AI roles across 249 companies in our index reference Model serving. Hiring concentrates at the agents (40%) and serving infrastructure (35%) stages. Most common sectors: Big Tech, Enterprise, Semiconductors.
20 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Qualtrics | Staff Data Scientist: Semantic Substrate Incubation | Seattle | 9 | Agent orchestration · Agent research · LLM observability · RAG · Vector DB |
| Smartsheet | Sr Principal Data Scientist | Seattle | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Fine-tuning · Recommender systems · LLM observability |
| Smartsheet | Senior Software Engineer II - Applied AI (Remote Eligible) | Seattle | 8 | RAG · Evals · LLM observability · Agent orchestration |
| Smartsheet | Senior Manager, Engineering - AI & Automation | Seattle | 8 | Agent orchestration · LLM observability · RAG · Inference infra |
| Amperity | Lead Machine Learning Engineer | Seattle | 8 | Inference infra |
| Amperity | Senior Machine Learning Engineer | Seattle | 7 | Inference infra |
| Qualtrics | Senior Software Engineer - Experience Agents | Seattle | 7 | Agent orchestration · Evals · LLM observability · RAG |
| Redfin | Machine Learning Developer | Seattle | 7 | Inference infra |
| Qualtrics | Senior Machine Learning Engineer | Seattle | 7 | Inference infra · Fine-tuning |
| Qualtrics | Machine Learning Engineer II | Seattle | 7 | Multimodal |
| Redfin | Remote Senior Applied Machine Learning Engineer - Applied Machine Learning Team | Seattle | 7 | Inference infra · Recommender systems |
| Redfin | Senior Software Engineer - Conversational Search | Seattle | 7 | Agent orchestration · RAG · LLM observability · Inference infra |
| Redfin | Software Developer II - Conversational Search | Seattle | 7 | Agent orchestration · RAG · LLM observability |
| Smartsheet | Manager, AI/ML Ops Engineering (Hybrid in Bangalore) | Seattle | 7 | Inference infra · LLM observability |
| Smartsheet | Senior AI/ML Ops Engineer (Hybrid in Bangalore) | Seattle | 7 | Inference infra · RAG · Vector DB · Fine-tuning |
| Smartsheet | Senior AI/ML Ops Engineer-II (Hybrid in Bangalore) | Seattle | 7 | Inference infra · RAG · Vector DB · Fine-tuning · LLM observability |
| Smartsheet | Sr. Machine Learning Operations Engineer | Seattle | 7 | Inference infra · LLM observability · Fine-tuning |
| Smartsheet | Senior Manager, Engineering - Observability Platform (Remote Eligible) | Seattle | 5 | LLM observability · Agent orchestration · Evals |
| Redfin | Software Engineer II - AI Tooling Platform | Seattle | 5 | Agent orchestration · Guardrails · LLM observability |
| Redfin | Senior Site Reliability Engineer | Seattle | 5 | Inference infra |