Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
34 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| GEICO | Staff Machine Learning Engineer | Insurance | 8 | Agent orchestration · Agent research · LLM observability · RAG · Fine-tuning · Model serving |
| GEICO | Distinguished Engineer, Applied AI | Insurance | 8 | Agent orchestration · LLM observability · Model serving |
| GEICO | Staff Machine Learning Engineer | Insurance | 8 | Agent orchestration · Tool use · Model serving · LLM observability |
| GEICO | Senior Staff Machine Learning Engineer, AI Agent Platform | Insurance | 8 | Agent orchestration · Agent research · Fine-tuning · Model serving · RAG · Guardrails · LLM observability · Evals · Tool use |
| Premera Blue Cross | AI Engineer III | Insurance | 8 | Model serving · RAG · Guardrails · LLM observability |
| MetLife | Lead Data Scientist | Insurance | 8 | Model serving · Fine-tuning · RAG · Vector DB |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Model serving · Guardrails · LLM observability |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Model serving · Guardrails · LLM observability |
| GEICO | Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability · Model serving |
| GEICO | Director, Machine Learning Engineering | Insurance | 7 | RAG · Agent orchestration · Model serving · LLM observability |
| GEICO | Senior Machine Learning Engineer | Insurance | 7 | Model serving · RAG · LLM observability · Guardrails · Evals |
| GEICO | Machine Learning Engineer II | Insurance | 7 | Model serving · Fine-tuning |
| Allstate | Applied Machine Learning Engineer (All Levels) | Insurance | 7 | Model serving · Evals · Interpretability |
| MetLife | MLOps Engineer | Insurance | 7 | Model serving · LLM observability · RAG · Fine-tuning |
| Allstate | Machine Learning Platform Engineer | Insurance | 7 | Model serving |
| Premera Blue Cross | AI Engineer III | Insurance | 7 | Model serving · RAG · Guardrails · LLM observability |
| GEICO | Senior Machine Learning Engineer | Insurance | 7 | Agent orchestration · Tool use · Fine-tuning · Model serving · LLM observability · RAG · Evals |
| GEICO | Senior Staff Machine Learning Engineer | Insurance | 7 | Model serving · Evals · Agent orchestration · RAG |
| Allstate | Enterprise Software Developer Expert | Insurance | 7 | RAG · LLM observability · Model serving |
| MetLife | Senior AI Engineer I | Insurance | 7 | Model serving · Fine-tuning · RAG · Vector DB · LLM observability |
| Allstate | Machine Learning Platform - Lead Engineer | Insurance | 7 | Model serving |
| Allstate | Senior AI Cloud Platform Engineer | Insurance | 7 | Model serving |
| Premera Blue Cross | AI Engineer IV | Insurance | 7 | Model serving |
| Premera Blue Cross | Solution Architect/AI Engineer IV | Insurance | 7 | Model serving · RAG · Guardrails · LLM observability · Fine-tuning |
| Premera Blue Cross | Solution Architect/AI Engineer IV | Insurance | 7 | RAG · Agent orchestration · Multimodal · Model serving · LLM observability · Guardrails |
| Premera Blue Cross | AI Engineer IV | Insurance | 7 | Model serving · RAG |
| GEICO | Senior Staff Engineer, Interactive Voice Response - AI/ML | Insurance | 7 | Agent orchestration · LLM observability · Model serving · Multimodal |
| MetLife | Sr. Architect - Data & AI Platform Strategy | Insurance | 5 | Model serving |
| Premera Blue Cross | Site Reliability Engineer IV | Insurance | 5 | Model serving · LLM observability |
| State Farm | Senior AWS Infrastructure Engineer | Insurance | 5 | RAG · Agent orchestration · Model serving |