Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
43 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Disney | Lead Machine Learning Engineer | Media | 9 | Agent orchestration · Agent research · Multimodal · RAG · LLM observability · Evals · Guardrails · Model serving |
| Comcast | Principal Machine Learning Engineer | Media | 8 | Recommender systems · Search & ranking · Fine-tuning · Agent orchestration · Tool use · Model serving |
| Disney | Director, Decision Science AI/ML Engineering & Ops | Media | 8 | Model serving · LLM observability · Guardrails · Evals |
| Disney | Sr Software Engineer | Media | 8 | Agent orchestration · Tool use · LLM observability · RAG · Fine-tuning · Model serving |
| Comcast | Engineer 3 - Machine Learning | Media | 8 | Agent orchestration · LLM observability · Model serving · Guardrails |
| Comcast | Engineer 3 - Machine Learning | Media | 8 | Agent orchestration · LLM observability · Model serving · Guardrails |
| Comcast | Engineer 2 - Machine Learning | Media | 8 | Agent orchestration · Tool use · LLM observability · Model serving · Agent research |
| Disney | Sr Data Scientist | Media | 8 | Multimodal · Fine-tuning · RAG · Model serving · Evals · Vector DB |
| Comcast | Machine Learning Engineer 4 | Media | 8 | Agent orchestration · LLM observability · Evals · Guardrails · Model serving |
| Disney | Sr Machine Learning Engineer | Media | 8 | Model serving · Forecasting · LLM observability |
| Warner Bros Discovery | Sr. Staff, Data Science & Applied AI | Media | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Model serving |
| Comcast | Engineer 3 - Machine Learning | Media | 7 | Agent orchestration · Tool use · LLM observability · RAG · Model serving |
| Comcast | Engineer 3 - Machine Learning | Media | 7 | Model serving · Forecasting |
| Disney | Senior Principal Machine Learning Engineer, Ad Platforms | Media | 7 | Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability |
| Comcast | Engineer 4, Machine Learning N-E084660-0533-CIEC | Media | 7 | Model serving |
| Disney | Senior Manager Software Engineering | Media | 7 | LLM observability · Model serving · Vision |
| Disney | Lead Machine Learning Engineer | Media | 7 | Model serving · Recommender systems · Search & ranking |
| Disney | Sr Machine Learning Engineer | Media | 7 | Model serving · Fine-tuning · Vision · Recommender systems |
| Disney | Senior Machine Learning Engineer - News | Media | 7 | Recommender systems · RAG · Model serving |
| Warner Bros Discovery | Director, Machine Learning Engineering, CNN Digital Products & Services | Media | 7 | Recommender systems · Search & ranking · Model serving · LLM observability |
| Warner Bros Discovery | Principal, System Architecture | Media | 7 | Agent orchestration · Model serving |
| The Trade Desk | Data Scientist I | Media | 7 | Model serving |
| Comcast | Sr. Principal Machine Learning Engineer | Media | 7 | Model serving · Recommender systems · Search & ranking |
| Comcast | Sr. Platform Engineer - AI Agentic | Media | 7 | Agent orchestration · Model serving · LLM observability |
| Disney | Principal Software Engineer | Media | 7 | Vision · Model serving |
| Disney | Sr. Manager, Machine Learning Engineering | Media | 7 | Model serving · RAG · Vector DB |
| Disney | Sr Machine Learning Engineer | Media | 7 | RAG · Vector DB · Model serving |
| Disney | Lead Machine Learning Engineer | Media | 7 | RAG · Vector DB · Model serving |
| Disney | Principal Site Reliability Engineer | Media | 7 | RAG · LLM observability · Model serving · Evals |
| Disney | Principal Site Reliability Engineer | Media | 7 | LLM observability · RAG · Model serving · Agent orchestration · Evals · Guardrails |