Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
102 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Target | Principal AI Engineer - Advanced AI (Machine Learning, Python, Deep Learning) | Retail | 9 | Agent orchestration · LLM observability · Evals · Model serving |
| Target | Lead Engineer - GenAI | Retail | 9 | Agent orchestration · Agent research · Tool use · LLM observability · RAG · Fine-tuning · Model serving · Synthetic data |
| Walmart | Principal, Data Scientist | Retail | 9 | Agent orchestration · Model serving · RAG · Vector DB · LLM observability · Evals · Guardrails |
| Walmart | Distinguished, Software Engineer -AI/ML Engineer- Walmart Connect | Retail | 9 | Agent orchestration · Tool use · Multimodal · RAG · Vector DB · Fine-tuning · Model serving · RL post-training · Agent research · LLM observability · Guardrails |
| Walmart | Principal, Data Scientist | Retail | 9 | Agent orchestration · Tool use · RAG · Vector DB · Evals · Model serving |
| Target | Lead Engineer- Advanced AI | Retail | 8 | Agent orchestration · Tool use · RAG · Evals · LLM observability · Model serving |
| Target | Sr. Data Scientist | Retail | 8 | Search & ranking · Recommender systems · RAG · Agent orchestration · Tool use · Guardrails · Fine-tuning · Model serving |
| Walmart | (USA) Senior, Software Engineer - MLE- Agentic AI & AIOps | Retail | 8 | Agent orchestration · RAG · LLM observability · Model serving |
| Walmart | (USA) Senior, Data Scientist - Applied AI | Retail | 8 | Agent orchestration · Evals · Model serving |
| Walmart | Senior, Data Scientist (Machine Learning Engineer) | Retail | 8 | Model serving · Multimodal · RAG · Fine-tuning · Vector DB |
| Target | Sr Applied Data Scientist - Search and Browse (Applied ML, NLP, LLMs) | Retail | 8 | Recommender systems · Search & ranking · Vector DB · RAG · Model serving |
| Walmart | Senior, Software Engineer - AI Systems | Retail | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Model serving |
| Walmart | Software Engineer III– AI Systems | Retail | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Model serving |
| Walmart | (USA) Principal, Software Engineer | Retail | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Evals · Model serving |
| Walmart | (USA) Distinguished, Software Engineer | Retail | 8 | Model serving · LLM observability · Guardrails · Agent orchestration · Tool use |
| Walmart | (USA) Staff, Data Scientist | Retail | 8 | Agent orchestration · RAG · Fine-tuning · Model serving · LLM observability |
| Walmart | Principal, Software Engineer | Retail | 8 | Agent orchestration · Agent research · Model serving · Vision · Multimodal |
| Walmart | Distinguished, Software Engineer | Retail | 8 | Agent orchestration · Tool use · Model serving · LLM observability · Guardrails |
| Walmart | Staff, Data Scientist | Retail | 8 | Agent orchestration · Model serving |
| Walmart | Staff, Software Engineer | Retail | 8 | Recommender systems · Search & ranking · Agent orchestration · RAG · Vector DB · LLM observability · Model serving · Multimodal |
| Walmart | Distinguished, Data Scientist | Retail | 8 | Search & ranking · Recommender systems · Agent orchestration · Tool use · LLM observability · RAG · Model serving |
| Walmart | (USA) Staff, Software Engineer | MLE | Retail | 8 | Multimodal · Vision · Fine-tuning · Model serving |
| Walmart | (USA) Staff, Software Engineer - MLE- Agentic AI & AIOps | Retail | 8 | Agent orchestration · LLM observability · Model serving |
| Chewy | Machine Learning Engineer II | Retail | 7 | Model serving |
| Target | Lead Engineer Cyber AI - AI Expert | Retail | 7 | Agent orchestration · Tool use · LLM observability · Model serving |
| Target | Sr Engineer - Agentic Commerce | Retail | 7 | Agent orchestration · RAG · Agent research · Model serving |
| Target | Sr AI Engineer-Item Science | Retail | 7 | Recommender systems · Search & ranking · Model serving |
| Target | Sr Machine Learning Engineer - Marketing and Corporate Systems (ML Ops) | Retail | 7 | Model serving · Recommender systems |
| Target | Lead Machine Learning Engineer - Merchandising AI (ML Ops) | Retail | 7 | Agent orchestration · Model serving |
| Walmart | (USA) Principal, Software Engineer | Retail | 7 | Evals · Model serving · RAG · LLM observability |