Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), and Microsoft (104 roles).
In current hiring, Inference infra roles concentrate at two stages of the AI lifecycle: serving infrastructure (58%) and agents (25%).
The sectors with the most active Inference infra hiring are Big Tech, Semiconductors, and Enterprise.
270 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| JPMorgan Chase | Applied AI ML Researcher Director | Banking | 9 | Agent orchestration · Agent research · Model serving |
| Capital One | Applied Researcher II | Banking | 9 | Pretraining · Fine-tuning · Model serving · Vector DB |
| Capital One | Distinguished Engineer | Banking | 9 | Model serving · Quantization |
| Capital One | Principal Associate, Data Science - AI Foundations | Banking | 9 | Fine-tuning · Model serving · Agent orchestration · RAG · Vector DB · LLM observability |
| JPMorgan Chase | Generative AI Executive Director | Banking | 9 | Agent orchestration · Multimodal · Fine-tuning · Model serving |
| JPMorgan Chase | Applied AI ML Researcher Director | Banking | 9 | Agent orchestration · Agent research · Model serving |
| JPMorgan Chase | Generative AI - Vice President | Banking | 9 | Agent orchestration · LLM observability · Model serving · Fine-tuning · Multimodal |
| JPMorgan Chase | AI Agents Applied Engineer - Senior Associate | Banking | 9 | Agent orchestration · Tool use · Fine-tuning · Model serving · Guardrails · LLM observability · Recommender systems · Search & ranking · RL post-training |
| JPMorgan Chase | AI Agents Applied Research/Engineering Lead - Vice President | Banking | 9 | Agent orchestration · Tool use · Guardrails · Fine-tuning · Model serving · Recommender systems · Search & ranking · RL post-training |
| Capital One | Applied Researcher II (AI Foundations, LLM Core and Agentic AI) | Banking | 9 | Fine-tuning · Frontier research · Model serving · Pretraining · RL post-training · Vector DB |
| JPMorgan Chase | Applied AI ML Lead Researcher - Commercial and Investment Bank | Banking | 9 | Agent orchestration · Agent research · Frontier research · Model serving |
| JPMorgan Chase | Applied AI/ML Director Researcher | Banking | 9 | Agent orchestration · Agent research · Frontier research · Model serving |
| JPMorgan Chase | Generative AI Director | Banking | 9 | LLM observability · Agent orchestration · Tool use · Fine-tuning · Model serving · Multimodal · Vision · Audio & speech |
| Capital One | Sr. Distinguished Applied Researcher | Banking | 9 | Pretraining · Fine-tuning · Model serving · Vector DB · Frontier research |
| JPMorgan Chase | Senior AI Application Engineer - Vice President | Banking | 8 | LLM observability · Evals · Guardrails · Model serving |
| Capital One | Senior Director, Software Engineering - AI | Banking | 8 | Agent orchestration · Model serving · LLM observability |
| Capital One | Senior Lead AI Engineer (GenAI Platform Services) | Banking | 8 | Fine-tuning · Model serving · Guardrails · Vector DB · LLM observability · Evals |
| Capital One | Lead AI Engineer (Vision model customization, VML) | Banking | 8 | Vision · Model serving · Guardrails · Vector DB |
| Capital One | Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services) | Banking | 8 | Model serving · Guardrails · Vector DB · RAG · Fine-tuning |
| Capital One | Senior Lead AI Engineer, Gen AI Platform | Banking | 8 | Model serving · Fine-tuning · Guardrails · Vector DB · LLM observability |
| Capital One | Manager, Data Science - GenAI Digital Assistant | Banking | 8 | Agent orchestration · Fine-tuning · Model serving · RAG · Vector DB · LLM observability |
| JPMorgan Chase | Applied AI ML Lead [Multiple Positions Available] | Banking | 8 | Fine-tuning · Model serving · RAG · LLM observability · Evals |
| Capital One | Senior Manager, Data Science - AI Foundations | Banking | 8 | Fine-tuning · Model serving · RAG · Vector DB · LLM observability |
| Capital One | Lead AI Engineer (Vision model customization, VLM) | Banking | 8 | Vision · Multimodal · Model serving · Fine-tuning · RAG · Vector DB · Guardrails · LLM observability |
| JPMorgan Chase | Applied AI ML Senior Associate | Banking | 8 | Agent orchestration · Model serving · RAG · Fine-tuning |
| Capital One | Lead Machine Learning Engineer | Banking | 8 | Agent orchestration · Model serving · Guardrails · LLM observability · Fine-tuning · Evals |
| Capital One | Lead Machine Learning Engineer (Manager IC) | Banking | 8 | Model serving · Guardrails · LLM observability · Agent orchestration · Fine-tuning · RAG |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability · Evals |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Model serving · Fine-tuning · Guardrails · Vector DB · LLM observability · Evals |
| Capital One | Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform) | Banking | 8 | Model serving · Fine-tuning · Guardrails · Vector DB · LLM observability |