Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
109 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Plaid | Machine Learning Engineer (Research Scientist) - DFAI | Fintech | 9 | Pretraining · Fine-tuning · Model serving · LLM observability |
| Plaid | Senior Machine Learning Engineer (Research Scientist) - DFAI | Fintech | 9 | Pretraining · Fine-tuning · Model serving · LLM observability |
| Visa | Senior AI Engineer | Fintech | 9 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Fine-tuning · Model serving · Frontier research · Interpretability · RL post-training · Agent research · Multimodal |
| Upstart | Principal Engineer, LLM | Fintech | 9 | Model serving · RAG · Vector DB · Evals · LLM observability |
| Plaid | Senior Machine Learning Engineer (Research Scientist) - Data Foundation & AI | Fintech | 9 | Pretraining · Fine-tuning · Model serving · Frontier research |
| Mastercard | Senior Software Engineer - Backend/Platform Agentic AI | Fintech | 8 | Agent orchestration · Tool use · RAG · LLM observability · Guardrails · Model serving |
| Mastercard | Software Engineer II - Backend/Platform Agentic AI | Fintech | 8 | Agent orchestration · Model serving · RAG |
| Block | Staff Applied Machine Learning Engineer - Fraud & Abuse | Fintech | 8 | Model serving · Agent orchestration · Evals |
| Block | Staff Machine Learning Engineer (Modeling), Support | Fintech | 8 | Agent orchestration · RAG · Fine-tuning · Model serving · Recommender systems |
| Stripe | Staff Software Engineer, Machine Learning Platform | Fintech | 8 | Model serving · Agent orchestration · LLM observability |
| Robinhood | Senior Software Engineer | Fintech | 8 | Model serving · Fine-tuning · Evals |
| Visa | Software Engineer, Sr. Consultant Level (11-15 years exp, Java-Python-AWS-GenAI) | Fintech | 8 | Agent orchestration · RAG · Vector DB · LLM observability · Guardrails · Model serving |
| Ripple | Staff Software Engineer, GenAI Platform | Fintech | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Model serving |
| Visa | Senior Director, Software Engineering (GenAI/Cloud) | Fintech | 8 | Agent orchestration · RAG · Vector DB · LLM observability · Model serving |
| Affirm | Senior Manager, Software Engineering (ML Platform) | Fintech | 8 | Model serving · Training infra |
| Affirm | Senior Manager, Software Engineering, (ML Platform) | Fintech | 8 | Model serving · Training infra |
| SoFi | Director, AI Platforms | Fintech | 8 | Model serving · Agent orchestration · RAG · Evals · LLM observability · Guardrails |
| Visa | Machine Learning Engineer | Fintech | 8 | Agent orchestration · RAG · Model serving · Guardrails |
| Mercury | Senior Software Engineer - AI Engineering | Fintech | 8 | Agent orchestration · RAG · Evals · Guardrails · LLM observability · Model serving |
| PayPal | Principal Machine Learning Engineer | Fintech | 8 | Recommender systems · Search & ranking · Model serving |
| PayPal | Staff Software Engineer, Agentic AI | Fintech | 8 | Agent orchestration · Agent research · Model serving |
| Stripe | Software Engineer, Machine Learning Infrastructure | Fintech | 8 | Model serving · LLM observability |
| Ramp | Applied AI Engineer | Fintech | 8 | Agent orchestration · RAG · Fine-tuning · Model serving |
| Upstart | Staff+ Machine Learning Engineer | Fintech | 8 | Model serving |
| Block | Senior Machine Learning Engineer, AI Personalization | Fintech | 8 | Recommender systems · Search & ranking · Agent orchestration · Model serving |
| Mercury | Senior Machine Learning Operations Engineer | Fintech | 7 | Model serving · LLM observability |
| Block | Product Manager, Risk Automation | Fintech | 7 | Agent orchestration · Evals · Guardrails · Fine-tuning · Model serving · LLM observability |
| Visa | Sr. ML Engineer | Fintech | 7 | Model serving |
| Visa | Manager, Analytics & Agentic Platform Operations | Fintech | 7 | Agent orchestration · Agent research · Model serving · LLM observability |
| Visa | Senior Product Manager | Fintech | 7 | Model serving |