Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
604 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Unity | Staff Machine Learning Engineer - Computer Vision & Multi-Modal AI | Enterprise | 9 | Multimodal · Model serving · Fine-tuning · Quantization |
| Rubrik | Senior Machine Learning Engineer | Enterprise | 9 | Fine-tuning · RL post-training · Model serving · Synthetic data · Evals · Guardrails · LLM observability |
| CrowdStrike | Lead AI Engineer, GTM Applications (Remote) | Enterprise | 9 | Agent orchestration · Agent research · RAG · Vector DB · LLM observability · Guardrails · Evals · Model serving |
| Oracle | Senior Principal AI Agent / ML Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Model serving |
| Oracle | Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving |
| Axon | Senior Agentic AI Research Scientist | Enterprise | 9 | Agent orchestration · Multimodal · Vision · RAG · Vector DB · Model serving · Evals |
| Oracle | Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Model serving · Agent research |
| Oracle | Senior Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Model serving · Agent research |
| Oracle | Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Model serving |
| Oracle | Senior Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Model serving · Agent research |
| Oracle | Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Model serving · Agent research |
| Oracle | Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Model serving |
| Adobe | Applied Scientist 5.5 | Enterprise | 9 | Fine-tuning · Model serving · Multimodal · Evals |
| Adobe | Applied Scientist 5 | Enterprise | 9 | Fine-tuning · Model serving · Vision · Multimodal |
| Oracle | Snr Director, Applied Science | Enterprise | 9 | Multimodal · Agent orchestration · Model serving · RAG · Evals · Guardrails · LLM observability · Vision · Audio & speech |
| Axon | AI Scientist I | Enterprise | 9 | Vision · Multimodal · Fine-tuning · Model serving |
| ABBYY | Principal Machine Learning Engineer - Model Efficiency Optimization | Enterprise | 9 | Model serving · Fine-tuning · Quantization · Multimodal · Vision |
| Adobe | Director, ML Engineering | Enterprise | 9 | Model serving · Multimodal |
| Oracle | Senior Principal AI Agent / ML Software Engineer (OCI) | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Model serving |
| Canva | Senior Research Engineer - Video | Enterprise | 9 | Multimodal · Fine-tuning · Model serving · Vision |
| Adobe | Machine Learning Engineer 5 | Enterprise | 9 | Fine-tuning · Model serving · Evals · Multimodal |
| Adobe | Machine Learning Engineer 4 | Enterprise | 9 | Model serving · Multimodal · Fine-tuning |
| Adobe | Sr Staff Machine Learning Engineer, Adobe Firefly Services | Enterprise | 9 | Model serving · Fine-tuning |
| Elastic | Lead GenAI Cloud Developer | Enterprise | 9 | Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability · Evals · Tool use · Guardrails |
| Dropbox | Senior Machine Learning Engineer, Dash Agentic AI | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · Fine-tuning · Model serving · RAG · Agent research |
| Adobe | Senior Machine Learning Engineer | Enterprise | 9 | Model serving · Fine-tuning · Multimodal · Vision · Audio & speech |
| Adobe | Machine Learning Engineer - II | Enterprise | 9 | Model serving · Fine-tuning · Multimodal |
| ServiceNow | Senior Machine Learning Engineer, Agentic Systems - Moveworks | Enterprise | 9 | Model serving · Fine-tuning · LLM observability · Agent orchestration |
| ServiceNow | Engineering Manager, Agentic Systems - Moveworks | Enterprise | 9 | Model serving · Fine-tuning · Evals |
| F5 | Principal Engineer – AI Specialist | Enterprise | 9 | Agent orchestration · Agent research · LLM observability · Model serving · Multimodal |