Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
50 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Honeywell | Lead AI Engr | Industrial | 9 | Agent orchestration · Agent research · Fine-tuning · Model serving · RAG · Vector DB · Multimodal · Guardrails · LLM observability |
| Cognite | Principal ML Engineer | Industrial | 8 | Model serving · Vision · Agent orchestration · RAG · Multimodal |
| Caterpillar | Agentic AI / AI Ops Engineer – Platform Engineering | Industrial | 8 | Agent orchestration · Tool use · LLM observability · Model serving |
| Cognite | Senior Machine Learning Engineer | Industrial | 8 | Vision · Multimodal · Model serving · RAG · Vector DB · Fine-tuning · LLM observability · Agent orchestration |
| Caterpillar | Lead Architect – Digital Twin & AI Factory | Industrial | 8 | Synthetic data · Model serving |
| Honeywell | Software Engr II | Industrial | 8 | Agent orchestration · RAG · Model serving · LLM observability |
| Caterpillar | Lead Data Scientist - Gen AI & Digital Twin | Industrial | 8 | RAG · Fine-tuning · Model serving |
| Honeywell | Sr. Director Data & AI Platforms | Industrial | 8 | Agent orchestration · Model serving · RAG · Vector DB · Guardrails |
| Honeywell | Sr Advanced AI Platform Engineer | Industrial | 8 | Agent orchestration · RAG · Model serving · LLM observability |
| Honeywell | Advanced AI Engineer | Industrial | 8 | Agent orchestration · Agent research · RAG · LLM observability · Model serving · Fine-tuning |
| Honeywell | Sr Advanced Software Engr | Industrial | 8 | Model serving |
| Honeywell | Sr Advanced Software Engr | Industrial | 8 | Model serving · RAG |
| Honeywell | Software Engr II | Industrial | 8 | Agent orchestration · RAG · Model serving · LLM observability |
| Cognite | Senior AI Platform Engineer, Atlas AI | Industrial | 8 | Agent orchestration · Tool use · LLM observability · Model serving · Evals |
| Cognite | Machine Learning Engineer | Industrial | 7 | Fine-tuning · Model serving · RAG · Vector DB · Agent orchestration |
| Honeywell | Advanced Data Scientist | Industrial | 7 | Model serving |
| Caterpillar | Senior Manager, Internal Enterprise Analytics & AI Experience Gateway | Industrial | 7 | Model serving · Agent orchestration · Tool use · RAG · LLM observability |
| Caterpillar | Principal Digital Architect (Autonomy) | Industrial | 7 | Vision · Multimodal · Agent orchestration · Tool use · Model serving |
| Honeywell | Senior IT Architect | Industrial | 7 | Model serving |
| Caterpillar | Senior Autonomy Engineer | Industrial | 7 | Model serving |
| Caterpillar | Agentic AI / AI Ops Engineer – Platform Engineering | Industrial | 7 | Agent orchestration · Tool use · LLM observability · RAG · Model serving |
| Caterpillar | Principal Digital Architect | Industrial | 7 | Agent orchestration · RAG · Model serving |
| Caterpillar | Autonomy Engineering Specialist | Industrial | 7 | Model serving |
| Caterpillar | Electronic Components Project Lead | Industrial | 7 | Model serving |
| Honeywell | Lead Software Engr | Industrial | 7 | Model serving |
| Caterpillar | Global Data Science Architect | Industrial | 7 | Model serving |
| Caterpillar | 视觉算法部署实习生 | Industrial | 7 | Vision · Model serving · Fine-tuning |
| Honeywell | AI Engr II | Industrial | 7 | Model serving |
| Caterpillar | Principal Digital Architect | Industrial | 7 | Model serving |
| Caterpillar | Global Data Science Architect | Industrial | 7 | Model serving |