Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), and Microsoft (104 roles).
Current Inference infra hiring concentrates in two AI lifecycle stages: serving infrastructure (58%) and agents (25%).
The sectors with the most active Inference infra hiring are Big Tech, Semiconductors, and Enterprise.
16 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| AT&T | Director Cybersecurity - AI/ML/Automation (Cyber Threat Analytics) | Telecom | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · Fine-tuning · Model serving |
| Verizon | Engr III Cslt-AI Science | Telecom | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability |
| Verizon | Princ Engr-AI Science | Telecom | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability · Multimodal |
| Verizon | Engr III Cslt-AI Science | Telecom | 8 | Model serving · Multimodal |
| Verizon | Director - AI/ML Engineering | Telecom | 8 | LLM observability · Agent orchestration · Tool use · Vector DB · Model serving · Guardrails |
| Verizon | Sr Engr Cslt-AI/ML Engineering | Telecom | 8 | Model serving · Agent orchestration |
| Verizon | Engr III Cslt-AI Science | Telecom | 8 | Agent orchestration · Model serving |
| Verizon | Sr Engr Cslt-AI Science | Telecom | 8 | Agent orchestration · Agent research · Fine-tuning · Model serving · LLM observability · RAG |
| Verizon | Senior Engineering Consultant-Cloud & AI | Telecom | 8 | Agent orchestration · RAG · Tool use · LLM observability · Model serving |
| Verizon | Assoc Dir-AI Science | Telecom | 7 | Model serving |
| T-Mobile | Senior Data Science Engineer | Telecom | 7 | Model serving |
| AT&T | Principal Data/AI Engineering | Telecom | 7 | Model serving |
| AT&T | Lead System Engineer (AI Automation Engineer SRE Focus) | Telecom | 7 | Agent orchestration · LLM observability · Model serving |
| AT&T | Director of Engineering – Decision Intelligence Platform | Telecom | 7 | Model serving · LLM observability · Recommender systems · Search & ranking |
| T-Mobile | Sr. Solutions Architect, Physical AI | Telecom | 5 | Embodied AI · Agent orchestration |
| T-Mobile | Engineer, System Architecture - AI Enabled Automation | Telecom | 5 | Model serving |