1 AI role tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| UPS | UPS Digital Senior Machine Learning Engineer | Logistics | 8 | Agent orchestration · Tool use · Model serving · Fine-tuning |
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,740 active AI roles across 208 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (59%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,740 active AI roles across 208 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (334 roles), NVIDIA (326 roles), Google (176 roles), Capital One (107 roles), Microsoft (106 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (59%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.