Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding. Primary AI lifecycle stage: serving infrastructure.
2,830 active AI roles across 214 companies in our index reference Inference infra as of today.
The companies with the most active Inference infra listings are: Amazon (355 roles), NVIDIA (323 roles), Google (191 roles), Capital One (117 roles), Microsoft (104 roles).
Inference infra primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Inference infra roles concentrate at: serving infrastructure (58%), agents (25%).
The sectors with the most active Inference infra hiring are: Big Tech, Semiconductors, Enterprise.
Lower-level systems work optimizing how trained models actually run on GPUs: scheduling, custom kernels, paged attention, speculative decoding.
Primary AI lifecycle stage: serving infrastructure.
As of today, 2,830 active AI roles across 214 companies in our index reference Inference infra. Hiring concentrates at the serving infrastructure (58%) and agents (25%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
40 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Iambic | Machine Learning Scientist — Large multimodal models | Pharma | 9 | Multimodal · Fine-tuning · Model serving · Frontier research |
| Eli Lilly | Advisor - Agent Research | Pharma | 9 | Agent orchestration · Tool use · Model serving · RAG · Fine-tuning |
| Johnson & Johnson | Director, MedTech Technology AI Orchestration and Enablement | Pharma | 8 | Agent orchestration · Model serving · Guardrails · LLM observability |
| Pfizer | Director, AI Engineering--Clinical Development and Operations (CD&O) | Pharma | 8 | Agent orchestration · LLM observability · RAG · Fine-tuning · Model serving |
| Johnson & Johnson | Senior Machine Learning Engineer - Robotics | Pharma | 8 | Embodied AI · Multimodal · Model serving · Fine-tuning |
| Eli Lilly | Associate Director - AI Engineering | Pharma | 8 | Agent orchestration · Tool use · RAG · LLM observability · Model serving · Guardrails |
| Johnson & Johnson | Senior Scientist, Computer Vision | Pharma | 8 | Vision · Multimodal · Model serving |
| Johnson & Johnson | Design Leader, Polyphonic AI Lab | Pharma | 7 | Model serving · Multimodal |
| Johnson & Johnson | Senior Program Manager, R&D | Pharma | 7 | Model serving |
| Johnson & Johnson | Senior Program Manager, R&D | Pharma | 7 | Model serving |
| Iambic | Software Engineer I/II, Machine Learning | Pharma | 7 | Fine-tuning · Model serving |
| Johnson & Johnson | Manager, Data Science - Global Finance | Pharma | 7 | Agent orchestration · Fine-tuning · Model serving |
| Merck | Senior Scientist, Hybrid Modeler, Digital Insights, DSCS Digital Technologies | Pharma | 7 | Model serving |
| Pfizer | Staff Platform Engineer, AI/ML Infrastructure | Pharma | 7 | Model serving · LLM observability |
| Eli Lilly | Engineer - MLOps & Scientific Platforms - Data Foundry | Pharma | 7 | Model serving · LLM observability · Guardrails · Agent orchestration |
| Merck | Manager, ML Engineering | Pharma | 7 | Model serving · RAG · LLM observability |
| Johnson & Johnson | Director, MedTech Technology AI/DS Platforms, Innovation | Pharma | 7 | Model serving |
| Merck | Manager, AI Engineer | Pharma | 7 | Model serving · RAG · LLM observability |
| Pfizer | (Sr)Manager, Data & AI Engineer | Pharma | 7 | Agent orchestration · RAG · LLM observability · Model serving |
| Merck | Senior Specialist, DevOps Engineer | Pharma | 7 | Model serving · RAG · LLM observability · Guardrails |
| Pfizer | Senior Manager, ML Ops & Observability Engineer | Pharma | 7 | Model serving · LLM observability · Evals |
| Eli Lilly | Sr. Staff Software Engineer - AI Chat | Pharma | 7 | Agent orchestration · Tool use · Model serving · LLM observability |
| Johnson & Johnson | Principal Software Engineer - Tech Lead | Pharma | 7 | Model serving |
| Eli Lilly | Technical Lead - Software Developer, Data Foundry | Pharma | 7 | Agent orchestration · Model serving |
| Johnson & Johnson | Staff Engineer, AI/ ML/Surgical Robotics - OTTAVA | Pharma | 7 | Model serving |
| Eli Lilly | Technical Lead Software Architect | Pharma | 7 | Agent orchestration · Agent research · RAG · Evals · Model serving |
| Iambic | Platform Engineer, CloudOps Infrastructure | Pharma | 5 | Model serving |
| Merck | Enterprise Data Access Product Owner | Pharma | 5 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving · Recommender systems · Search & ranking · Vision · Audio & speech · Frontier research · Interpretability · Synthetic data · Agent research · RL post-training · RLHF · Reward modeling · RL robotics · Embodied AI |
| Pfizer | Staff Engineer High Performance Computing | Pharma | 5 | Model serving |
| Merck | Senior Staff Reliability Engineer, Software Engineering | Pharma | 5 | LLM observability · Model serving |