2 AI roles tagged training_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Affirm | Senior Manager, Software Engineering (ML Platform) | Fintech | 8 | Model serving · Inference infra |
| Affirm | Senior Manager, Software Engineering, (ML Platform) | Fintech | 8 | Model serving · Inference infra |
Distributed systems for training large models: data loading, gradient sharding (FSDP, ZeRO, Megatron), checkpointing, and fault tolerance across thousands of GPUs. Primary AI lifecycle stage: pre-training.
41 active AI roles across 28 companies in our index reference Training infra as of today.
The companies with the most active Training infra listings are: OpenAI (6 roles), Jane Street (3 roles), Affirm (2 roles), Amazon (2 roles), Figure AI (2 roles).
Training infra primarily belongs to the pre-training stage of the AI lifecycle. In current hiring, Training infra roles concentrate at: serving infrastructure (46%), data (44%).
The sectors with the most active Training infra hiring are: AI Frontier, Big Tech, Enterprise.
Distributed systems for training large models: data loading, gradient sharding (FSDP, ZeRO, Megatron), checkpointing, and fault tolerance across thousands of GPUs.
Primary AI lifecycle stage: pre-training.
As of today, 41 active AI roles across 28 companies in our index reference Training infra. Hiring concentrates at the serving infrastructure (46%) and data (44%) stages. Most common sectors: AI Frontier, Big Tech, Enterprise.