3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Cohere | Staff Research Engineer, Model Efficiency | AI Frontier | 9 | Model serving · Fine-tuning · Frontier research |
| Cohere | Member of Technical Staff, Model Efficiency | AI Frontier | 9 | Model serving |
| Anthropic | Research Engineer, Interpretability | AI Frontier | 9 | Interpretability · Model serving · Fine-tuning |
| OpenAI | Forward Deployed Engineer (FDE) - NYC | AI Frontier | 9 | Model serving · LLM observability |
| Senior Staff Software Engineer, AI/ML GenAI, Google Cloud AI | Big Tech | 9 | Multimodal · Vision · Model serving | |
| OpenAI | Software Engineer, Hardware | AI Frontier | 9 | Model serving |
| Scale AI | Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI | Data AI | 9 | RL post-training · Agent orchestration · Tool use · Model serving |
| OpenAI | Offensive Security Engineer, Agent Products | AI Frontier | 9 | Agent orchestration · Tool use · Evals · Guardrails · Model serving |
| Cerebras | Senior Runtime Engineer | Semiconductors | 9 | Model serving |
| Amazon | Member of Technical Staff - Reinforcement Learning (Infrastructure), AGI Autonomy | Big Tech | 9 | RL robotics · Model serving |
| Wayve | Machine Learning Engineer | Robotics | 9 | Embodied AI · Model serving · Evals · Synthetic data · Fine-tuning |
| Amazon | Senior Applied Scientist, Delivery Foundation Model | Big Tech | 9 | Multimodal · Fine-tuning · Model serving · Frontier research |
| NVIDIA | Senior LLM Train Framework Engineer | Semiconductors | 9 | Pretraining · Fine-tuning · Model serving · Multimodal |
| Anthropic | Research Engineer / Research Scientist, Tokens | AI Frontier | 9 | Pretraining · Fine-tuning · Model serving · Frontier research |
| OpenAI | Software Engineer, Inference – AMD GPU Enablement | AI Frontier | 9 | Model serving |
| Databricks | Staff Software Engineer - GenAI Performance and Kernel | Data AI | 9 | Model serving · Quantization |
| Databricks | Staff Software Engineer - GenAI inference | Data AI | 9 | Model serving |
| Cerebras | Kernel Engineer | Semiconductors | 9 | Model serving |
| Capital One | Sr. Distinguished Applied Researcher | Banking | 9 | Pretraining · Fine-tuning · Model serving · Vector DB · Frontier research |
| Anthropic | Research Engineer, Pretraining Scaling - London | AI Frontier | 9 | Pretraining · Model serving · LLM observability · Evals |
| NVIDIA | AI Computing Software Development Engineer, TensorRT-LLM | Semiconductors | 9 | Model serving · LLM observability |
| OpenAI | Forward Deployed Engineer - Munich | AI Frontier | 9 | Agent orchestration · Model serving · LLM observability · Evals |
| OpenAI | Forward Deployed Engineer - Paris | AI Frontier | 9 | Model serving · LLM observability · Evals · Agent orchestration |
| OpenAI | Forward Deployed Engineer - Dublin | AI Frontier | 9 | Agent orchestration · Model serving · Evals · LLM observability |
| Anthropic | Performance Engineer, GPU | AI Frontier | 9 | Model serving · Quantization · Pretraining |
| Shield AI | Product Manager, AI Platforms (R4991) | Defense | 9 | Multimodal · Training infra · Evals · Synthetic data · Model serving |
| Glean | Machine Learning Engineer, AI Assistant & Autonomous AI Agents | Enterprise | 9 | Agent orchestration · Agent research · Evals · Model serving |
| ByteDance | Senior Research Scientist - Machine Learning System | Big Tech | 9 | Model serving |
| Datadog | AI Research Engineer - Datadog AI Research (DAIR) | Enterprise | 9 | Multimodal · RL post-training · Agent orchestration · Frontier research · Model serving · Evals |
| Moveworks | Engineering Manager - Agentic Systems | Enterprise | 9 | Model serving · Agent orchestration · LLM observability |