2124 AI roles tagged inference_infra.

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | AI Inference Performance Engineer | Semiconductors | 9 | Model serving · Quantization |
| NVIDIA | Senior Deep Learning Architect, LLM Inference | Semiconductors | 9 | Model serving · LLM observability |
| NVIDIA | Lead Principal Engineer, Enterprise Agentic AI Platform | Semiconductors | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Model serving |
| NVIDIA | Senior Systems Software Engineer - Deep Learning Solutions | Semiconductors | 9 | Model serving · Vision · Multimodal |
| NVIDIA | Senior Deep Learning Compiler Engineer - XLA | Semiconductors | 9 | Model serving |
| NVIDIA | Principal Software Engineer - AI Inference | Semiconductors | 9 | Model serving |
| NVIDIA | Senior DL Algorithms Engineer - Inference Performance | Semiconductors | 9 | Model serving · Multimodal |
| NVIDIA | High-Performance LLM Training Engineer - New College Grad 2026 | Semiconductors | 9 | Model serving |
| NVIDIA | Senior Research Scientist, AI Accelerator Design and VLSI | Semiconductors | 9 | Quantization · Model serving |
| NVIDIA | Deep Learning Performance Software Engineer | Semiconductors | 9 | Model serving |
| NVIDIA | Senior Applied Deep Learning Research Scientist, Efficiency | Semiconductors | 9 | Fine-tuning · Model serving · Quantization · Pretraining |
| Adobe | Applied Scientist - Multimodal | Enterprise | 9 | Multimodal · Guardrails · Fine-tuning · Model serving · Vision · LLM observability · Evals |
| Adobe | Senior ML Engineer - Firefly | Enterprise | 9 | Multimodal · Fine-tuning · Model serving |
| Adobe | Senior Staff Applied Scientist - AI/ML | Enterprise | 9 | Multimodal · Fine-tuning · Model serving · Evals |
| Adobe | Principal Machine Learning Engineer, Firefly | Enterprise | 9 | Model serving · Fine-tuning |
| NVIDIA | Senior DGX Cloud AI Infrastructure Software Engineer | Semiconductors | 9 | Model serving · Pretraining · Fine-tuning · LLM observability |
| Walmart | Distinguished, Software Engineer -AI/ML Engineer- Walmart Connect | Retail | 9 | Agent orchestration · Tool use · Multimodal · RAG · Vector DB · Fine-tuning · Model serving · RL post-training · Agent research · LLM observability · Guardrails |
| Together AI | Senior Machine Learning Engineer, Voice AI | Data AI | 9 | Model serving · Audio & speech |
| NVIDIA | Solutions Architect, Pre-training and Post-training | Semiconductors | 9 | Pretraining · Fine-tuning · RL post-training · Model serving |
| NVIDIA | Senior GPU Networking Architect | Semiconductors | 9 | Model serving |
| Canva | Research Scientist - Efficient AI (High-Performance Large AI Model Research Scientist) | Enterprise | 9 | Frontier research · Pretraining · Fine-tuning · Model serving · Multimodal · Quantization · Distillation |
| Snowflake | Staff Research Scientist, AI Agents & LLMs | Data AI | 9 | Agent orchestration · Agent research · Fine-tuning · Model serving · Evals |
| Decagon | Senior Software Engineer, ML Infrastructure | Vertical AI | 9 | Fine-tuning · RL post-training · Model serving · Multimodal |
| OpenAI | Software Engineer, Codex Core Agents | AI Frontier | 9 | Agent orchestration · Tool use · Model serving |
| DoorDash | Senior/Staff Deep Reinforcement Learning Engineer | Consumer | 9 | RL robotics · Embodied AI · Agent orchestration · Model serving |
| Adobe | Principal Architect, Express AI Foundations | Enterprise | 9 | Agent orchestration · Model serving · LLM observability · Evals · Multimodal |
| Weights & Biases | VP of Product, Research and Training Infrastructure | Data AI | 9 | Frontier research · Pretraining · RL post-training · RLHF · Model serving |
| Anthropic | Research Engineer, Performance RL | AI Frontier | 9 | RL post-training · Frontier research · Code gen · Model serving |
| Crusoe | Senior Software Engineer, AI Model LifeCycle | Data AI | 9 | Fine-tuning · RL post-training · Frontier research · Multimodal · Model serving |
| OpenAI | TL, Research Inference | AI Frontier | 9 | Model serving |