2124 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | Senior Software Engineer - Agentic AI | Semiconductors | 9 | Agent orchestration · Multimodal · Model serving · Evals |
| Target | Principal AI Engineer - Advanced AI (Machine Learning, Python, Deep Learning) | Retail | 9 | Agent orchestration · LLM observability · Evals · Model serving |
| OpenAI | Performance & Systems Engineer, Codex | AI Frontier | 9 | Model serving · Agent orchestration · LLM observability |
| NVIDIA | Senior Software Engineer, AI Inference Systems | Semiconductors | 9 | Model serving |
| Adobe | Sr Staff Machine Learning Engineer, Adobe Firefly Services | Enterprise | 9 | Model serving · Fine-tuning |
| Elastic | Lead GenAI Cloud Developer | Enterprise | 9 | Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability · Evals · Tool use · Guardrails |
| Databricks | Principal Research Scientist - AI Scaling & Optimization | Data AI | 9 | Frontier research · Fine-tuning · Model serving |
| Capital One | Distinguished Engineer | Banking | 9 | Model serving · Quantization |
| Snowflake | AI System Research and Development Engineer - Optimization | Data AI | 9 | Model serving · Agent orchestration |
| Capital One | Principal Associate, Data Science - AI Foundations | Banking | 9 | Fine-tuning · Model serving · Agent orchestration · RAG · Vector DB · LLM observability |
| Expedia | Machine Learning Engineer III (Gen AI & Multi-Agentic Systems) | Hospitality | 9 | Agent orchestration · Fine-tuning · RAG · Vector DB · Multimodal · Model serving · LLM observability · Evals · Guardrails · RL post-training · Code gen |
| Expedia | Senior Machine Learning Engineer (Gen AI & Multi-Agentic Systems) | Hospitality | 9 | Agent orchestration · RAG · Vector DB · Fine-tuning · RL post-training · Model serving · Multimodal · Vision · Audio & speech · Code gen · Evals · Guardrails · LLM observability |
| NVIDIA | Tech Engagement Lead - Model Builder | Semiconductors | 9 | Model serving |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Model serving · Quantization |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Multimodal · Embodied AI · Fine-tuning · RL post-training · Model serving · Quantization |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Model serving · Quantization |
| Intel | Senior AI Software Architect - Runtime | Semiconductors | 9 | Model serving |
| NVIDIA | Senior Software Engineer, AI Inference Systems | Semiconductors | 9 | Model serving |
| NVIDIA | Senior Solutions Architect - Generative AI | Semiconductors | 9 | Fine-tuning · RAG · Agent orchestration · Model serving |
| NVIDIA | Senior Software Engineer, Agentic AI | Semiconductors | 9 | Agent orchestration · Evals · Model serving · Code gen |
| Mistral AI | Applied AI, Forward Deployed Machine Learning Engineer - Montreal | AI Frontier | 9 | Fine-tuning · RAG · Agent orchestration · Vector DB · Model serving |
| Mistral AI | Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - EMEA | AI Frontier | 9 | Fine-tuning · RAG · Agent orchestration · Model serving |
| Mistral AI | Applied AI Engineer, Prototyping | AI Frontier | 9 | Agent orchestration · RAG · Model serving |
| Mistral AI | Applied AI, Forward Deployed Machine Learning Engineer, Critical and Sovereign Institutions, EMEA | AI Frontier | 9 | Agent orchestration · Fine-tuning · RAG · Model serving |
| OpenAI | Software Engineer, Inference - Performance Optimization | AI Frontier | 9 | Model serving |
| NVIDIA | Senior Deep Learning Software Engineer | Semiconductors | 9 | Model serving · Fine-tuning |
| NVIDIA | LLM Reinforcement Learning Framework Engineer | Semiconductors | 9 | RL post-training · Agent research · Agent orchestration · Fine-tuning · Model serving |
| NVIDIA | Senior Applied AI Researcher, Digital Biology | Semiconductors | 9 | Agent orchestration · Tool use · Multimodal · LLM observability · Fine-tuning · Model serving · Frontier research · Interpretability · Code gen |
| OpenAI | Manager, Forward Deployed Engineering - Munich | AI Frontier | 9 | Model serving |
| Anthropic | Research Engineer, RL Infrastructure (Knowledge Work) | AI Frontier | 9 | Evals · LLM observability · Model serving · RL post-training · Agent orchestration |