5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Mistral AI | Applied AI, Technical Lead, Forward Deployed AI Engineer - EMEA | AI Frontier | 9 | Agent orchestration · Fine-tuning · RAG |
| Mistral AI | Applied AI, Forward Deployed Machine Learning Engineer - Montreal | AI Frontier | 9 | Fine-tuning · RAG · Agent orchestration · Vector DB · Inference infra |
| Mistral AI | Applied AI, Forward Deployed Machine Learning Engineer - EMEA | AI Frontier | 9 | Fine-tuning · RAG · Agent orchestration |
| Mistral AI | Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - EMEA | AI Frontier | 9 | Fine-tuning · RAG · Agent orchestration · Inference infra |
| Mistral AI | Applied AI Engineer, Prototyping | AI Frontier | 9 | Agent orchestration · RAG · Inference infra |
| Mistral AI | Applied AI, Forward Deployed Machine Learning Engineer, Critical and Sovereign Institutions, EMEA | AI Frontier | 9 | Agent orchestration · Fine-tuning · RAG · Inference infra |
| Mistral AI | Applied AI, Forward Deployed Machine Learning Engineer- Singapore | AI Frontier | 9 | Fine-tuning · RAG · Agent orchestration · Vector DB · LLM observability |
| Capital One | Applied Researcher II (AI Foundations, LLM Core and Agentic AI) | Banking | 9 | Pretraining · Fine-tuning · RL post-training · Frontier research · Agent research · Agent orchestration · Vector DB · Recommender systems · Multimodal |
| OpenAI | Software Engineer, Inference - Performance Optimization | AI Frontier | 9 | Inference infra |
| NVIDIA | Senior Deep Learning Software Engineer | Semiconductors | 9 | Inference infra · Fine-tuning |
| NVIDIA | LLM Reinforcement Learning Framework Engineer | Semiconductors | 9 | RL post-training · Agent research · Agent orchestration · Fine-tuning · Inference infra |
| Research Engineer, Frontier Safety Mitigations, DeepMind | Big Tech | 9 | Agent orchestration · Evals · Guardrails · LLM observability · Agent research · Frontier research | |
| Power and Performance Architect, TPU | Big Tech | 9 | Inference infra | |
| Staff Software Engineer, Gemini App Personalization, DeepMind | Big Tech | 9 | Agent orchestration · Tool use · Evals · LLM observability · RAG · Fine-tuning · Recommender systems · Multimodal | |
| Amazon | Senior Applied Scientist | Big Tech | 9 | Multimodal · Embodied AI · Inference infra |
| NVIDIA | Senior Applied AI Researcher, Digital Biology | Semiconductors | 9 | Agent orchestration · Tool use · Multimodal · LLM observability · Fine-tuning · Inference infra · Frontier research · Interpretability · Code gen |
| Workday | Machine Learning Engineer III / Senior Machine Learning Engineer - AI Platform | Enterprise | 9 | Agent orchestration · RAG · Vector DB · Fine-tuning · Evals · LLM observability · Recommender systems · Search & ranking |
| Software Engineer, AI System Hacker, GenAI, DeepMind | Big Tech | 9 | Agent orchestration · Evals · Embodied AI · Code gen | |
| Forward Deployed Engineer, Generative AI (GenMedia), Google Cloud | Big Tech | 9 | Agent orchestration · RAG · Vector DB · Evals · LLM observability · Tool use | |
| OpenAI | Manager, Forward Deployed Engineering - Munich | AI Frontier | 9 | Inference infra |
| OpenAI | Manager, Forward Deployed Engineering - London | AI Frontier | 9 | |
| Apple | Machine Learning Engineer — Camera & Photos, Creative Foundations | Big Tech | 9 | Vision · Multimodal · Fine-tuning · Frontier research · Interpretability |
| Decagon | Staff Research Engineer | Vertical AI | 9 | Agent orchestration · Fine-tuning · LLM observability · RAG · Evals |
| Anthropic | Research Engineer, RL Infrastructure (Knowledge Work) | AI Frontier | 9 | Evals · LLM observability · Inference infra · RL post-training · Agent orchestration |
| Decagon | Senior Research Engineer | Vertical AI | 9 | Agent orchestration · Fine-tuning · LLM observability · RAG · Evals |
| Apple | AIML - Applied Research Engineer, Machine Translation | Big Tech | 9 | Fine-tuning · RL post-training · Multimodal · Audio & speech |
| Databricks | Principal Research Scientist – Scaling | Data AI | 9 | Pretraining · Fine-tuning · Inference infra |
| Forward Deployed Engineer, Generative AI, Google Cloud | Big Tech | 9 | Agent orchestration · RAG · Fine-tuning | |
| Synthesia | Staff Research Engineer - Video Post Training | Multimodal | 9 | Fine-tuning |
| Amazon | Applied Scientist II, Alexa International | Big Tech | 9 | Frontier research · RL post-training · Multimodal · Audio & speech · Vision · LLM observability · Evals · Fine-tuning |