5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Mistral AI | Applied AI, Forward Deployed Machine Learning Engineer - Palo Alto | AI Frontier | 9 | Fine-tuning · RAG · Vector DB · Agent orchestration |
| Writer | Security engineer, detection and response (UK) | AI Frontier | 9 | Inference infra · Guardrails · LLM observability |
| Forward Deployed Engineer, Generative AI, Google Cloud | Big Tech | 9 | Agent orchestration · RAG · Inference infra · Evals · LLM observability · Tool use | |
| JPMorgan Chase | Senior Lead Software Engineer- Java/Python/ AI Solutions | Banking | 9 | Agent orchestration · Tool use · LLM observability · RAG · Vector DB · Fine-tuning · Agent research |
| Mistral AI | Applied AI, Evaluation Engineer | AI Frontier | 9 | Evals · LLM observability · Agent research · Fine-tuning |
| Disney | Sr Staff R&D Engineer | Media | 9 | Audio & speech · Fine-tuning · Multimodal |
| NVIDIA | Senior Software Engineer - AI Inference | Semiconductors | 9 | Inference infra |
| NVIDIA | Senior Software Engineer, RAG and Agentic AI | Semiconductors | 9 | RAG · Agent orchestration · Tool use · Inference infra · Multimodal |
| Adobe | Senior Machine Learning Engineer - Firefly | Enterprise | 9 | Fine-tuning · RL post-training · Multimodal · Evals · LLM observability |
| LangChain | Solutions Architect (London) | Data AI | 9 | Agent orchestration · Inference infra · RAG · Vector DB · Evals |
| Airbnb | Senior Staff Machine Learning Engineer, Growth Platform Engineering | Consumer | 9 | Agent orchestration · Inference infra |
| Cohere | Manager of Technical Staff, Sovereign AI | AI Frontier | 9 | Frontier research · Pretraining |
| OpenAI | Software Engineer, Foundations Retrieval | AI Frontier | 9 | RAG · Vector DB · Agent orchestration · Inference infra · LLM observability |
| Meta | Software Engineer, AI Specialist - Monetization (Technical Leadership) | Big Tech | 9 | Inference infra · Recommender systems · Search & ranking · Frontier research |
| NVIDIA | Senior Solutions Architect, Autonomous Vehicles - Data Center | Semiconductors | 9 | Inference infra · Vision |
| NVIDIA | Solutions Architect, Model Builder - LATAM | Semiconductors | 9 | Agent orchestration · Tool use · Fine-tuning · RAG · Inference infra · LLM observability |
| Staff AI Research Engineer, Large User Models | Big Tech | 9 | Pretraining · Recommender systems · Frontier research | |
| Adobe | Staff Machine Learning Engineer/Architect– Agentic AI & Personalization | Enterprise | 9 | Agent orchestration · Recommender systems · Search & ranking · Inference infra · LLM observability |
| Physical Intelligence | Robotics Research Engineer | AI Frontier | 9 | Embodied AI · Vision · Multimodal · Synthetic data · RL robotics |
| Perplexity | Engineering Manager (AI Research & Model Training) | AI Frontier | 9 | Fine-tuning · RL post-training · Frontier research · Evals |
| Perplexity | Engineering Manager (AI Inference) | AI Frontier | 9 | Inference infra · LLM observability · Quantization |
| Perplexity | Member of Technical Staff (AI Infrastructure Engineer) | AI Frontier | 9 | Inference infra |
| Perplexity | Member of Technical Staff (AI Inference Engineer) | AI Frontier | 9 | Inference infra · LLM observability · Multimodal |
| Perplexity | Member of Technical Staff (AI Researcher) | AI Frontier | 9 | Fine-tuning · RL post-training · Frontier research · Agent research · Agent orchestration · LLM observability |
| Staff Forward Deployed Developer, GenAI, Google Cloud | Big Tech | 9 | Agent orchestration · Tool use · Evals · LLM observability | |
| Staff Software Engineer, AI/ML GenAI, Google Cloud AI | Big Tech | 9 | Inference infra · Fine-tuning · Evals · Vision · Multimodal | |
| Forward Deployed Engineer IV, GenAI, Google Cloud | Big Tech | 9 | Agent orchestration · Tool use · Inference infra · LLM observability · Guardrails | |
| Perplexity | Member of Technical Staff (AI Software Engineer, Agents) | AI Frontier | 9 | Agent orchestration · Tool use · Multimodal · RL post-training · Frontier research |
| Sierra | Software Engineer, Agent | AI Frontier | 9 | Agent orchestration · Agent research · RAG · Evals · Audio & speech |
| Forward Deployed Engineer IV, GenAI, Google Cloud | Big Tech | 9 | Agent orchestration · Tool use · Evals · LLM observability |