5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | Senior High-Performance System Architect | Semiconductors | 8 | Inference infra |
| Walmart | Distinguished, Data Scientist | Retail | 8 | Search & ranking · Recommender systems · Agent orchestration · Tool use · LLM observability · RAG · Inference infra |
| Cresta | Forward Deployed Engineering Manager | Vertical AI | 8 | Agent orchestration |
| Axon | Staff AI Embedded Software Engineer - Connected Devices | Enterprise | 8 | Fine-tuning · Multimodal |
| Cresta | Senior Forward Deployed Engineer (AI Agent) - UK | Vertical AI | 8 | Agent orchestration · RAG · Tool use · LLM observability |
| Microsoft | Senior Researcher - GPU Performance | Big Tech | 8 | Inference infra |
| NVIDIA | Manager, Deep Learning Algorithms | Semiconductors | 8 | Inference infra |
| Roblox | Principal Machine Learning Engineer, Alt Defense | Consumer | 8 | Agent orchestration · Inference infra |
| Instacart | Senior Machine Learning Engineer II, Search & Recommendations Ranking | Consumer | 8 | Recommender systems · Search & ranking |
| Weights & Biases | AI Solutions Engineer, Pre-Sales- W&B | Data AI | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Fine-tuning · Inference infra · LLM observability |
| Software Engineer, GKE, PhD, Early Careers | Big Tech | 8 | Agent orchestration · Agent research · Inference infra | |
| Capital One | Senior Lead AI Engineer (FM Hosting, LLM Inference) | Banking | 8 | Inference infra · LLM observability · Guardrails · Vector DB · Fine-tuning |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Inference infra · Guardrails · LLM observability |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Inference infra · Guardrails · LLM observability |
| Walmart | Senior Data Scientists, Conversational AI | Retail | 8 | Fine-tuning · Evals · Multimodal |
| Walmart | Staff Data Scientists, Conversational AI | Retail | 8 | Agent orchestration · Fine-tuning · Multimodal · LLM observability |
| Walmart | Expert Data Scientists, Conversational AI | Retail | 8 | Agent orchestration · Evals · LLM observability |
| OpenAI | Software Engineer, Platform Systems | AI Frontier | 8 | LLM observability · Inference infra |
| Cresta | Senior Machine Learning Engineer Automatic Speech Recognition (ASR) | Vertical AI | 8 | Audio & speech · Evals · Fine-tuning |
| Capital One | Lead AI Engineer (FM Hosting, LLM Inference) | Banking | 8 | Inference infra · LLM observability · Guardrails · Vector DB |
| Decagon | Product Manager, Voice Agent | Vertical AI | 8 | Agent orchestration · Audio & speech · LLM observability |
| Apptronik | Senior Autonomy Software Engineer | Robotics | 8 | Embodied AI · Agent orchestration · Multimodal · Inference infra · Guardrails |
| Anthropic | Forward Deployed Engineer, Federal Civilian | AI Frontier | 8 | Agent orchestration · Tool use · Evals |
| Bank of America | Artificial Intelligence Senior Security Engineer | Banking | 8 | Agent orchestration · LLM observability · Fine-tuning · Evals · Guardrails |
| Microsoft | Applied Scientist - Core AI Speech | Big Tech | 8 | Audio & speech · Multimodal · Fine-tuning · Evals |
| Moveworks | Software Engineer, Agentic AI Systems | Enterprise | 8 | Agent orchestration · LLM observability · Multimodal |
| Moveworks | Staff Software Engineer, Agentic AI Systems | Enterprise | 8 | Agent orchestration · Tool use · LLM observability · Multimodal |
| Moveworks | Staff Software Engineer, Agentic AI Systems | Enterprise | 8 | Agent orchestration · LLM observability |
| Moveworks | Senior Software Engineer I, Agentic AI Product | Enterprise | 8 | Agent orchestration · Inference infra · LLM observability |
| Glean | Founding Forward Deployed Engineer | Enterprise | 8 | Agent orchestration · Agent research · LLM observability |