5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Suki AI | Senior Manager (AI Engineering) | Vertical AI | 8 | LLM observability · RAG · Agent orchestration · Inference infra |
| Roblox | Senior Machine Learning Engineer, Economy | Consumer | 8 | Recommender systems · Search & ranking · Fine-tuning · Inference infra · Evals · Guardrails · Vision |
| Cresta | Senior Forward Deployed Engineer (AI Agent) | Vertical AI | 8 | Agent orchestration · Tool use · RAG · LLM observability |
| Cloudflare | Senior Machine Learning Engineer | Enterprise | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Inference infra · Fine-tuning · Guardrails · LLM observability · Vector DB · Agent orchestration |
| Capital One | Sr. Lead AI Engineer | Banking | 8 | Inference infra · Guardrails · Vector DB · Fine-tuning · LLM observability · Evals |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Evals · Agent orchestration |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · Evals · LLM observability · Agent orchestration |
| Capital One | Senior Lead AI Engineer (LLM Customization and Finetuning) | Banking | 8 | Fine-tuning · Inference infra · Guardrails · Vector DB · LLM observability · Evals |
| Workday | Machine Learning Engineer - Evisort | Enterprise | 8 | RAG |
| Databricks | Staff Machine Learning Engineer | Data AI | 8 | Fine-tuning · RAG · Evals |
| Anthropic | Staff Software Engineer, Inference | AI Frontier | 8 | Inference infra · LLM observability |
| Amazon | Senior Software Development Engineer, GenAI, Ads Agentic Intelligence | Big Tech | 8 | Agent orchestration · LLM observability |
| DocuSign | Senior Machine Learning Engineer | Enterprise | 8 | Agent orchestration · Agent research · RL post-training · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra |
| Snowflake | Principal Machine Learning Engineer - Search Quality | Data AI | 8 | Search & ranking · Recommender systems · RAG · Vector DB · Agent orchestration · Tool use · Evals · LLM observability · Inference infra |
| Meta | Research Engineer, Monetization AI | Big Tech | 8 | Recommender systems · Fine-tuning |
| Amazon | Software Development Engineer (ML), AGI Customization | Big Tech | 8 | Fine-tuning · Multimodal · Inference infra |
| Amazon | Machine Learning Engineer II, AGI Customization | Big Tech | 8 | Fine-tuning · Evals · Multimodal |
| Datadog | Staff AI Engineer - Notebooks | Enterprise | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Fine-tuning |
| Datadog | Staff AI Engineer - Notebooks | Enterprise | 8 | Agent orchestration · Tool use · RAG · Guardrails · Evals · Fine-tuning · Inference infra |
| Cresta | Senior Machine Learning Engineer - Automatic Speech Recognition (ASR) | Vertical AI | 8 | Audio & speech · Evals · Fine-tuning |
| | Senior Software Engineer, AI/ML GenAI, Google Cloud Compute | Big Tech | 8 | Multimodal · Vision · Inference infra |
| NVIDIA | Senior HPC and AI Networking Performance Research and Analysis Engineer | Semiconductors | 8 | Pretraining · Inference infra |
| Microsoft | Research Intern - Applied Sciences Group | Big Tech | 8 | Fine-tuning · Inference infra · Frontier research |
| Amazon | Applied Scientist II, Amazon Payment Products (L5) | Big Tech | 8 | Agent orchestration · LLM observability · Inference infra |
| Amazon | Applied Scientist, Delivery Foundation Model | Big Tech | 8 | Multimodal · Frontier research · Pretraining · Fine-tuning · Inference infra |
| Netflix | Tech Lead Manager, GenAI Sandbox & Tooling (AI Foundations) | Big Tech | 8 | Agent orchestration · Tool use · LLM observability |
| Autodesk | Software Architect | Enterprise | 8 | Agent orchestration · Agent research · Tool use · Inference infra · RAG · Vector DB · LLM observability · Guardrails |
| NVIDIA | Architect, AI Solutions Engineering | Semiconductors | 8 | Agent orchestration · RAG · Fine-tuning · Inference infra |