5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| ByteDance | Tech Lead, Software Engineer - AI Agent Memory Infrastructure | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Multimodal · Inference infra · LLM observability |
| Amazon | Senior Applied Scientist, HST Health Evaluation | Big Tech | 8 | Fine-tuning |
| Amazon | Senior ML Engineer, Fauna | Big Tech | 8 | Inference infra · RL robotics · Embodied AI |
| Netflix | Software Engineer 5 – Model Runtime, AI Platform | Big Tech | 8 | RL post-training · Fine-tuning · Inference infra · Multimodal |
| Capital One | Distinguished AI Engineer | Banking | 8 | Guardrails · Evals · Vector DB |
| Walmart | Staff, Software Engineer | Retail | 8 | Agent orchestration · Agent research · Multimodal · RAG · Fine-tuning · LLM observability |
| Adobe | Senior Machine Learning Engineer | Enterprise | 8 | Fine-tuning · LLM observability · RAG · Recommender systems |
| Adobe | Sr. Applied Scientist | Enterprise | 8 | Fine-tuning · Multimodal · Vision |
| ByteDance | Senior Software Engineer - AI Agent Memory Infrastructure | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Inference infra · Multimodal · LLM observability |
| Microsoft | Senior Software Engineer, CoreAI Workload Engines | Big Tech | 8 | Inference infra · LLM observability · Guardrails |
| Microsoft | Principal Software Engineer, CoreAI Workload Engines | Big Tech | 8 | Inference infra · LLM observability · Guardrails |
| Senior Software Engineer, AI/ML, Google Meet | Big Tech | 8 | Vision · Multimodal · Fine-tuning · Evals | |
| Snowflake | Principal Software Engineer - AI Poland | Data AI | 8 | Inference infra |
| Intercom | Engineering Manager, AI Models Infrastructure | Enterprise | 8 | Inference infra |
| Cursor | Engineering Manager, Model Routing & Inference | Coding AI | 8 | Inference infra · Agent orchestration |
| Netflix | Senior ML Engineer, GenAI - Games | Big Tech | 8 | Agent orchestration · Fine-tuning · Inference infra · Code gen · Multimodal |
| NVIDIA | Solutions Architect, Physical AI and Robotics | Semiconductors | 8 | Embodied AI · Synthetic data · Evals · Agent orchestration · Inference infra |
| NVIDIA | Senior Solutions Architect - KV Cache and AI Storage | Semiconductors | 8 | Inference infra |
| NVIDIA | Solutions Architect - Top AI Labs | Semiconductors | 8 | Inference infra |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra |
| Visa | Machine Learning Engineer | Fintech | 8 | Agent orchestration · RAG · Inference infra · Guardrails |
| Walmart | Director, Data Science | Retail | 8 | Recommender systems · Search & ranking · Agent orchestration · RAG · Vector DB · LLM observability · Guardrails |
| Intel | Senior GenAI Software Architect | Semiconductors | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability |
| NVIDIA | Senior Systems Software Engineer, E-commerce AI Platform - GeForce NOW | Semiconductors | 8 | Agent orchestration · Tool use · RAG · LLM observability |
| Adobe | Applied Scientist 5.5 | Enterprise | 8 | Fine-tuning · Multimodal · Vision |
| Staff Software Engineering, YouTube ML Efficiency | Big Tech | 8 | Recommender systems · Inference infra · Fine-tuning · Evals | |
| Samsara | Senior Manager, Safety AI | Enterprise | 8 | Inference infra |
| Samsara | Senior Manager, Safety AI | Enterprise | 8 | Inference infra · Multimodal |
| Nuro | Senior Software Engineer – GenAI Infrastructure & Agent Systems for Engineering Efficiency | Robotics | 8 | Agent orchestration · Tool use · Inference infra · LLM observability |
| Mercury | Senior Software Engineer - AI Engineering | Fintech | 8 | Agent orchestration · RAG · Evals · Guardrails · LLM observability · Inference infra |