5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| | Senior Software Engineer, Kernels and Performance, Core ML Frameworks | Big Tech | 8 | Inference infra |
| Amazon | Applied Scientist, Amazon Prime, Prime AI/ML Science | Big Tech | 8 | Recommender systems |
| Nordstrom | Principal Product Manager - Inventory Intelligence (Hybrid - Seattle) | Retail | 8 | Agent orchestration |
| NVIDIA | SOC AI Application Engineer — AI Services, Agents and Knowledge Systems | Semiconductors | 8 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability · Code gen |
| NVIDIA | Senior Architect - Server Performance | Semiconductors | 8 | Inference infra |
| Workday | Senior/Principal Machine Learning Engineer | Enterprise | 8 | Agent orchestration · RAG · LLM observability · Evals |
| Workday | Machine Learning Engineer III / Senior Machine Learning Engineer - AI Platform | Enterprise | 8 | Agent orchestration · Tool use · RAG · LLM observability · Evals |
| F5 | Principal AI Engineer | Enterprise | 8 | Agent orchestration · Tool use · RAG · Agent research · LLM observability · Guardrails · Inference infra |
| F5 | Principal AI Engineer | Enterprise | 8 | Agent orchestration · Tool use · RAG · Agent research · LLM observability · Guardrails |
| F5 | Principal AI Engineer | Enterprise | 8 | Agent orchestration · Tool use · RAG · Agent research · LLM observability · Guardrails · Vector DB · Inference infra |
| F5 | AI Inference Engineer | Enterprise | 8 | Inference infra · LLM observability |
| NVIDIA | Solutions Architect, Inference Deployments | Semiconductors | 8 | Inference infra |
| NVIDIA | Solutions Architect, Agentic AI | Semiconductors | 8 | Agent orchestration · Agent research · Fine-tuning · Inference infra · Evals · Guardrails · Multimodal · Code gen |
| NVIDIA | Senior Solutions Architect, Generative AI | Semiconductors | 8 | Inference infra · Recommender systems |
| Snorkel AI | Senior Software Engineer - AI / ML | Data AI | 8 | Synthetic data · Agent orchestration · RL post-training · LLM observability · Evals |
| JPMorgan Chase | Data Scientist Lead - Vice President | Banking | 8 | RAG · Agent orchestration · Fine-tuning · LLM observability |
| | Senior Software Engineer, Machine Learning, Vertex AI | Big Tech | 8 | Fine-tuning · Multimodal · Vision |
| Freshworks | Lead - Data Scientist | Enterprise | 8 | Agent orchestration · LLM observability · RAG · Fine-tuning |
| JPMorgan Chase | Applied AI & ML Lead – Markets Operations | Banking | 8 | Agent orchestration · RAG |
| | AI Engineer, Google Cloud Consulting (English, French) | Big Tech | 8 | RAG · Vector DB · Fine-tuning |
| Cribl | Staff AI Platform Engineer, Corporate AI Systems | Enterprise | 8 | Agent orchestration · Inference infra · Guardrails · LLM observability |
| Databricks | Senior Specialist Solutions Architect - AI & ML Engineer | Data AI | 8 | Agent orchestration · Tool use · Guardrails · RAG · Vector DB · Evals · LLM observability · Inference infra |
| Intercom | Engineering Manager, AI Models Infrastructure | Enterprise | 8 | Inference infra |
| Intercom | Engineering Manager, AI Models Infrastructure | Enterprise | 8 | Inference infra |
| Unity | Principal Machine Learning Engineer, Mobile AI Inference Optimization | Enterprise | 8 | Inference infra · Quantization · Multimodal |
| Sierra | Product Manager, Voice | AI Frontier | 8 | Audio & speech · Inference infra · LLM observability |
| Microsoft | Principal Software Engineer | Big Tech | 8 | Inference infra |
| Microsoft | Principal Product Manager - Foundry Inferencing & Training (CoreAI - multiple roles) | Big Tech | 8 | Inference infra · Training infra |
| Amazon | Applied Scientist, AGI Customization Services | Big Tech | 8 | Fine-tuning · RL post-training · Evals |
| Amazon | Applied Scientist, Mobile Manipulation Robotics (I/O) | Big Tech | 8 | Embodied AI · Fine-tuning · Evals |