5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Intercom | Staff AI Product Manager | Enterprise | 8 | Agent orchestration · Evals |
| Microsoft | Research Intern - LLM Acceleration | Big Tech | 8 | Inference infra |
| Upstart | Staff+ Machine Learning Engineer | Fintech | 8 | Inference infra |
| Microsoft | Research Intern - Systems For Efficient AI | Big Tech | 8 | Inference infra |
| Amazon | Software Development Engineer, AI/ML, AWS Neuron, Model Inference | Big Tech | 8 | Inference infra · Fine-tuning |
| Amazon | Sr. Manager, Applied Science, Sponsored Products and Brands | Big Tech | 8 | Multimodal |
| PitchBook | Machine Learning Engineer | Fintech | 8 | LLM observability · RAG · Fine-tuning |
| Sierra | Software Engineer, Voice | AI Frontier | 8 | Audio & speech · Inference infra · Agent orchestration |
| Anduril | Senior Machine Learning Engineer, Sentry Tower | Defense | 8 | Inference infra |
| Amazon | AI Platform Data Engineer, Ring Decisions Sciences Platform | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning |
| NVIDIA | AI Developer Technology Engineer | Semiconductors | 8 | Inference infra |
| OpenAI | Forward Deployed Engineer - London | AI Frontier | 8 | Inference infra · Agent orchestration · LLM observability · Evals |
| Anthropic | Forward Deployed Engineer, Applied AI | AI Frontier | 8 | Agent orchestration · Tool use · Evals · Fine-tuning |
| Software Engineering Manager II, AI/ML, YouTube | Big Tech | 8 | Inference infra · Audio & speech | |
| Amazon | Applied Scientist III, RBKS AI | Big Tech | 8 | Multimodal · Fine-tuning · Inference infra |
| Moveworks | Sr. MLE, GAI Search Platform - JB0070751 | Enterprise | 8 | Recommender systems · Search & ranking · RAG · Agent orchestration · LLM observability · Inference infra |
| Nuro | Technical Lead Manager, Autonomy Evaluation and Intelligence | Robotics | 8 | Evals · Agent research · Embodied AI · Agent orchestration |
| Cohere | Audio Inference Engineer, Model Efficiency | AI Frontier | 8 | Audio & speech · Inference infra · Fine-tuning |
| Stripe | Machine Learning Engineer, Supportability | Fintech | 8 | Agent orchestration · LLM observability · Evals |
| Fireworks AI | Solutions Architect | Data AI | 8 | Inference infra · Fine-tuning · RAG |
| Anthropic | Staff Software Engineer, Inference | AI Frontier | 8 | Inference infra · LLM observability |
| Tenstorrent | Performance Architect, AI HW | Semiconductors | 8 | Inference infra |
| Apple | Senior Machine Learning Engineer - AI, Search & Knowledge (ML Hub Core) | Big Tech | 8 | Inference infra · RAG · Vector DB · LLM observability |
| Walmart | (USA) Staff, Software Engineer - MLE- Agentic AI & AIOps | Retail | 8 | Agent orchestration · LLM observability · Inference infra |
| Apple | AIML - Sr Backend Engineer, Data and ML Innovation | Big Tech | 8 | Vector DB · Fine-tuning |
| Fireworks AI | Software Engineer, AI Infrastructure | Data AI | 8 | Inference infra |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Inference infra · Guardrails · Vector DB · RAG · Evals · LLM observability · Fine-tuning · Agent orchestration |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Inference infra · Fine-tuning · Guardrails · Vector DB · RAG · Agent orchestration · LLM observability · Evals |
| NVIDIA | Director, Engineering – Software Engineering and AI Inferencing Platforms | Semiconductors | 8 | Inference infra |
| Amazon | Applied Scientist | Big Tech | 8 | Inference infra |