5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Agent orchestration |
| Sierra | Software Engineer, Agent | AI Frontier | 8 | Agent orchestration · Evals · RAG · LLM observability |
| Moveworks | Principal Product Manager, Search Platform | Enterprise | 8 | Agent orchestration · Semantic search · Audio & speech · Inference infra |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Inference infra · Guardrails · Vector DB · RAG · LLM observability · Fine-tuning |
| Sentry | Senior Software Engineer, AI | Enterprise | 8 | Agent orchestration · Inference infra |
| Cerebras | Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai | Semiconductors | 8 | Inference infra |
| Hightouch | Software Engineer, AI Agents | Data AI | 8 | Agent orchestration · Agent research · LLM observability · RAG · Vector DB · Fine-tuning |
| Amazon | Sr. Software Engineer- AI/ML, AWS Neuron Apps | Big Tech | 8 | Inference infra · Multimodal |
| OpenAI | Solutions Engineer- Startups | AI Frontier | 8 | Agent orchestration · RAG · Fine-tuning |
| Amazon | Sr. Applied Scientist, SSG Science | Big Tech | 8 | Fine-tuning · Inference infra · Quantization · Distillation |
| Amazon | Applied Scientist II, Strategic Account Services (SAS) | Big Tech | 8 | Evals |
| Capital One | Manager, Data Science - AI Foundations | Banking | 8 | Fine-tuning · LLM observability · RAG · Vector DB |
| Cohere | Senior/Staff Full-Stack Engineer | AI Frontier | 8 | Agent orchestration · RAG · Inference infra |
| Perplexity | Engineering Site Lead | AI Frontier | 8 | Inference infra |
| OpenAI | AI Deployment Engineer, Startups | AI Frontier | 8 | Agent orchestration |
| Dropbox | Senior Engineering Manager, Core Media & Intelligence | Enterprise | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Fine-tuning · Recommender systems · Search & ranking · Vision · Multimodal |
| Roblox | Senior AI Platform Engineer - Agentic Systems | Consumer | 8 | Agent orchestration · RAG · Vector DB |
| Nuro | Software Engineer, AI Platform - Intern | Robotics | 8 | Inference infra |
| Roblox | Principal Machine Learning Engineer, Communication Safety | Consumer | 8 | LLM observability · Inference infra · Multimodal · Data pipeline |
| Samsara | Senior Machine Learning Engineer - Edge AI | Enterprise | 8 | Multimodal · Inference infra · Quantization · Distillation |
| Klaviyo | Senior AI Engineer | Enterprise | 8 | Agent orchestration · Fine-tuning · Evals · Inference infra |
| NVIDIA | Senior System Software Architect, HPC and AI Networking | Semiconductors | 8 | Inference infra |
| Amazon | Software Engineer- AI/ML, AWS Neuron | Big Tech | 8 | Pretraining · Inference infra |
| ZoomInfo | Senior Product Manager, Context Engineering | Enterprise | 8 | RAG · Vector DB · Agent orchestration · Evals · LLM observability |
| HeyGen | Tech Lead, AI Compute Infrastructure | Multimodal | 8 | Inference infra · Multimodal |
| Uber | Engineering Manager II, Marketplace Pricing | Consumer | 8 | Recommender systems |
| Roblox | Distinguished Engineer, Machine Learning Systems – Economy | Consumer | 8 | Recommender systems · Search & ranking · Inference infra · RAG · Agent orchestration · LLM observability · Multimodal |
| Apptronik | Senior Perception Learning Engineer | Robotics | 8 | Vision · Inference infra · Multimodal |
| Baseten | Software Engineer - Model APIs | Data AI | 8 | Inference infra · Tool use · Multimodal |
| Scale AI | Tech Lead Manager- MLRE, ML Systems | Data AI | 8 | Fine-tuning · RL post-training · Inference infra |