5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| | Software Engineer III, AI/ML GenAI, Google Cloud Data Management | Big Tech | 8 | Inference infra · Vision · Multimodal |
| Sierra | Software Engineer, Agent (Arabic speaking) | AI Frontier | 8 | Agent orchestration · Evals · RAG · Agent research |
| Klaviyo | Engineering Manager, Customer Agent | Enterprise | 8 | Agent orchestration · LLM observability |
| Tenstorrent | ML Engineer, AI Models | Semiconductors | 8 | Inference infra · Fine-tuning · Vision · Recommender systems |
| Scale AI | Senior Forward Deployed Data Scientist/Engineer | Data AI | 8 | Evals · LLM observability |
| Amazon | Sr Manager, Applied Science, Alexa Connections | Big Tech | 8 | Fine-tuning |
| GEICO | Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability · Inference infra |
| Uber | Senior Machine Learning Engineer - Applied AI | Consumer | 8 | Inference infra · Fine-tuning |
| Anduril | Senior Machine Learning/MLOps Engineer | Defense | 8 | Inference infra · RAG · Vector DB · LLM observability · Vision |
| HeyGen | Software Engineer, AI Compute Infrastructure | Multimodal | 8 | Inference infra · LLM observability |
| Microsoft | Research Intern - AI/ML Numerics & Efficiency | Big Tech | 8 | Inference infra · Quantization |
| Amazon | Applied Scientist II, Amazon Smart Vehicles | Big Tech | 8 | LLM observability · Multimodal |
| Capital One | Senior Manager, Data Scientist - US Card (Generative AI Systems) | Banking | 8 | Vision · Multimodal · Fine-tuning · Evals |
| NVIDIA | Deep Learning Performance Architect | Semiconductors | 8 | Inference infra · Vision · Audio & speech |
| Decagon | Staff Software Engineer, Voice Agent | Vertical AI | 8 | Audio & speech · LLM observability · Inference infra |
| Amazon | Sr. AI Process Engineer, Seller Compliance | Big Tech | 8 | Inference infra |
| Microsoft | Member of Technical Staff, LLM Inference - MAI Superintelligence Team | Big Tech | 8 | Inference infra |
| Samsara | Staff Machine Learning Engineer - Edge AI | Enterprise | 8 | Inference infra · Multimodal |
| Cohere | Member of Technical Staff, Synthetic Data | AI Frontier | 8 | Synthetic data · Inference infra · LLM observability |
| Microsoft | Research Intern - AI Systems & Architecture | Big Tech | 8 | Inference infra |
| Snorkel AI | Applied AI Engineer - Federal (TS Required) | Data AI | 8 | Agent orchestration · RAG · Fine-tuning · Evals · Vector DB · Synthetic data |
| LangChain | Product Manager, LangSmith | Data AI | 8 | Evals · LLM observability · Agent orchestration |
| Cerebras | Performance & Reliability Engineer | Semiconductors | 8 | Inference infra |
| Datadog | Senior AI Engineer - APM Experiences | Enterprise | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG |
| Microsoft | Principal Applied Scientist | Big Tech | 8 | Agent orchestration · Evals · RL post-training · Fine-tuning · LLM observability |
| Amazon | Principal Applied Scientist, Sponsored Products and Brands | Big Tech | 8 | Fine-tuning · Recommender systems · Search & ranking · RAG · RLHF |
| Amazon | Applied Science Manager III, RBKS AI | Big Tech | 8 | Multimodal · Fine-tuning · Inference infra |
| Roblox | [2026] Senior Machine Learning Engineer, AI Platform - PhD Early Career | Consumer | 8 | Inference infra · Fine-tuning · RAG · Agent orchestration |
| Wix | Senior Server Engineer - AI Chatbot | Enterprise | 8 | LLM observability · RAG · Fine-tuning · Agent orchestration |
| Uber | Senior Machine Learning Engineer | Consumer | 8 | Inference infra |