5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Anthropic | Staff Software Engineer, AI Reliability Engineering | AI Frontier | 8 | Inference infra · LLM observability |
| Netflix | Research Engineer 4/5 – AI for Member Systems | Big Tech | 8 | Recommender systems · Fine-tuning · Evals · LLM observability |
| Scale AI | Applied AI Engineer, Enterprise GenAI | Data AI | 8 | Agent orchestration · Multimodal · Tool use |
| Glean | Software Engineer, AI Infrastructure | Enterprise | 8 | Inference infra · Agent orchestration · LLM observability |
| Cohere | Member of Technical Staff, MLE (Korea) | AI Frontier | 8 | RAG · Fine-tuning · Inference infra |
| Cohere | Senior Member of Technical Staff, MLE (Middle East) | AI Frontier | 8 | Agent orchestration · RAG · Inference infra |
| Black Forest Labs | Member of Technical Staff - ML Infrastructure Engineer | Multimodal | 8 | Inference infra |
| Baseten | Engineering Manager - Model Performance | Data AI | 8 | Inference infra |
| Cresta | Machine Learning Engineering Intern | Vertical AI | 8 | RAG · Fine-tuning |
| Moveworks | Senior Software Engineer II, Agentic AI Platform | Enterprise | 8 | Agent orchestration · LLM observability |
| Tenstorrent | Sr. Engineer, Software - AI Compiler | Semiconductors | 8 | Inference infra |
| Together AI | Machine Learning Engineer - Inference | Data AI | 8 | Inference infra |
| Anthropic | Software Engineer | AI Frontier | 8 | Inference infra |
| Jane Street | Machine Learning Performance Engineer | Quant | 8 | Inference infra |
| Jane Street | Machine Learning Performance Engineer | Quant | 8 | Inference infra |
| Joby Aviation | Senior AI Engineer | Robotics | 8 | Agent orchestration · Inference infra · RAG · Vector DB · LLM observability · Guardrails · Evals |
| Palantir | Forward Deployed AI Engineer | Enterprise | 8 | Agent orchestration · Fine-tuning · Evals · RAG |
| Scale AI | Applied AI Engineer, Global Public Sector | Data AI | 8 | Agent orchestration · Fine-tuning · Evals |
| Anthropic | Performance Engineer | AI Frontier | 8 | Inference infra |
| Baseten | Software Engineer - Model Performance | Data AI | 8 | Inference infra · Fine-tuning · Quantization |
| Palantir | Forward Deployed AI Engineer | Enterprise | 8 | Agent orchestration · Fine-tuning · Evals · RAG |
| Databricks | Sr. Machine Learning Engineer | Data AI | 8 | Agent orchestration · Fine-tuning · RAG |
| ByteDance | Tech Lead Manager, Large Language Models & Generative AI | Big Tech | 8 | Recommender systems · LLM observability · RAG |
| HeyGen | Research Engineer | Multimodal | 8 | Vision · Multimodal |
| Databricks | Senior Machine Learning Engineer - GenAI Platform | Data AI | 8 | Inference infra |
| Meta | Software Engineer, Systems ML | Big Tech | 8 | Inference infra · Recommender systems · Search & ranking · Agent orchestration |
| Figure AI | Perception / Computer Vision Software Engineer - Helix Team | Robotics | 8 | Vision · Multimodal · Fine-tuning · Evals |
| Glean | Software Engineer, Machine Learning | Enterprise | 8 | Recommender systems · Search & ranking · Agent orchestration · Fine-tuning |
| Glean | Machine Learning Engineer, Search Quality | Enterprise | 8 | Recommender systems · Search & ranking · Fine-tuning · Agent orchestration · RAG |
| Tesla | Power Optimization Engineer, AI Hardware | Auto | 7 | Inference infra |