5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Canva | Engineering Manager (BE) - AI Media Platform | Enterprise | 8 | Inference infra · Multimodal |
| | Senior Software Engineer, AI/ML, Search Growth | Big Tech | 8 | Recommender systems · Search & ranking · Inference infra · Fine-tuning · LLM observability · Multimodal |
| LangChain | Solutions Architect (Remote) | Data AI | 8 | Agent orchestration · Inference infra · RAG · Vector DB · Evals |
| Modal | Member of Technical Staff - ML Performance | Data AI | 8 | Inference infra |
| Amazon | Senior Manager, Science and BI Lead, WWOS Tech | Big Tech | 8 | |
| Amazon | Sr. Applied Scientist, Special Projects | Big Tech | 8 | Inference infra · Frontier research |
| Visa | Staff ML Scientist | Fintech | 8 | Fine-tuning · Evals |
| NVIDIA | Senior AI-Native Systems Software Engineer, TensorRT | Semiconductors | 8 | Agent orchestration · Agent research · Multimodal · Inference infra · Code gen · Vision · Audio & speech |
| Intel | Principal Engineer – Distributed AI Systems Architecture (Heterogeneous Compute) | Semiconductors | 8 | Inference infra |
| GEICO | Senior Staff Machine Learning Engineer | Insurance | 8 | Agent orchestration · Tool use · Fine-tuning |
| Visa | Senior Machine Learning Scientist | Fintech | 8 | Fine-tuning · Evals |
| NVIDIA | Senior Performance Engineer - LLM Inference Frameworks | Semiconductors | 8 | Inference infra · Quantization |
| Uber | Sr Software Engineer | Consumer | 8 | Recommender systems · Search & ranking · Inference infra |
| OpenAI | Performance Modeling Lead | AI Frontier | 8 | Inference infra |
| Opendoor | Applied Scientist - ML/AI | Consumer | 8 | Multimodal · Recommender systems · Interpretability |
| | Software Engineer III, AI/ML, Google Cloud | Big Tech | 8 | Inference infra · Multimodal · Vision |
| | Forward Deployed Architect, Generative AI, Google Cloud | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Inference infra · Evals · LLM observability · Fine-tuning · Multimodal |
| OpenAI | AI Deployment Engineering Manager, Digital Natives | AI Frontier | 8 | |
| | Staff Software Engineer, Games, Inception, DeepMind | Big Tech | 8 | Agent orchestration · Inference infra |
| | Senior Staff ML Engineer, Search & Recommendation | Consumer | 8 | Recommender systems · Search & ranking · Inference infra · RAG · LLM observability · Fine-tuning |
| | Software Engineer III, AI/ML GenAI, YouTube | Big Tech | 8 | Multimodal · Vision · Audio & speech · Inference infra |
| NVIDIA | OEM Solutions Architect - AI Full Stack Public Sector | Semiconductors | 8 | Inference infra · Fine-tuning |
| Capital One | Principal Associate, Data Scientist - LLM Customization Team | Banking | 8 | Fine-tuning · RAG · Vector DB |
| Microsoft | Customer Experience Program Manager | Big Tech | 8 | Agent orchestration · Fine-tuning |
| | VP, Ads Quality Engineering | Consumer | 8 | Recommender systems · Search & ranking |
| Warner Bros Discovery | Sr. Staff, Data Science & Applied AI | Media | 8 | Agent orchestration · RAG · Evals · Guardrails · LLM observability |
| NVIDIA | AI Computing Development Engineer, TensorRT-LLM | Semiconductors | 8 | Inference infra · Fine-tuning |
| HeyGen | Forward Deployed Engineer, Strategic Accounts | Multimodal | 8 | Agent orchestration · Evals |
| | Software Engineer III, AI/ML, Google Cloud Storage | Big Tech | 8 | Agent orchestration |
| SoFi | Director, AI Platforms | Fintech | 8 | Inference infra · Agent orchestration · RAG · Evals · LLM observability · Guardrails |