5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Software Engineer III, Tensor Processing Units, AI/ML | Big Tech | 8 | Inference infra · Fine-tuning · Vision · Audio & speech · Recommender systems | |
| xAI | Backend Engineer - API | AI Frontier | 8 | Inference infra · LLM observability · Agent orchestration |
| Senior Staff Machine Learning Engineer, ML Understanding | Consumer | 8 | Recommender systems | |
| Celonis | Applied Engineer (Solution Consultant) - Supply Chain | Data AI | 8 | Agent orchestration · Tool use · Guardrails · RAG · LLM observability · Fine-tuning |
| JPMorgan Chase | Director - Applied AI ML (Software Engineering/Data & Agentic Systems) | Banking | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · LLM observability |
| Honeywell | Sr Advanced Software Engr | Industrial | 8 | Inference infra · RAG |
| Microsoft | Senior Researcher - Efficient AI | Big Tech | 8 | Inference infra · Quantization |
| Cloudflare | Software Engineer, AI Agents | Enterprise | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Agent research |
| Apple | Machine Learning Engineer | Big Tech | 8 | Agent orchestration · Tool use · LLM observability · Inference infra |
| Amazon | Machine Learning Engineer , Data & Machine Learning (DML) | Big Tech | 8 | Fine-tuning |
| NVIDIA | Machine Learning Intern - 2026 | Semiconductors | 8 | Inference infra |
| Johnson & Johnson | Principal AI Lead – Surgical AI | Pharma | 8 | Agent orchestration |
| NVIDIA | Senior Autonomous Driving Software Engineer, L4 Planning | Semiconductors | 8 | Embodied AI · Inference infra |
| Salesforce | Forward Deployed Engineer (Multiple Levels) <<based in South Korea>> | Enterprise | 8 | Agent orchestration · Agent research · LLM observability · RAG · Vector DB · Fine-tuning |
| Capital One | Senior Lead AI Engineer, AI Foundations | Banking | 8 | Inference infra · Fine-tuning · Guardrails · Vector DB · LLM observability · Evals |
| Capital One | Lead AI Engineer, AI Foundations | Banking | 8 | Inference infra · Fine-tuning · Guardrails · Vector DB · RAG · LLM observability |
| Capital One | Senior Lead AI Engineer (Gen AI Platform Services) | Banking | 8 | Inference infra · Fine-tuning · Guardrails · Vector DB · RAG · LLM observability · Evals |
| Snap | Machine Learning Engineer, CV | Consumer | 8 | Inference infra |
| NVIDIA | Deep Learning Architect, LLM Inference - New College Grad 2026 | Semiconductors | 8 | Inference infra · LLM observability · Agent orchestration · Tool use |
| NVIDIA | Senior Deep Learning Scientist, Speech Synthesis | Semiconductors | 8 | Audio & speech · Fine-tuning · Evals |
| Adobe | Senior Manager, Machine Learning | Enterprise | 8 | Agent orchestration |
| OpenAI | SOC Architect | AI Frontier | 8 | Inference infra |
| Whatnot | Software Engineer, Machine Learning Infrastructure | Consumer | 8 | Inference infra |
| Apple | Staff Machine Learning Engineer | Big Tech | 8 | Inference infra · RAG |
| Software Engineer III, AI/ML GenAI, Google Ads | Big Tech | 8 | Inference infra · Multimodal · Vision · Audio & speech · Code gen | |
| UiPath | Principal Forward Deployed Engineering Manager | Enterprise | 8 | Agent orchestration · Agent research · Evals · Fine-tuning · Inference infra · LLM observability |
| UiPath | Director, Forward Deployed Engineering | Enterprise | 8 | Agent orchestration · Evals |
| Software Engineer III, AI/ML, YouTube Shopping | Big Tech | 8 | Fine-tuning · Evals · Recommender systems | |
| Dropbox | Senior Machine Learning Engineer, Dash Agentic AI | Enterprise | 8 | Agent orchestration · Agent research · Tool use · RAG · LLM observability · Inference infra · Recommender systems · Search & ranking · Fine-tuning |
| Staff Software Engineer, AI/ML, Agent Assist | Big Tech | 8 | Agent orchestration · Tool use · Inference infra · Audio & speech · RL robotics |