5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Glean | Founding Forward Deployed Engineer | Enterprise | 8 | Agent orchestration · Agent research · LLM observability |
| Cerebras | ML Systems Performance Engineer | Semiconductors | 8 | Inference infra |
| Datadog | Senior AI Engineer - Bits AI Security Analyst | Enterprise | 8 | Agent orchestration · Tool use · Guardrails · RAG · LLM observability |
| OpenAI | Software Engineer, Codex Cloud | AI Frontier | 8 | Agent orchestration · Inference infra |
| Amazon | Software Development Engineer III, Annapurna Labs | Big Tech | 8 | Agent orchestration · Inference infra |
| Amazon | Principal Software Engineer, AI Domains, Alexa AI | Big Tech | 8 | Inference infra · Agent orchestration · Multimodal · Vision · LLM observability · Evals · Guardrails |
| NVIDIA | Senior Deep Learning Engineer - AI for Wireless Systems | Semiconductors | 8 | Inference infra · Fine-tuning · Evals |
| NVIDIA | Engineering Manager - AI for RAN and 6G Wireless Systems | Semiconductors | 8 | Inference infra · Fine-tuning · Evals |
| NVIDIA | System Software Engineer - Deep Learning | Semiconductors | 8 | Inference infra · Fine-tuning · Vision |
| Cribl | Staff Software Engineer, Cribl AI | Enterprise | 8 | Fine-tuning · Inference infra |
| Writer | Software engineer, generative AI | AI Frontier | 8 | Agent orchestration · Tool use · RAG · Vector DB · Inference infra |
| Amazon | Applied Scientist II - Gen AI & LLM, PXT | Big Tech | 8 | RAG · Fine-tuning · Evals · Agent orchestration |
| Silicon RTL Design Engineer, PhD, Early Career | Big Tech | 8 | Inference infra | |
| NVIDIA | Senior AI Infrastructure Software Engineer | Semiconductors | 8 | Agent orchestration · Inference infra · RAG · Vector DB · Fine-tuning |
| LangChain | Deployed Engineer (UK) | Data AI | 8 | Agent orchestration · Tool use · LLM observability · Guardrails |
| Microsoft | Principal Applied Scientist | Big Tech | 8 | Code gen · RAG · Evals · Fine-tuning |
| JPMorgan Chase | AWM Risk Analytics Group – Data Scientist - Vice President | Banking | 8 | Fine-tuning · Inference infra · LLM observability · Evals |
| Writer | AI engineer | AI Frontier | 8 | Agent orchestration · LLM observability · Inference infra |
| Anthropic | Applied AI Engineer, Beneficial Deployments | AI Frontier | 8 | Agent orchestration · Evals · LLM observability |
| Moveworks | Senior Software Engineer II, Agentic AI Platform | Enterprise | 8 | Agent orchestration · LLM observability |
| JPMorgan Chase | Agentic Development - Vice President | Banking | 8 | Agent orchestration · Agent research · LLM observability · RAG · Inference infra · Tool use |
| OpenAI | Manager, Forward Deployed Engineering | AI Frontier | 8 | |
| Cohere | Staff Software Engineer, GPU Infrastructure (HPC) | AI Frontier | 8 | Inference infra |
| Cerebras | AI Models, Product Manager | Semiconductors | 8 | Inference infra · Agent orchestration · Quantization · Fine-tuning |
| Whatnot | Senior Engineering Manager, ML Platform | Consumer | 8 | Inference infra |
| JPMorgan Chase | Applied AI Engineer - Agentic Systems - Senior Associate | Banking | 8 | Agent orchestration · RAG · Vector DB · Tool use · Guardrails · LLM observability · Fine-tuning |
| Amazon | Sr Software Development Manager, Generative AI for AWS Neuron | Big Tech | 8 | Agent orchestration · Inference infra · Code gen |
| Walmart | (USA) Staff, Software Engineer | MLE | Retail | 8 | Multimodal · Vision · Fine-tuning · Inference infra |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Evals |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra |