5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Microsoft | Principal Applied Scientist | Big Tech | 8 | Agent orchestration · LLM observability · Inference infra |
| JPMorgan Chase | Associate Applied AI & ML Scientist – Markets Operations | Banking | 8 | |
| Senior Software Engineer, AI/ML GenAI, Google Workspace | Big Tech | 8 | Multimodal · Vision · Inference infra | |
| Capital One | Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services) | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra |
| Premera Blue Cross | AI Engineer IV | Insurance | 8 | Agent orchestration · RAG · Evals · Multi-agent |
| Microsoft | Senior AI Software Architect | Big Tech | 8 | Inference infra · Quantization · Fine-tuning |
| Datadog | Manager I, Engineering - AI Platform - Training & Serving | Enterprise | 8 | Inference infra |
| Amazon | Member of Technical Staff, AGI Autonomy | Big Tech | 8 | Agent orchestration · Agent research · RL robotics · Embodied AI |
| Amazon | Sr. Machine Learning Engineer, AWS Applied AI Solution | Big Tech | 8 | Agent orchestration · Inference infra · Fine-tuning |
| Capital One | Director, AI Engineering | Banking | 8 | Agent orchestration · Agent research · Inference infra |
| Cohere | Site Reliability Engineer, Inference Infrastructure | AI Frontier | 8 | Inference infra |
| Cohere | Staff Software Engineer, Inference Infrastructure | AI Frontier | 8 | Inference infra |
| JPMorgan Chase | Lead Machine Learning Engineer-MLOps | Banking | 8 | Inference infra · LLM observability · Vector DB · Recommender systems |
| Synthesia | Senior Research Engineer - Audio Post-Training | Multimodal | 8 | Audio & speech · Fine-tuning · RL post-training · Inference infra · Multimodal |
| Ramp | Applied AI Engineer | Fintech | 8 | Agent orchestration · RAG · Fine-tuning · Inference infra |
| Bill.com | Senior Machine Learning Engineer | Fintech | 8 | Agent orchestration · Agent research · LLM observability · RAG · Fine-tuning |
| Capital One | Principal Associate, Data Scientist - LLM Customization Team | Banking | 8 | Fine-tuning · RAG · Vector DB · LLM observability · Agent orchestration |
| NVIDIA | Distinguished Engineer, JAX | Semiconductors | 8 | Inference infra |
| NVIDIA | Senior Software Architect - Deep Learning and HPC Communications | Semiconductors | 8 | Inference infra |
| PitchBook | Sr. Machine Learning Engineer | Fintech | 8 | Semantic search |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Fine-tuning · Inference infra · Guardrails · LLM observability · RAG · Vector DB · Evals |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Fine-tuning · Inference infra · Guardrails · LLM observability · RAG · Vector DB · Evals |
| NVIDIA | Distinguished Engineer - Dynamo | Semiconductors | 8 | Inference infra |
| NVIDIA | Principal Software Engineer - Dynamo | Semiconductors | 8 | Inference infra · LLM observability · Agent orchestration |
| NVIDIA | Principal Software Engineer – Large-Scale LLM Memory and Storage Systems | Semiconductors | 8 | Inference infra |
| NVIDIA | Senior Software Engineer, Deep Learning - MLIR TRT | Semiconductors | 8 | Inference infra · Quantization |
| NVIDIA | Senior Software Engineer, Real-Time AI and Rendering - Holoscan SDK | Semiconductors | 8 | Multimodal · Inference infra |
| NVIDIA | Manager, Deep Learning Algorithms | Semiconductors | 8 | Inference infra |
| NVIDIA | Senior Software Architect - Deep Learning and HPC Communications | Semiconductors | 8 | Inference infra |
| NVIDIA | Senior Deep Learning Performance Architect | Semiconductors | 8 | Inference infra |