5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Uber | Manager II, Technical Program Management, GenAI | Consumer | 8 | Evals · Agent orchestration · Fine-tuning · RL post-training |
| Microsoft | Principal Architect | Big Tech | 8 | Agent orchestration · LLM observability · Guardrails |
| Snowflake | Staff Applied AI Engineer | Data AI | 8 | Agent orchestration · LLM observability · RAG · Evals · Guardrails |
| Apple | Sr. Machine Learning Engineer, Siri Speech | Big Tech | 8 | Fine-tuning · Inference infra · Audio & speech |
| Toast | Staff Software Engineer, AI Foundations | Enterprise | 8 | Agent orchestration · Tool use · LLM observability |
| Amazon | Applied Scientist II, Sponsored Products and Brands-Agent | Big Tech | 8 | Agent orchestration · LLM observability · Tool use · Fine-tuning |
| Amazon | Software Development Engineer, Sponsored Products and Brands | Big Tech | 8 | Agent orchestration · Inference infra · Guardrails |
| Amazon | Software Development Engineer - AI/ML, Amazon Neuron, Multimodal Inference | Big Tech | 8 | Inference infra |
| Amazon | Software Development Engineer, ML Systems, Annapurna Labs | Big Tech | 8 | Agent orchestration · Inference infra |
| NVIDIA | Machine Learning Intern - AI Agents Conversational AI | Semiconductors | 8 | Agent orchestration · RAG · Vector DB · Audio & speech · LLM observability · Inference infra |
| NVIDIA | Machine Learning Intern - 2026 | Semiconductors | 8 | Inference infra |
| Adobe | Machine Learning Engineer 5 | Enterprise | 8 | Inference infra · Fine-tuning |
| Adobe | Senior Applied Scientist | Enterprise | 8 | Fine-tuning · RL post-training · Reward modeling · Multimodal · Vision |
| Expedia | Principal Software Development Engineer - Gen AI | Hospitality | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning |
| Expedia | Senior Machine Learning Engineer | Hospitality | 8 | Inference infra · RAG · Agent orchestration · LLM observability · Guardrails |
| NVIDIA | Senior Performance Compiler Engineer - Triton | Semiconductors | 8 | Inference infra |
| NVIDIA | Senior Systems Engineer, Neural Graphics | Semiconductors | 8 | Inference infra · Vision · Multimodal · Agent orchestration |
| NVIDIA | Senior Data and AI Solutions Engineer | Semiconductors | 8 | Agent orchestration · Tool use · Evals · RAG · Inference infra |
| Capital One | Senior Distinguished Engineer, AI Compute (Remote Eligible) | Banking | 8 | Inference infra · Pretraining · Fine-tuning · Agent orchestration |
| Capital One | Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) | Banking | 8 | Agent orchestration · Inference infra · Guardrails · Vector DB |
| Zendesk | Staff Security Engineer | Enterprise | 8 | Agent orchestration · Tool use · Guardrails · LLM observability · Inference infra |
| BCG | Global IT GenAI Software Engineer Director - AI & Innovation | Consulting | 8 | Agent orchestration · Multi-agent · RAG · Vector DB · Fine-tuning · LLM observability · Guardrails |
| Autodesk | Principal Developer, AI/ML | Enterprise | 8 | Agent orchestration · RAG · Fine-tuning · LLM observability |
| Ford | Full Stack Software Engineer, AI Integration | Auto | 8 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability · Evals |
| Google | Senior Staff Software Engineer, AI/ML, Google Workspace | Big Tech | 8 | Inference infra · Fine-tuning · Audio & speech |
| Google | Software Engineer III, Google Home Video Intelligence | Big Tech | 8 | Vision · Fine-tuning · Inference infra |
| Skydio | Autonomy Engineer - ML & DL Infrastructure | Defense | 8 | Training infra |
| Microsoft | Principal Software Engineer - Performance | Big Tech | 8 | Inference infra · LLM observability |
| Robinhood | Senior Engineering Manager, Agentic AI | Fintech | 8 | Agent orchestration · Tool use · Evals · LLM observability |
| JPMorgan Chase | Senior Lead Security Engineer, AI | Banking | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · Guardrails · Evals · LLM observability |