5375 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Salesforce | Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack | Enterprise | 8 | Inference infra · Training infra · LLM observability |
| NVIDIA | Senior Solutions Architect, Generative AI | Semiconductors | 8 | Inference infra · Recommender systems |
| Adobe | Senior Data Science Engineer, GenAI Platforms & Data Infrastructure | Enterprise | 8 | Agent orchestration · RAG · Tool use · LLM observability |
| Adobe | Machine Learning Engineer | Enterprise | 8 | Agent orchestration · Tool use · Evals · RAG |
| Adobe | Applied Scientist 4 | Enterprise | 8 | Fine-tuning · Multimodal |
| Snowflake | Staff Software Engineer, Cortex AI Infrastructure | Data AI | 8 | Agent orchestration · RAG · Vector DB · Evals · Guardrails · LLM observability · Inference infra |
| Snowflake | Sr. Enterprise Data & AI Architect | Data AI | 8 | Agent orchestration · RAG · LLM observability |
| Microsoft | Member of Technical Staff, Applied AI Engineer | Big Tech | 8 | Agent orchestration · RAG · Evals · LLM observability · Multimodal · Recommender systems · Search & ranking · Tool use |
| Sr. Staff Machine Learning Engineer, Content Quality | Consumer | 8 | LLM observability · Vision · Recommender systems | |
| Mistral AI | Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - Morocco | AI Frontier | 8 | Fine-tuning · RAG · Vector DB · Agent orchestration · LLM observability · Inference infra |
| Duolingo | Staff AI Research Engineer | Consumer | 8 | Recommender systems · Fine-tuning · LLM observability |
| JPMorgan Chase | Applied AI ML Director | Banking | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Evals |
| JPMorgan Chase | Applied AI Lead | Banking | 8 | Agent orchestration · LLM observability · Fine-tuning |
| Honeywell | Sr Advanced AI Engr | Industrial | 8 | Agent orchestration · Tool use · Multimodal · Fine-tuning · LLM observability |
| Software Engineer III, Machine Learning, Research and Products | Big Tech | 8 | Agent orchestration · RAG · LLM observability · Fine-tuning · Audio & speech | |
| Axon | Sr. Full Stack Member of Technical Staff | Enterprise | 8 | Multimodal · Agent orchestration · Inference infra |
| Forward Deployed Engineer II, Generative AI, Google Cloud | Big Tech | 8 | Agent orchestration · Multi-agent · RAG · Vector DB · Evals · LLM observability | |
| Software Engineering Manager, Cloud ML Compute Services (Mandarin, English) | Big Tech | 8 | Inference infra · Fine-tuning · Evals | |
| Disney | Staff GenAI/ML Engineer (Emerging Tech & AI Automation) Project Hire | Media | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability · Evals |
| NVIDIA | Principal Cloud Services Software Engineer | Semiconductors | 8 | Inference infra |
| NVIDIA | Principal AI and ML Infra Software Engineer, GPU Clusters | Semiconductors | 8 | Inference infra |
| Comcast | Machine Learning Engineer 4 | Media | 8 | Agent orchestration · LLM observability · Evals · Guardrails · Inference infra |
| Salesforce | Software Engineering PMTS | Enterprise | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra |
| Walmart | Senior, Data Scientist | Retail | 8 | Multimodal · RAG · Fine-tuning |
| Amazon | Manager, Applied Science, Alexa AI | Big Tech | 8 | Agent orchestration · LLM observability · Evals |
| Sigma Computing | Staff AI/ML Engineer | Data AI | 8 | Agent orchestration · Tool use · Inference infra · Fine-tuning · Multimodal |
| Expedia | Senior Software Development Engineer (GenAI, Agentic AI) | Hospitality | 8 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Inference infra · LLM observability · Guardrails |
| NVIDIA | Compiler Engineer - AI Inference | Semiconductors | 8 | Inference infra |
| Apple | Sr. Machine Learning Research Engineer, Siri Speech | Big Tech | 8 | Audio & speech · Fine-tuning |
| ByteDance | Software Engineer - AI Agent Memory Infrastructure | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Multimodal · LLM observability |