5242 AI roles tagged model_serving.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Anthropic | Staff + Sr. Software Engineer, Inference | AI Frontier | 8 | Inference infra |
| Cohere | Software Engineer, Internal Infrastructure (North America) | AI Frontier | 8 | Inference infra |
| Postman | Applied AI Scientist, Small Language Model and AI Training | Enterprise | 8 | Fine-tuning · Frontier research · Guardrails · Interpretability |
| Glean | Software Engineer, Agentic Runtime | Enterprise | 8 | Agent orchestration · Tool use · Inference infra · LLM observability · Guardrails |
| Databricks | Software Engineer - GenAI inference | Data AI | 8 | Inference infra · LLM observability |
| Walmart | (USA) Staff, Data Scientist | Retail | 8 | Agent orchestration · RAG · Fine-tuning · LLM observability |
| Anthropic | Creative Technologist, Editorial | AI Frontier | 8 | LLM observability · Multimodal · Agent orchestration · Tool use |
| Roblox | Sr Machine Learning Engineer - Safety Experience | Consumer | 8 | Multimodal · Fine-tuning |
| Amazon | Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference | Big Tech | 8 | Inference infra |
| Cohere | Member of Technical Staff, MLE (UK/EU) | AI Frontier | 8 | Agent orchestration · RAG · Fine-tuning |
| Uber | Sr. Staff Engineer (Conversational/Voice AI) | Consumer | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Audio & speech · Multimodal |
| Peloton | Staff Enterprise AI Engineer | Consumer | 8 | Agent orchestration · RAG · Vector DB · Inference infra |
| NVIDIA | Software Engineer, LLM Inference | Semiconductors | 8 | Inference infra |
| Tenstorrent | Machine Learning Engineer, AI Models | Semiconductors | 8 | Inference infra · Fine-tuning · Vision · LLM observability |
| Roblox | Principal Machine Learning Engineer, Safety Experience | Consumer | 8 | Vision · Multimodal |
| NVIDIA | Compute Architecture Software Engineer | Semiconductors | 8 | Inference infra |
| Klaviyo | Lead AI Software Engineer | Enterprise | 8 | Agent orchestration |
| Deloitte | Agentic AI, AI & Data Manager | Consulting | 8 | Agent orchestration · Tool use · Agent research · RAG · Vector DB · LLM observability · Guardrails |
| Cerebras | Product Manager, Strategic Verticals | Semiconductors | 8 | Inference infra · Fine-tuning |
| Amazon | Software Development Engineer AI/ML, Inference Serving, AWS Neuron | Big Tech | 8 | Inference infra · LLM observability · Multimodal |
| NVIDIA | Software Engineer, cuDNN - Deep Learning | Semiconductors | 8 | Inference infra |
| Software Engineer III, AI/ML GenAI, Google Cloud AI | Big Tech | 8 | Inference infra · Fine-tuning · Evals · Multimodal · Vision · Audio & speech · Code gen | |
| JPMorgan Chase | Asset Management - AI Engineer - Associate/VP | Banking | 8 | Agent orchestration · RAG · Fine-tuning · LLM observability |
| OpenAI | Forward Deployed Engineer - Tokyo | AI Frontier | 8 | LLM observability · Evals |
| Roblox | Senior Machine Learning - Avatar, Core AI | Consumer | 8 | Inference infra · Fine-tuning |
| LangChain | Fullstack Software Engineer, Applied AI | Data AI | 8 | Agent orchestration · RAG · Evals · LLM observability |
| Capital One | Principal Data Scientist, AI Foundations | Banking | 8 | Fine-tuning · LLM observability · RAG · Vector DB |
| Moveworks | Senior Software Engineer II, Agentic AI Systems | Enterprise | 8 | Agent orchestration · Tool use · LLM observability · Multimodal |
| Moveworks | Senior Software Engineer II, Agentic AI Systems | Enterprise | 8 | Agent orchestration · Tool use · LLM observability · Multimodal |
| Moveworks | Senior Machine Learning Engineer II - LLM | Enterprise | 8 | Inference infra · LLM observability |