5375 AI roles tagged model_serving.

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Walmart | Distinguished, Software Engineer | Retail | 8 | Agent orchestration · Tool use · Inference infra · LLM observability · Guardrails |
| Deloitte | Manager - GenAI Full Stack Developer | Consulting | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning |
| SoFi | Principal Product Manager, AI SDLC | Fintech | 8 | Agent orchestration · Tool use · Evals · LLM observability · Fine-tuning |
| Sierra | Software Engineer, Agent - Healthcare | AI Frontier | 8 | Agent orchestration · LLM observability · Tool use |
| Meta | Creative Coder | Big Tech | 8 | Agent orchestration · Tool use · LLM observability · RAG · Fine-tuning · Audio & speech · Multimodal · Agent research |
| Stripe | Software Engineer, Machine Learning Infrastructure | Fintech | 8 | Inference infra · LLM observability |
| JPMorgan Chase | Asset Management - Quantitative Analyst - Artificial Intelligence & Machine Learning Focus | Banking | 8 | LLM observability · RAG · Agent orchestration · Fine-tuning · Recommender systems · Search & ranking · Evals |
| Amazon | Applied Scientist, Alexa Smart Properties | Big Tech | 8 | LLM observability |
| Amazon | Senior Software Development Engineer, Stores Foundational AI - Rufus | Big Tech | 8 | Fine-tuning · RL post-training · Reward modeling · Inference infra · Agent orchestration · Tool use · LLM observability |
| NVIDIA | Senior GPU System Architect | Semiconductors | 8 | Inference infra |
| Walmart | Staff, Data Scientist | Retail | 8 | Agent orchestration · Inference infra |
| Physical Intelligence | ML Infra Engineer (Supercomputing) | AI Frontier | 8 | Inference infra |
| OpenAI | Engineering Manager ChatGPT Infra | AI Frontier | 8 | Inference infra |
| NVIDIA | Senior System Software Engineer, Speech AI | Semiconductors | 8 | Audio & speech · Inference infra · Fine-tuning |
| NVIDIA | Senior System Software Engineer, Speech AI | Semiconductors | 8 | Audio & speech · Inference infra |
| xAI | Member of Technical Staff - Imagine Product | AI Frontier | 8 | Multimodal · Inference infra · Vision · Audio & speech |
| Agility Robotics | Senior Manager, AI Innovation | Robotics | 8 | Embodied AI · LLM observability · Inference infra · Synthetic data |
| Cloudflare | Senior Software Engineer (Security) | Enterprise | 8 | Agent orchestration · Agent research · LLM observability · Fine-tuning |
| Amazon | Applied Scientist, Selection Monitoring | Big Tech | 8 | Agent orchestration · Fine-tuning · RL post-training |
| Amazon | Sr. Applied Scientist, Amazon Robotics, Structured Field Coordinated Planning & Control | Big Tech | 8 | Multi-agent |
| NVIDIA | NIM Solutions Architect | Semiconductors | 8 | Inference infra · Fine-tuning · Agent orchestration · Multimodal |
| NVIDIA | Solution Architecture Intern, AI in Industry - 2026 | Semiconductors | 8 | Inference infra · Fine-tuning · Multimodal · Audio & speech · RL robotics |
| Premera Blue Cross | AI Engineer III | Insurance | 8 | Inference infra · RAG · Guardrails · LLM observability |
| Anthropic | Engineering Manager, Cloud Inference AWS | AI Frontier | 8 | Inference infra |
| Together AI | Engineering Manager, Model Serving | Data AI | 8 | Inference infra · Fine-tuning · LLM observability |
| Amazon | Senior AI Solution Architect | Big Tech | 8 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Inference infra · Evals · LLM observability · Multimodal |
| Capital One | Lead AI Engineer | Banking | 8 | Inference infra · Guardrails · Vector DB · Fine-tuning · LLM observability · Evals |
| JPMorgan Chase | Senior Associate - Applied AI Data Scientist | Banking | 8 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · LLM observability · Guardrails |
| | Staff Software Engineer, AI/ML GenAI, Google Workspace | Big Tech | 8 | Inference infra · Fine-tuning · Vision · Multimodal |
| | Senior Software Engineer, Cloud AI/ML Infrastructure | Big Tech | 8 | Inference infra |