5242 AI roles tagged model_serving.

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | Principal Architect, AI Networking | Semiconductors | 9 | Inference infra |
| NVIDIA | Senior Software Engineer, RL Post-Training Frameworks | Semiconductors | 9 | RL post-training · Inference infra |
| Target | Lead Engineer - GenAI | Retail | 9 | Agent orchestration · Agent research · Tool use · LLM observability · RAG · Fine-tuning · Inference infra · Synthetic data |
| Capital One | Applied Researcher II | Banking | 9 | Fine-tuning · Frontier research · Pretraining · RL post-training · Vector DB |
| Walmart | Senior Data Scientist: Associate AI Experience | Retail | 9 | Agent orchestration · Tool use · Evals · RAG · Vector DB · Fine-tuning · LLM observability |
| Walmart | Principal Data Scientist: Associate AI Experience | Retail | 9 | Agent orchestration · Tool use · Evals · RAG · Vector DB · Fine-tuning · Multi-agent |
| Walmart | Distinguished Data Scientist: Associate AI Experience | Retail | 9 | Agent orchestration · Tool use · Evals · RAG · Vector DB · Fine-tuning · Multi-agent |
| JPMorgan Chase | Generative AI Executive Director | Banking | 9 | Agent orchestration · Multimodal · Fine-tuning · Inference infra |
| NVIDIA | Manager, Deep Learning – Autonomous Vehicles and Robotics | Semiconductors | 9 | Inference infra · Vision · Multimodal · Agent orchestration |
| NVIDIA | Senior Deep Learning Algorithms Engineer - BioNeMo | Semiconductors | 9 | Inference infra · Quantization · Vision |
| | Master's Fall Machine Learning Internship (ATG - Visual Search) | Consumer | 9 | Agent orchestration · Tool use · Inference infra · Multimodal · LLM observability |
| Wayve | Tech Lead, ML Engineer - AV Product engineering | Robotics | 9 | Embodied AI · Multimodal · Vision · Fine-tuning · Evals |
| | Senior Software Engineering Manager, Emergent AI Infrastructure | Big Tech | 9 | Inference infra |
| Upstart | Principal Engineer, LLM | Fintech | 9 | Inference infra · RAG · Vector DB · Evals · LLM observability |
| Dropbox | Senior Machine Learning Engineer, Dash Agentic AI | Enterprise | 9 | Agent orchestration · Tool use · Evals · Guardrails · Fine-tuning · Inference infra · RAG · Agent research |
| Mistral AI | Open-Source Software, Machine Learning Engineer | AI Frontier | 9 | Inference infra · Fine-tuning |
| OpenAI | Machine Learning Engineer, API Multicloud | AI Frontier | 9 | Fine-tuning · RL post-training · Evals · Agent orchestration · Tool use · Audio & speech |
| NVIDIA | Senior AI Software Engineer, Kernel Libraries | Semiconductors | 9 | Inference infra · GPU kernels |
| NVIDIA | Senior Software Engineer, AI and DL Kernel Libraries | Semiconductors | 9 | Inference infra |
| NVIDIA | Senior AI Compiler Engineer, MLIR | Semiconductors | 9 | Inference infra |
| Adobe | Senior Machine Learning Engineer | Enterprise | 9 | Fine-tuning · Multimodal · Vision · Audio & speech · Inference infra |
| Adobe | Machine Learning Engineer - II | Enterprise | 9 | Fine-tuning · Multimodal · Inference infra |
| Skydio | Autonomy Engineer Intern - Deep Learning (Computational Photography) | Defense | 9 | Inference infra · Fine-tuning · Synthetic data · Multimodal |
| ServiceNow | Senior Machine Learning Engineer, Agentic Systems - Moveworks | Enterprise | 9 | Inference infra · Fine-tuning · LLM observability · Agent orchestration |
| ServiceNow | Engineering Manager, Agentic Systems - Moveworks | Enterprise | 9 | Inference infra · Fine-tuning · Evals |
| | Forward Deployed Engineer, GenAI, Google Cloud | Big Tech | 9 | Agent orchestration · Tool use · Evals · LLM observability · RAG · Fine-tuning |
| NVIDIA | Solutions Architect, AI Models | Semiconductors | 9 | Multimodal · Audio & speech · Inference infra · Fine-tuning · RL post-training · Evals |
| NVIDIA | Senior Solutions Architect, Retail | Semiconductors | 9 | Agent orchestration · RAG · Inference infra · Tool use |
| NVIDIA | Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles | Semiconductors | 9 | Inference infra · Quantization · Distillation · Embodied AI · Multimodal |
| F5 | Principal Engineer – AI Specialist | Enterprise | 9 | Agent orchestration · Agent research · LLM observability · Inference infra · Multimodal |