3035 AI roles tagged inference_infra.

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Snap | Working Student - Machine Learning | Consumer | 8 | Vision · Model serving · Quantization |
| NVIDIA | Senior Product Manager, AI Inference - Dynamo | Semiconductors | 8 | Model serving · Agent orchestration · Agent research · LLM observability |
| Anthropic | Anthropic Fellows Program — ML Systems & Performance | AI Frontier | 8 | Synthetic data · Model serving |
| NVIDIA | AI and FSI Developer Technology Engineer - New College Grad 2026 | Semiconductors | 8 | Model serving |
| Capital One | Lead AI Engineer (MLX, Gen AI Platform Services, Agentic AI) | Banking | 8 | Model serving · Guardrails · Vector DB · LLM observability |
| GEICO | Senior Staff Machine Learning Engineer, AI Agent Platform | Insurance | 8 | Agent orchestration · Agent research · Fine-tuning · Model serving · RAG · Guardrails · LLM observability · Evals · Tool use |
| NVIDIA | Senior Software Engineer, RAG and Agentic AI | Semiconductors | 8 | RAG · Agent orchestration · Tool use · Model serving · LLM observability · Evals |
| Apple | AI/ML Software Engineer - SES Gen AI Solutions, IS&T | Big Tech | 8 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability · Guardrails · Fine-tuning · Model serving · Multimodal |
| | Staff Software Engineer, On-Device Machine Learning | Big Tech | 8 | Model serving · Fine-tuning · Evals · Audio & speech |
| Microsoft | Principal Software Engineer | Big Tech | 8 | Model serving |
| ByteDance | Tech Lead, Software Engineer - AI Agent Memory Infrastructure | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Multimodal · Model serving · LLM observability |
| Amazon | Senior ML Engineer, Fauna | Big Tech | 8 | Model serving · RL robotics · Embodied AI |
| Netflix | Software Engineer 5 – Model Runtime, AI Platform | Big Tech | 8 | RL post-training · Fine-tuning · Model serving · Multimodal |
| ByteDance | Senior Software Engineer - AI Agent Memory Infrastructure | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Model serving · Multimodal · LLM observability |
| Microsoft | Senior Software Engineer, CoreAI Workload Engines | Big Tech | 8 | Model serving · LLM observability · Guardrails |
| Microsoft | Principal Software Engineer, CoreAI Workload Engines | Big Tech | 8 | Model serving · LLM observability · Guardrails |
| Snowflake | Principal Software Engineer - AI Poland | Data AI | 8 | Model serving |
| Intercom | Engineering Manager, AI Models Infrastructure | Enterprise | 8 | Model serving |
| Cursor | Engineering Manager, Model Routing & Inference | Coding AI | 8 | Model serving · Agent orchestration |
| Netflix | Senior ML Engineer, GenAI - Games | Big Tech | 8 | Agent orchestration · Fine-tuning · Model serving · Code gen · Multimodal |
| NVIDIA | Solutions Architect, Physical AI and Robotics | Semiconductors | 8 | Embodied AI · Synthetic data · Evals · Agent orchestration · Model serving |
| NVIDIA | Senior Solutions Architect - KV Cache and AI Storage | Semiconductors | 8 | Model serving |
| NVIDIA | Solutions Architect - Top AI Labs | Semiconductors | 8 | Model serving |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving |
| Visa | Machine Learning Engineer | Fintech | 8 | Agent orchestration · RAG · Model serving · Guardrails |
| | Staff Software Engineering, YouTube ML Efficiency | Big Tech | 8 | Recommender systems · Model serving · Fine-tuning · Evals |
| Samsara | Senior Manager, Safety AI | Enterprise | 8 | Model serving |
| Samsara | Senior Manager, Safety AI | Enterprise | 8 | Model serving · Multimodal |
| Nuro | Senior Software Engineer – GenAI Infrastructure & Agent Systems for Engineering Efficiency | Robotics | 8 | Agent orchestration · Tool use · Model serving · LLM observability |
| Mercury | Senior Software Engineer - AI Engineering | Fintech | 8 | Agent orchestration · RAG · Evals · Guardrails · LLM observability · Model serving |