3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Software Engineering Manager II, AI/ML GenAI, Google Cloud AI | Big Tech | 8 | Model serving · Fine-tuning · Multimodal · Vision | |
| Microsoft | Principal Software Engineering-CoreAI | Big Tech | 8 | Agent orchestration · Model serving |
| Staff Software Engineer, TPU, Performance | Big Tech | 8 | Model serving | |
| JPMorgan Chase | Applied AI ML Lead - Payments | Banking | 8 | Model serving · RAG · Agent orchestration · Tool use · Guardrails · LLM observability · Evals |
| Staff Software Engineer, Discover AI Transformation, GenAI Personalization | Big Tech | 8 | Agent orchestration · Fine-tuning · Model serving · Recommender systems · LLM observability · Evals · RAG | |
| Wayve | Staff Cloud SRE – AI/ML Platform & GPU Compute | Robotics | 8 | Model serving |
| NVIDIA | Senior Research Engineer, Robotics Systems | Semiconductors | 8 | Embodied AI · Model serving · Multimodal · RL robotics |
| NVIDIA | Senior Perception Engineer - Autonomous Vehicles | Semiconductors | 8 | Vision · Fine-tuning · Model serving · Evals |
| NVIDIA | Senior Software Engineer, Agentic AI | Semiconductors | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Model serving |
| Capital One | Lead AI/ML Engineer (Platform, kubeflow) | Banking | 8 | Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability |
| Verizon | Senior Engineering Consultant-Cloud & AI | Telecom | 8 | Agent orchestration · RAG · Tool use · LLM observability · Model serving |
| Walmart | Senior, Software Engineer - AI Systems | Retail | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Model serving |
| Walmart | Software Engineer III– AI Systems | Retail | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Model serving |
| F5 | Senior Site Reliability Engineer, AI Inference | Enterprise | 8 | Model serving · LLM observability |
| JPMorgan Chase | Applied AI/ML Lead | Banking | 8 | Vision · Multimodal · Fine-tuning · Model serving · RAG · LLM observability |
| JPMorgan Chase | Data Scientist Lead | Banking | 8 | Vision · Multimodal · Fine-tuning · Model serving · RAG · Evals |
| NVIDIA | Senior Deep Learning Performance Architect | Semiconductors | 8 | Model serving · LLM observability · Fine-tuning · Frontier research |
| Sigma Computing | Staff AI/ML Engineer | Data AI | 8 | Agent orchestration · Tool use · Model serving · Fine-tuning · Multimodal |
| Sigma Computing | Senior AI/ML Engineer | Data AI | 8 | Agent orchestration · Model serving · Fine-tuning · Multimodal |
| Sigma Computing | Senior AI/ML Engineer | Data AI | 8 | Agent orchestration · Tool use · Model serving · Recommender systems · Multimodal |
| Senior Blackbelt Engineer, Gemini Cloud Assist (English) | Big Tech | 8 | Agent orchestration · Semantic search · Model serving | |
| Roblox | Principal Model Optimization Engineer | Consumer | 8 | Model serving · Fine-tuning · Quantization |
| Pendo | Sr. AI Engineer | Enterprise | 8 | RAG · Agent orchestration · LLM observability · Guardrails · Model serving · Fine-tuning |
| Pendo | AI Engineer | Enterprise | 8 | RAG · Agent orchestration · Tool use · Model serving · Evals · Guardrails · LLM observability · Fine-tuning |
| Senior Software Engineering Manager, AI/ML | Big Tech | 8 | Model serving · Audio & speech | |
| Mistral AI | Research Software Engineer - Paris/London | AI Frontier | 8 | Model serving |
| Staff Software Engineer, Machine Learning, Discover Ads Retrieval | Big Tech | 8 | Recommender systems · Search & ranking · Model serving | |
| Suki AI | Architect | Vertical AI | 8 | Agent orchestration · Model serving · LLM observability · Agent research |
| Amazon | Applied Scientist II, Payment Risk Machine Learning | Big Tech | 8 | Agent orchestration · Agent research · LLM observability · Model serving |
| NVIDIA | Senior GenAI Engagement Lead, Partner Platforms | Semiconductors | 8 | Agent orchestration · RAG · Vector DB · Model serving · LLM observability |