3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Walmart | (USA) Staff, Software Engineer - MLE- Agentic AI & AIOps | Retail | 8 | Agent orchestration · LLM observability · Model serving |
| Fireworks AI | Software Engineer, AI Infrastructure | Data AI | 8 | Model serving |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Guardrails · Vector DB · RAG · Evals · LLM observability · Fine-tuning · Agent orchestration |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Fine-tuning · Guardrails · Vector DB · RAG · Agent orchestration · LLM observability · Evals |
| NVIDIA | Director, Engineering – Software Engineering and AI Inferencing Platforms | Semiconductors | 8 | Model serving |
| Amazon | Applied Scientist | Big Tech | 8 | Model serving |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Agent orchestration |
| Moveworks | Principal Product Manager, Search Platform | Enterprise | 8 | Agent orchestration · Semantic search · Audio & speech · Model serving |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Guardrails · Vector DB · RAG · LLM observability · Fine-tuning |
| Sentry | Senior Software Engineer, AI | Enterprise | 8 | Agent orchestration · Model serving |
| Cerebras | Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai | Semiconductors | 8 | Model serving |
| Amazon | Sr. Software Engineer- AI/ML, AWS Neuron Apps | Big Tech | 8 | Model serving · Multimodal |
| Amazon | Sr. Applied Scientist, SSG Science | Big Tech | 8 | Fine-tuning · Model serving · Quantization · Distillation |
| Cohere | Senior/Staff Full-Stack Engineer | AI Frontier | 8 | Agent orchestration · RAG · Model serving |
| Perplexity | Engineering Site Lead | AI Frontier | 8 | Model serving |
| Nuro | Software Engineer, AI Platform - Intern | Robotics | 8 | Model serving |
| Roblox | Principal Machine Learning Engineer, Communication Safety | Consumer | 8 | LLM observability · Model serving · Multimodal · Data pipeline |
| Samsara | Senior Machine Learning Engineer - Edge AI | Enterprise | 8 | Multimodal · Model serving · Quantization · Distillation |
| Klaviyo | Senior AI Engineer | Enterprise | 8 | Agent orchestration · Fine-tuning · Evals · Model serving |
| NVIDIA | Senior System Software Architect, HPC and AI Networking | Semiconductors | 8 | Model serving |
| Amazon | Software Engineer- AI/ML, AWS Neuron | Big Tech | 8 | Pretraining · Model serving |
| HeyGen | Tech Lead, AI Compute Infrastructure | Multimodal | 8 | Model serving · Multimodal |
| Roblox | Distinguished Engineer, Machine Learning Systems – Economy | Consumer | 8 | Recommender systems · Search & ranking · Model serving · RAG · Agent orchestration · LLM observability · Multimodal |
| Apptronik | Senior Perception Learning Engineer | Robotics | 8 | Vision · Model serving · Multimodal |
| Amazon | Research Scientist, SSG Science | Big Tech | 8 | Quantization · Distillation |
| Baseten | Software Engineer - Model APIs | Data AI | 8 | Model serving · Tool use · Multimodal |
| Scale AI | Tech Lead Manager- MLRE, ML Systems | Data AI | 8 | Fine-tuning · RL post-training · Model serving |
| Anthropic | Staff + Sr. Software Engineer, Inference | AI Frontier | 8 | Model serving |
| Cohere | Software Engineer, Internal Infrastructure (North America) | AI Frontier | 8 | Model serving |
| Glean | Software Engineer, Agentic Runtime | Enterprise | 8 | Agent orchestration · Tool use · Model serving · LLM observability · Guardrails |