3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | Senior System Software Engineer, Speech AI | Semiconductors | 8 | Audio & speech · Model serving · Fine-tuning |
| NVIDIA | Senior System Software Engineer, Speech AI | Semiconductors | 8 | Audio & speech · Model serving |
| xAI | Member of Technical Staff - Imagine Product | AI Frontier | 8 | Multimodal · Model serving · Vision · Audio & speech |
| Agility Robotics | Senior Manager, AI Innovation | Robotics | 8 | Embodied AI · LLM observability · Model serving · Synthetic data |
| NVIDIA | NIM Solutions Architect | Semiconductors | 8 | Model serving · Fine-tuning · Agent orchestration · Multimodal |
| NVIDIA | Solution Architecture Intern, AI in Industry - 2026 | Semiconductors | 8 | Model serving · Fine-tuning · Multimodal · Audio & speech · RL robotics |
| Premera Blue Cross | AI Engineer III | Insurance | 8 | Model serving · RAG · Guardrails · LLM observability |
| Anthropic | Engineering Manager, Cloud Inference AWS | AI Frontier | 8 | Model serving |
| Together AI | Engineering Manager, Model Serving | Data AI | 8 | Model serving · Fine-tuning · LLM observability |
| Amazon | Senior AI Solution Architect | Big Tech | 8 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Model serving · Evals · LLM observability · Multimodal |
| Capital One | Lead AI Engineer | Banking | 8 | Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability · Evals |
| Staff Software Engineer, AI/ML GenAI, Google Workspace | Big Tech | 8 | Model serving · Fine-tuning · Vision · Multimodal | |
| Senior Software Engineer, Cloud AI/ML Infrastructure | Big Tech | 8 | Model serving | |
| Cohere | Engineering Manager, North | AI Frontier | 8 | Agent orchestration · Model serving |
| Amazon | Senior Software Engineer, Speech MLOps | Big Tech | 8 | Audio & speech · Model serving |
| NVIDIA | Senior Software Engineer – ADAS | Semiconductors | 8 | Model serving |
| Whatnot | Machine Learning Platform Engineer | Consumer | 8 | Model serving |
| Whatnot | LLM Platform Engineer | Consumer | 8 | Agent orchestration · RAG · Evals · LLM observability · Model serving |
| Microsoft | MTS - Platform Engineer (Tools) | Big Tech | 8 | Agent orchestration · Tool use · Model serving |
| Modal | Forward Deployed Engineer - ML | Data AI | 8 | Model serving · Fine-tuning · RL post-training |
| NVIDIA | Performance Engineer Intern, Deep Learning and HPC - 2026 | Semiconductors | 8 | Model serving |
| Senior Software Engineer, AI/ML, Google Cloud | Big Tech | 8 | Model serving | |
| Autodesk | Machine Learning Engineering Manager, Model Delivery | Enterprise | 8 | Model serving · LLM observability · Evals |
| Amperity | Lead Machine Learning Engineer | Seattle | 8 | Model serving |
| Whatnot | Technical Lead Manager, ML Infrastructure | Consumer | 8 | Model serving · LLM observability |
| Capital One | Lead AI Engineer | Banking | 8 | Model serving · Guardrails · Vector DB · LLM observability |
| Anthropic | Senior Staff Software Engineer, API | AI Frontier | 8 | Model serving · Agent orchestration · Vector DB |
| Amazon | Data Scientist II, RufusX Science UK | Big Tech | 8 | Agent orchestration · Multimodal · Recommender systems · LLM observability · Model serving |
| NVIDIA | Senior Solutions Architect, AI Factory | Semiconductors | 8 | Model serving |
| NVIDIA | Software Engineering Manager, Robotics | Semiconductors | 8 | Embodied AI · RL robotics · Model serving · Sim-to-real |