3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Honeywell | AI Engr | Industrial | 7 | Model serving |
| Asana | AI Engineer | Enterprise | 7 | Agent orchestration · Tool use · RAG · LLM observability · Model serving |
| Amazon | Software Development Engineer, AGI - Web information retrieval | Big Tech | 7 | RAG · Vector DB · Search & ranking · Recommender systems · Multimodal · Model serving |
| Amazon | Software Development Engineer (Java/AWS), Alexa AI | Big Tech | 7 | Agent orchestration · LLM observability · Model serving |
| Amazon | Software Dev Engineer III, Conversational Ad Experiences | Big Tech | 7 | Agent orchestration · LLM observability · Model serving · RAG |
| Amazon | Software Development Engineer II | Big Tech | 7 | Embodied AI · Model serving |
| Amazon | Software Development Engineer, CreativeX | Big Tech | 7 | Model serving |
| NVIDIA | AI Factory CPU focused Solutions Architect | Semiconductors | 7 | Model serving · Agent orchestration · Tool use · Evals |
| NVIDIA | Senior Solutions Architect, AI Factory Infrastructure | Semiconductors | 7 | Model serving · Agent orchestration · Synthetic data |
| Intel | Research Intern: Agent-CC System | Semiconductors | 7 | Agent orchestration · Agent research · Model serving |
| Expedia | Machine Learning Scientist III - Multi-Product AI | Hospitality | 7 | Recommender systems · Search & ranking · Model serving · RAG · Agent orchestration |
| NVIDIA | Senior Software Engineer - Verification AI Infrastructure | Semiconductors | 7 | Model serving |
| Intel | AI Software Engineer Intern | Semiconductors | 7 | Model serving · Fine-tuning · Quantization · Distillation |
| Intel | Workload optimization intern | Semiconductors | 7 | Model serving |
| Visa | SW Engineer | Fintech | 7 | Agent orchestration · LLM observability · Model serving |
| Microsoft | Principal Software Engineer | Big Tech | 7 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Model serving |
| Character AI | Technical Program Manager, AI Infrastructure | AI Frontier | 7 | Model serving · Evals |
| Uber | Software Engineer II | Consumer | 7 | Model serving |
| Handshake | Senior Software Engineer, FDE | Enterprise | 7 | LLM observability · Model serving |
|  | SoC Vision Architect, Silicon, Google Cloud | Big Tech | 7 | Model serving · Vision · Multimodal |
| GitLab | Senior Backend Engineer (AI), Pipeline Execution | Enterprise | 7 | Agent orchestration · Model serving · LLM observability |
| Stripe | Backend Engineer, AI Security | Fintech | 7 | Model serving · Guardrails · Agent orchestration · Tool use |
|  | Senior Software Engineer, Infrastructure, CoreOS Agentic Engineering | Big Tech | 7 | Agent orchestration · Model serving |
| Amazon | Sr. Technical Program Manager | Big Tech | 7 | Model serving · Agent orchestration |
| Amazon | Sr. Software Development Engineer, Bedrock AgentCore Knowledge Bases | Big Tech | 7 | RAG · Agent orchestration · Model serving |
| Amazon | Senior SDE, Prime Video Personalization & Discovery | Big Tech | 7 | Recommender systems · Model serving |
| Amazon | Senior Delivery Consultant - Modernization, Professional Services Israel | Big Tech | 7 | RAG · Agent orchestration · Model serving |
| Amazon | Software Dev Engineer, EC2 Nitro | Big Tech | 7 | Model serving · Multimodal |
| Amazon | Senior Software Development Engineer, EC2 Nitro | Big Tech | 7 | Model serving · Multimodal |
| Amazon | Sr Software Development Engineer, EC2 Nitro Machine Learning Systems | Big Tech | 7 | Model serving · LLM observability · Multimodal |