3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Microsoft | Principal Software Engineer, CoreAI | Big Tech | 8 | Model serving · LLM observability · Training infra |
| Microsoft | Principal Software Engineering - AI Frameworks | Big Tech | 8 | Model serving |
| Stability AI | Solutions Engineer | AI Frontier | 8 | Model serving · Multimodal |
| Senior Software Engineer, AI/ML GenAI, GCP | Big Tech | 8 | Multimodal · Vision · Model serving | |
| NVIDIA | Robotics and Agent Solution Architecture Intern - 2026 | Semiconductors | 8 | Agent orchestration · Model serving · Multimodal |
| Capital One | Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services) | Banking | 8 | Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability |
| F5 | Engineer III - AI Specialist | Enterprise | 8 | Agent orchestration · Tool use · Model serving · LLM observability |
| Walmart | (USA) Staff, Data Scientist | Retail | 8 | Agent orchestration · RAG · Fine-tuning · Model serving · LLM observability |
| NVIDIA | Technical Lead, GenAI - Autonomous Vehicles | Semiconductors | 8 | Agent orchestration · Model serving · Multimodal |
| NVIDIA | Senior Software Engineer, Computer Vision - Autonomous Vehicles | Semiconductors | 8 | Model serving |
| Perplexity | Member of Technical Staff (AI Infrastructure Engineer) | AI Frontier | 8 | Model serving |
| Perplexity | Member of Technical Staff (AI Inference Engineer) | AI Frontier | 8 | Model serving · Multimodal |
| Perplexity | Member of Technical Staff (Software Engineer, AI Platform) | AI Frontier | 8 | Agent orchestration · Evals · Multimodal · Model serving |
| Senior Software Engineer, Machine Learning, Core ML | Big Tech | 8 | Pretraining · Model serving | |
| JPMorgan Chase | Machine Learning Engineer – Document Digitization (LLMs)-Senior Associate | Banking | 8 | LLM observability · RAG · Fine-tuning · Model serving · Agent orchestration · Tool use · Evals · Guardrails |
| Cerebras | Senior Performance Engineer, Inference | Semiconductors | 8 | Model serving |
| Apple | WW Consulting Engineer - AI/ML | Big Tech | 8 | Model serving |
| NVIDIA | Developer Technology Engineer - AI | Semiconductors | 8 | Model serving |
| NVIDIA | Senior Solutions Architect, CSP System | Semiconductors | 8 | Agent orchestration · Model serving |
| NVIDIA | Senior Integration Engineer - Autonomous Vehicles | Semiconductors | 8 | Model serving |
| Adobe | Senior Director, Product Management | Enterprise | 8 | Agent orchestration · RAG · Fine-tuning · Model serving · Agent research |
| NVIDIA | Senior AI Software Development Engineer, TensorRT-LLM | Semiconductors | 8 | Model serving |
| NVIDIA | Senior Product Manager, AI Frameworks | Semiconductors | 8 | Recommender systems · Fine-tuning · Model serving |
| Snowflake | Technical Director for AI Functions | Data AI | 8 | LLM observability · Model serving |
| Software Engineer III, Generative AI | Big Tech | 8 | Multimodal · Vision · Model serving | |
| Warner Bros Discovery | Sr. Staff, Data Science & Applied AI | Media | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Model serving |
| Capital One | Sr. Distinguished AI Engineer | Banking | 8 | Model serving · LLM observability · Guardrails · Vector DB · Fine-tuning · Multimodal |
| Deloitte | Lead OpenAI Forward Deployed Engineer - GPS | Consulting | 8 | Agent orchestration · RAG · Evals · Model serving |
| Deloitte | Lead Google Forward Deployed Engineer - GPS | Consulting | 8 | Agent orchestration · RAG · Evals · Model serving |
| Deloitte | Lead Forward Deployed Engineer - AWS | Consulting | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Evals · Model serving |