3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Postman | Senior Offensive Security Manager | Enterprise | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving · Agent research · Multimodal |
| Tech Lead, YouTube Shorts Discovery, ML Recommendations | Big Tech | 8 | Recommender systems · Model serving · RL robotics · Agent orchestration | |
| Weights & Biases | Staff AI Security Engineer | Data AI | 8 | Agent orchestration · Guardrails · LLM observability · Model serving |
| Together AI | Forward Deployed Engineer (GPU Clusters) | Data AI | 8 | Model serving |
| Senior Software Engineer, TPU Performance, Hardware, Software Co-Design | Big Tech | 8 | Model serving | |
| Honeywell | Sr Advanced Software Engr | Industrial | 8 | Model serving |
| NVIDIA | SOC AI Application Engineer — AI Services, Agents and Knowledge Systems | Semiconductors | 8 | Agent orchestration · RAG · Vector DB · Model serving · Evals · Guardrails · LLM observability · Tool use |
| Capital One | Lead AI Engineer (Gen AI Platform Services) | Banking | 8 | Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability · Evals |
| Adobe | Machine Learning Engineer, Firefly Services | Enterprise | 8 | Model serving · Fine-tuning |
| Microsoft | Software Engineering, CoreAI | Big Tech | 8 | Pretraining · Fine-tuning · Model serving |
| Microsoft | Principal Applied Scientist | Big Tech | 8 | Recommender systems · Search & ranking · Agent orchestration · RAG · LLM observability · Model serving |
| Microsoft | Senior Software Engineering | Big Tech | 8 | Model serving · Fine-tuning · Training infra |
| Microsoft | Principal Software Engineer | Big Tech | 8 | Model serving · Training infra |
| Axon | Sr. Full Stack Member of Technical Staff | Enterprise | 8 | Multimodal · Model serving · Evals |
| Axon | Sr. Full Stack Member of Technical Staff | Enterprise | 8 | Multimodal · Model serving |
| Axon | Sr. Full Stack Member of Technical Staff | Enterprise | 8 | Multimodal · Model serving |
| Senior Software Engineering Manager, AI/ML, Compute Infrastructure | Big Tech | 8 | Model serving · Audio & speech | |
| Axon | Sr. Full Stack Member of Technical Staff | Enterprise | 8 | Multimodal · Model serving |
| NVIDIA | Director, Product Platform Retail and CPG Industries | Semiconductors | 8 | Agent orchestration · RAG · Model serving |
| NVIDIA | Senior Solutions Architect - AI Factory Deployment | Semiconductors | 8 | Model serving · LLM observability |
| Intel | Principal Engineer: XeSS and Neural Graphics | Semiconductors | 8 | Vision · Model serving · Fine-tuning · Evals · Multimodal |
| Salesforce | Senior/Lead AI Software Engineer, Agentforce for Supply Chain | Enterprise | 8 | Agent orchestration · Tool use · Model serving |
| Axon | Sr. FullStack Member of Technical Staff | Enterprise | 8 | Multimodal · Model serving |
| NVIDIA | Senior Software Engineer, Deep Learning Inference | Semiconductors | 8 | Model serving |
| NVIDIA | Senior Hardware Architect, Deep Learning GPU and System | Semiconductors | 8 | Model serving |
| Fireworks AI | Associate Product Manager | Data AI | 8 | Model serving · Fine-tuning · Agent orchestration · Multimodal |
| Verkada | Software Engineering Manager - Search | Enterprise | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Fine-tuning · Model serving · Vision · Multimodal · Evals · Guardrails |
| Apple | Machine Learning Engineer, Video Search Team | Big Tech | 8 | Recommender systems · Search & ranking · RAG · Vector DB · Model serving |
| Staff Software Engineer, Content Safety, Productionization | Big Tech | 8 | Agent orchestration · Multimodal · Model serving · RL post-training · Guardrails · LLM observability | |
| Klaviyo | Engineering Manager, ML Platform | Enterprise | 8 | Model serving |