3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Senior Software Engineer, AI/ML, Youtube Ads | Big Tech | 8 | Recommender systems · Search & ranking · Model serving · Audio & speech | |
| Microsoft | Senior Software Engineer | Big Tech | 8 | LLM observability · Model serving |
| Microsoft | Senior Applied Scientist | Big Tech | 8 | Agent orchestration · RAG · Recommender systems · Search & ranking · LLM observability · Model serving |
| Software Engineer, Recommendations, Rankings, Predictions, Consumer Shopping Commerce | Big Tech | 8 | Agent orchestration · Recommender systems · Model serving | |
| Staff Software Engineer, GPU Performance | Big Tech | 8 | Model serving | |
| Apple | Senior Machine Learning Engineer, Agentic Workflows - Software Delivery | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Code gen · Model serving · Evals |
| Microsoft | Applied Scientist II | Big Tech | 8 | RAG · Search & ranking · Recommender systems · LLM observability · Model serving |
| Amazon | Software Development Engineer, Sponsored Products and Brands | Big Tech | 8 | Agent orchestration · Tool use · Fine-tuning · Model serving · RAG · LLM observability · Guardrails · RL post-training · Reward modeling |
| Senior Software Developing Manager, ML Infrastructure, Core Infra | Big Tech | 8 | Model serving · Fine-tuning · Evals · Multimodal · Vision | |
| Senior Software Engineer, AI/ML GenAI, Google Cloud Compute Infrastructure | Big Tech | 8 | Model serving · Multimodal · Vision | |
| Apple | Director of Algorithms, Ads Engineering | Big Tech | 8 | Recommender systems · Search & ranking · Model serving |
| Glean | Tech Lead Manager, Agentic Runtime | Enterprise | 8 | Agent orchestration · Tool use · Model serving · LLM observability |
| JPMorgan Chase | Applied AI/ML Lead - Vice President - Payments | Banking | 8 | Fine-tuning · Model serving |
| Amazon | Applied Scientist, SSG Science | Big Tech | 8 | Fine-tuning · Model serving · Quantization · Distillation |
| Capital One | Lead AI Engineer (FM Hosting, LLM Inference) | Banking | 8 | Model serving · LLM observability · Guardrails · Vector DB · Fine-tuning |
| Intel | Senior GenAI Software Solutions Engineer | Semiconductors | 8 | Agent orchestration · Tool use · Model serving · Quantization · Distillation · RAG · Vector DB · Fine-tuning |
| NVIDIA | Senior AI Infrastructure Software Engineer - DGX Cloud | Semiconductors | 8 | Model serving · LLM observability · Agent orchestration |
| NVIDIA | Software Engineer - AI Research Clusters | Semiconductors | 8 | Model serving · Agent orchestration |
| Pfizer | Director, AI Engineering--Clinical Development and Operations (CD&O) | Pharma | 8 | Agent orchestration · LLM observability · RAG · Fine-tuning · Model serving |
| ZoomInfo | Senior Machine Learning Engineer | Enterprise | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · Evals |
| AI/ML Software Engineering Manager, Google Home Camera | Big Tech | 8 | Model serving · Fine-tuning · Evals · Vision | |
| JPMorgan Chase | GenAI Engineering - Executive Director | Banking | 8 | LLM observability · Evals · Guardrails · Fine-tuning · Model serving |
| Amazon | Applied Scientist II, Amazon Travel & Events | Big Tech | 8 | Multimodal · Vision · RAG · Fine-tuning · Model serving · Interpretability |
| Capital One | Sr. Lead AI Engineer (GenAI Platform) | Banking | 8 | Model serving · Fine-tuning · RAG · Vector DB · Guardrails · LLM observability · Evals |
| NVIDIA | Senior Engineer - AI Agents and Systems | Semiconductors | 8 | Agent orchestration · Agent research · Model serving · Tool use · Guardrails |
| NVIDIA | Senior Engineer - AI Agents and Systems | Semiconductors | 8 | Agent orchestration · Agent research · Model serving |
| BCG | BCG Platinion | Principal Architect - AI Platforms | Consulting | 8 | Agent orchestration · RAG · LLM observability · Model serving |
| BCG | BCG Platinion | Lead IT Architect - AI Platforms | Consulting | 8 | Agent orchestration · RAG · LLM observability · Model serving |
| Baseten | Engineering Manager - Forward Deployed Engineering (LLM) | Data AI | 8 | Model serving · LLM observability |
| Baseten | Manager, Solutions Architect | Data AI | 8 | Model serving · LLM observability |