3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Walmart | (USA) Principal, Software Engineer | Retail | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Evals · Model serving |
| Deloitte | AI Engineer Consultant | Consulting | 8 | RAG · Vector DB · Fine-tuning · Model serving · Guardrails · LLM observability · Tool use |
| Skydio | Autonomy Engineer Intern - Deep Learning (Computational Photography) | Defense | 8 | Fine-tuning · Model serving · Vision · Synthetic data |
| JPMorgan Chase | SR Principal Software Engineer - LLM Engineering | Banking | 8 | Model serving |
| Staff Software Engineer, On-Device Machine Learning Infrastructure | Big Tech | 8 | Model serving · Fine-tuning · Audio & speech · Evals | |
| Software Engineering Manager, Automotive AI Agent | Big Tech | 8 | Agent orchestration · LLM observability · Model serving · Multimodal | |
| Canva | Engineering Manager (BE) - AI Media Platform | Enterprise | 8 | Model serving · Multimodal |
| Senior Software Engineer, AI/ML, Search Growth | Big Tech | 8 | Recommender systems · Search & ranking · Model serving · Fine-tuning · LLM observability · Multimodal | |
| LangChain | Solutions Architect (Remote) | Data AI | 8 | Agent orchestration · Model serving · RAG · Vector DB · Evals |
| Modal | Member of Technical Staff - ML Performance | Data AI | 8 | Model serving |
| Amazon | Sr. Applied Scientist, Special Projects | Big Tech | 8 | Model serving · Frontier research |
| NVIDIA | Senior AI-Native Systems Software Engineer, TensorRT | Semiconductors | 8 | Agent orchestration · Agent research · Multimodal · Model serving · Code gen · Vision · Audio & speech |
| Intel | Principal Engineer – Distributed AI Systems Architecture (Heterogeneous Compute) | Semiconductors | 8 | Model serving |
| NVIDIA | Senior Performance Engineer - LLM Inference Frameworks | Semiconductors | 8 | Model serving · Quantization |
| Uber | Sr Software Engineer | Consumer | 8 | Recommender systems · Search & ranking · Model serving |
| OpenAI | Performance Modeling Lead | AI Frontier | 8 | Model serving |
| Software Engineer III, AI/ML, Google Cloud | Big Tech | 8 | Model serving · Multimodal · Vision | |
| Forward Deployed Architect, Generative AI, Google Cloud | Big Tech | 8 | Agent orchestration · RAG · Vector DB · Model serving · Evals · LLM observability · Fine-tuning · Multimodal | |
| Staff Software Engineer, Games, Inception, DeepMind | Big Tech | 8 | Agent orchestration · Model serving | |
| Senior Staff ML Engineer, Search & Recommendation | Consumer | 8 | Recommender systems · Search & ranking · Model serving · RAG · LLM observability · Fine-tuning | |
| Software Engineer III, AI/ML GenAI, YouTube | Big Tech | 8 | Multimodal · Vision · Audio & speech · Model serving | |
| NVIDIA | OEM Solutions Architect - AI Full Stack Public Sector | Semiconductors | 8 | Model serving · Fine-tuning |
| NVIDIA | AI Computing Development Engineer, TensorRT-LLM | Semiconductors | 8 | Model serving · Fine-tuning |
| SoFi | Director, AI Platforms | Fintech | 8 | Model serving · Agent orchestration · RAG · Evals · LLM observability · Guardrails |
| Intercom | AI Infrastructure Engineer | Enterprise | 8 | Model serving · LLM observability |
| Intercom | AI Infrastructure Engineer | Enterprise | 8 | Model serving |
| NVIDIA | Senior Software Engineer, JAX | Semiconductors | 8 | Model serving |
| Intel | Research and Pathfinding Internship: AI Workload Compiler Optimization for CPU and GPU | Semiconductors | 8 | Model serving |
| Walmart | (USA) Distinguished, Software Engineer | Retail | 8 | Model serving · LLM observability · Guardrails · Agent orchestration · Tool use |
| Adobe | Machine Learning Architect 5 - GenAI Experiences | Enterprise | 8 | Agent orchestration · RAG · LLM observability · Recommender systems · Model serving |