3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | NCX Engineer, AI Accelerator | Semiconductors | 8 | Model serving · Recommender systems |
| NVIDIA | Senior HPC and AI Networking Performance Research and Analysis Engineer | Semiconductors | 8 | Pretraining · Model serving |
| Capital One | Distinguished AI Engineer | Banking | 8 | Model serving · Guardrails · Vector DB · LLM observability |
| Autodesk | Senior Applied Scientist, Personalization & Agentic Systems | Enterprise | 8 | Agent orchestration · LLM observability · RAG · Model serving · Recommender systems · Tool use |
| Workday | Senior Machine Learning Engineer | Enterprise | 8 | Agent orchestration · LLM observability · Model serving · Recommender systems · RAG · Fine-tuning |
| Visa | Senior Director, Software Engineering (GenAI/Cloud) | Fintech | 8 | Agent orchestration · RAG · Vector DB · LLM observability · Model serving |
| NVIDIA | Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026 | Semiconductors | 8 | Model serving |
| Anyscale | Distributed LLM Inference Engineer | Data AI | 8 | Model serving |
| Apple | Applied AI Engineer - iCloud Data | Big Tech | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving |
| DoorDash | Director of Engineering, Logistics | Consumer | 8 | Model serving |
| Microsoft | Senior Software Engineer - CoreAI | Big Tech | 8 | Agent orchestration · Model serving · Multimodal |
| Staff Machine Learning Engineer, AI Serving | Consumer | 8 | Model serving · LLM observability | |
| JPMorgan Chase | AWM Quant Modelling- Senior Associate | Banking | 8 | Agent orchestration · Fine-tuning · Model serving · RAG · LLM observability |
| NVIDIA | Senior AI Solutions Architect | Semiconductors | 8 | Model serving |
| Eli Lilly | Associate Director - AI Engineering | Pharma | 8 | Agent orchestration · Tool use · RAG · LLM observability · Model serving · Guardrails |
| Disney | Sr Data Scientist | Media | 8 | Multimodal · Fine-tuning · RAG · Model serving · Evals · Vector DB |
| NVIDIA | Senior Deep Learning Framework Communications Engineer | Semiconductors | 8 | Model serving |
| NVIDIA | Senior Solutions Architect, Generative AI Data Processing | Semiconductors | 8 | Agent orchestration · Model serving · LLM observability |
| Staff Software Engineer, AI/ML, Google Cloud | Big Tech | 8 | Model serving · Audio & speech | |
| JPMorgan Chase | AI Engineering Director | Banking | 8 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Vector DB · Model serving |
| World Labs | Research Platform Engineer | AI Frontier | 8 | Model serving |
| Elastic | Lead GenAI Cloud Developer | Enterprise | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability · Evals |
| Amazon | Sr. Machine Learning Compiler Engineer, AWS Neuron, Annapurna Labs | Big Tech | 8 | Model serving |
| Intel | GPU Power Architect | Semiconductors | 8 | Model serving |
| NVIDIA | Director, System Software Engineering - Metropolis Accelerated and Inferencing Software | Semiconductors | 8 | Model serving · Multimodal · Vision · Agent orchestration · LLM observability |
| NVIDIA | Director, Isaac for Healthcare Engineering | Semiconductors | 8 | Synthetic data · Model serving · Embodied AI |
| NVIDIA | Senior Solutions Architect - Deep Learning | Semiconductors | 8 | Model serving · Agent orchestration |
| NVIDIA | Senior Software Architect - Deep Learning and HPC Communications | Semiconductors | 8 | Model serving |
| ClickUp | Staff AI Engineer - AI Platform | Enterprise | 8 | Agent orchestration · Model serving · LLM observability · RAG |
| ClickUp | Senior AI Engineer - AI Platform | Enterprise | 8 | Agent orchestration · Model serving · LLM observability · Guardrails |