3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | Senior AI Infrastructure Software Engineer | Semiconductors | 8 | Agent orchestration · Model serving · RAG · Vector DB · Fine-tuning |
| JPMorgan Chase | AWM Risk Analytics Group – Data Scientist - Vice President | Banking | 8 | Fine-tuning · Model serving · LLM observability · Evals |
| Writer | AI engineer | AI Frontier | 8 | Agent orchestration · LLM observability · Model serving |
| JPMorgan Chase | Agentic Development - Vice President | Banking | 8 | Agent orchestration · Agent research · LLM observability · RAG · Model serving · Tool use |
| Cohere | Staff Software Engineer, GPU Infrastructure (HPC) | AI Frontier | 8 | Model serving |
| Cerebras | AI Models, Product Manager | Semiconductors | 8 | Model serving · Agent orchestration · Quantization · Fine-tuning |
| Whatnot | Senior Engineering Manager, ML Platform | Consumer | 8 | Model serving |
| Amazon | Sr Software Development Manager, Generative AI for AWS Neuron | Big Tech | 8 | Agent orchestration · Model serving · Code gen |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Model serving · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Evals |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving |
| Walmart | (USA) Staff, Software Engineer | MLE | Retail | 8 | Multimodal · Vision · Fine-tuning · Model serving |
| Microsoft | Principal Applied Scientist | Big Tech | 8 | Agent orchestration · LLM observability · Model serving |
| Senior Software Engineer, AI/ML GenAI, Google Workspace | Big Tech | 8 | Multimodal · Vision · Model serving | |
| Capital One | Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services) | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving |
| Microsoft | Senior AI Software Architect | Big Tech | 8 | Model serving · Quantization · Fine-tuning |
| Datadog | Manager I, Engineering - AI Platform - Training & Serving | Enterprise | 8 | Model serving |
| Amazon | Sr. Machine Learning Engineer, AWS Applied AI Solution | Big Tech | 8 | Agent orchestration · Model serving · Fine-tuning |
| Capital One | Director, AI Engineering | Banking | 8 | Agent orchestration · Agent research · Model serving |
| Cohere | Site Reliability Engineer, Inference Infrastructure | AI Frontier | 8 | Model serving |
| Cohere | Staff Software Engineer, Inference Infrastructure | AI Frontier | 8 | Model serving |
| JPMorgan Chase | Lead Machine Learning Engineer-MLOps | Banking | 8 | Model serving · LLM observability · Vector DB · Recommender systems |
| Synthesia | Senior Research Engineer - Audio Post-Training | Multimodal | 8 | Audio & speech · Fine-tuning · RL post-training · Model serving · Multimodal |
| Ramp | Applied AI Engineer | Fintech | 8 | Agent orchestration · RAG · Fine-tuning · Model serving |
| NVIDIA | Distinguished Engineer, JAX | Semiconductors | 8 | Model serving |
| NVIDIA | Senior Software Architect - Deep Learning and HPC Communications | Semiconductors | 8 | Model serving |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Fine-tuning · Model serving · Guardrails · LLM observability · RAG · Vector DB · Evals |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Fine-tuning · Model serving · Guardrails · LLM observability · RAG · Vector DB · Evals |
| NVIDIA | Distinguished Engineer - Dynamo | Semiconductors | 8 | Model serving |
| NVIDIA | Principal Software Engineer - Dynamo | Semiconductors | 8 | Model serving · LLM observability · Agent orchestration |
| NVIDIA | Principal Software Engineer – Large-Scale LLM Memory and Storage Systems | Semiconductors | 8 | Model serving |