3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Microsoft | Research Intern - Applied Sciences Group | Big Tech | 8 | Fine-tuning · Model serving · Frontier research |
| Amazon | Applied Scientist II, Amazon Payment Products (L5) | Big Tech | 8 | Agent orchestration · LLM observability · Model serving |
| Amazon | Applied Scientist, Delivery Foundation Model | Big Tech | 8 | Multimodal · Frontier research · Pretraining · Fine-tuning · Model serving |
| Autodesk | Software Architect | Enterprise | 8 | Agent orchestration · Agent research · Tool use · Model serving · RAG · Vector DB · LLM observability · Guardrails |
| NVIDIA | Architect, AI Solutions Engineering | Semiconductors | 8 | Agent orchestration · RAG · Fine-tuning · Model serving |
| NVIDIA | Senior High-Performance System Architect | Semiconductors | 8 | Model serving |
| Walmart | Distinguished, Data Scientist | Retail | 8 | Search & ranking · Recommender systems · Agent orchestration · Tool use · LLM observability · RAG · Model serving |
| Microsoft | Senior Researcher - GPU Performance | Big Tech | 8 | Model serving |
| NVIDIA | Manager, Deep Learning Algorithms | Semiconductors | 8 | Model serving |
| Roblox | Principal Machine Learning Engineer, Alt Defense | Consumer | 8 | Agent orchestration · Model serving |
| Weights & Biases | AI Solutions Engineer, Pre-Sales- W&B | Data AI | 8 | Agent orchestration · Tool use · Evals · Guardrails · RAG · Vector DB · Fine-tuning · Model serving · LLM observability |
| Software Engineer, GKE, PhD, Early Careers | Big Tech | 8 | Agent orchestration · Agent research · Model serving | |
| Capital One | Senior Lead AI Engineer (FM Hosting, LLM Inference) | Banking | 8 | Model serving · LLM observability · Guardrails · Vector DB · Fine-tuning |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Model serving · Guardrails · LLM observability |
| GEICO | Sr Staff Engineer - Applied AI | Insurance | 8 | Agent orchestration · Agent research · RAG · Vector DB · Model serving · Guardrails · LLM observability |
| OpenAI | Software Engineer, Platform Systems | AI Frontier | 8 | LLM observability · Model serving |
| OpenAI | Software Engineer, Research - Human Data | AI Frontier | 8 | RL post-training · Evals |
| Capital One | Lead AI Engineer (FM Hosting, LLM Inference) | Banking | 8 | Model serving · LLM observability · Guardrails · Vector DB |
| Apptronik | Senior Autonomy Software Engineer | Robotics | 8 | Embodied AI · Agent orchestration · Multimodal · Model serving · Guardrails |
| Moveworks | Senior Software Engineer I, Agentic AI Product | Enterprise | 8 | Agent orchestration · Model serving · LLM observability |
| Cerebras | ML Systems Performance Engineer | Semiconductors | 8 | Model serving |
| OpenAI | Software Engineer, Codex Cloud | AI Frontier | 8 | Agent orchestration · Model serving |
| Amazon | Software Development Engineer III, Annapurna Labs | Big Tech | 8 | Agent orchestration · Model serving |
| Amazon | Principal Software Engineer, AI Domains, Alexa AI | Big Tech | 8 | Model serving · Agent orchestration · Multimodal · Vision · LLM observability · Evals · Guardrails |
| NVIDIA | Senior Deep Learning Engineer - AI for Wireless Systems | Semiconductors | 8 | Model serving · Fine-tuning · Evals |
| NVIDIA | Engineering Manager - AI for RAN and 6G Wireless Systems | Semiconductors | 8 | Model serving · Fine-tuning · Evals |
| NVIDIA | System Software Engineer - Deep Learning | Semiconductors | 8 | Model serving · Fine-tuning · Vision |
| Cribl | Staff Software Engineer, Cribl AI | Enterprise | 8 | Fine-tuning · Model serving |
| Writer | Software engineer, generative AI | AI Frontier | 8 | Agent orchestration · Tool use · RAG · Vector DB · Model serving |
| Silicon RTL Design Engineer, PhD, Early Career | Big Tech | 8 | Model serving |