3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Senior Software Engineer, AI/ML, AI and Infrastructure | Big Tech | 8 | Model serving · Audio & speech | |
| Smartsheet | Senior Manager, Engineering - AI & Automation | Seattle | 8 | Agent orchestration · LLM observability · RAG · Model serving |
| Apple | Machine Learning Engineer, Siri Attention & Invocation | Big Tech | 8 | Audio & speech · Fine-tuning · Model serving · Evals |
| Senior Software Engineer, AI/ML GenAI, YouTube | Big Tech | 8 | Multimodal · Vision · Model serving | |
| Microsoft | Principal AI Software Architect | Big Tech | 8 | Model serving · Training infra |
| Writer | Senior software engineer, enterprise AI platform (UK) | AI Frontier | 8 | Agent orchestration · Vector DB · Model serving |
| Senior Software Engineer, Kernels and Performance, Core ML Frameworks | Big Tech | 8 | Model serving | |
| NVIDIA | Senior Architect - Server Performance | Semiconductors | 8 | Model serving |
| F5 | Principal AI Engineer | Enterprise | 8 | Agent orchestration · Tool use · RAG · Agent research · LLM observability · Guardrails · Model serving |
| F5 | Principle AI Engineer | Enterprise | 8 | Agent orchestration · Tool use · RAG · Agent research · LLM observability · Guardrails · Vector DB · Model serving |
| F5 | AI Inference Engineer | Enterprise | 8 | Model serving · LLM observability |
| NVIDIA | Solutions Architect, Inference Deployments | Semiconductors | 8 | Model serving |
| NVIDIA | Solutions Architect, Agentic AI | Semiconductors | 8 | Agent orchestration · Agent research · Fine-tuning · Model serving · Evals · Guardrails · Multimodal · Code gen |
| NVIDIA | Senior Solutions Architect, Generative AI | Semiconductors | 8 | Model serving · Recommender systems |
| Cribl | Staff AI Platform Engineer, Corporate AI Systems | Enterprise | 8 | Agent orchestration · Model serving · Guardrails · LLM observability |
| Databricks | Senior Specialist Solutions Architect - AI & ML Engineer | Data AI | 8 | Agent orchestration · Tool use · Guardrails · RAG · Vector DB · Evals · LLM observability · Model serving |
| Intercom | Engineering Manager, AI Models Infrastructure | Enterprise | 8 | Model serving |
| Intercom | Engineering Manager, AI Models Infrastructure | Enterprise | 8 | Model serving |
| Unity | Principal Machine Learning Engineer, Mobile AI Inference Optimization | Enterprise | 8 | Model serving · Quantization · Multimodal |
| Sierra | Product Manager, Voice | AI Frontier | 8 | Audio & speech · Model serving · LLM observability |
| Microsoft | Principal Software Engineer | Big Tech | 8 | Model serving |
| Microsoft | Principal Product Manager - Foundry Inferencing & Training (CoreAI - multiple roles) | Big Tech | 8 | Model serving · Training infra |
| Disney | Sr Machine Learning Engineer | Media | 8 | Model serving · Forecasting · LLM observability |
| CrowdStrike | Sr. Software Engineer (GenAI Platform) (Hybrid, ROU) | Enterprise | 8 | Agent orchestration · RAG · Model serving |
| NVIDIA | Principal Deep Learning Communication Architect | Semiconductors | 8 | Model serving · Agent orchestration |
| Adobe | Sr. AI Systems Engineer- Agentic and Productivity Systems | Enterprise | 8 | Agent orchestration · Model serving · LLM observability · Evals · RAG · Vector DB |
| LangChain | Solutions Architect (Amsterdam) | Data AI | 8 | Agent orchestration · Agent research · Model serving · RAG · Vector DB · Evals · Guardrails · LLM observability |
| LangChain | Solutions Architect (Austin) | Data AI | 8 | Agent orchestration · Model serving · RAG · Vector DB · Fine-tuning · Evals |
| LangChain | Solutions Architect (Dallas) | Data AI | 8 | Agent orchestration · Model serving · RAG · Vector DB · Evals |
| LangChain | Solutions Architect (San Francisco) | Data AI | 8 | Agent orchestration · Model serving · RAG · Vector DB · Evals · Guardrails · LLM observability |