3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | Solution Architect, Energy | Semiconductors | 8 | Model serving |
| Capital One | Senior Distinguished AI Engineer | Banking | 8 | Model serving · Fine-tuning · Guardrails · LLM observability · Vector DB |
| Capital One | Lead AI Engineer (MLX) | Banking | 8 | Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability · Evals |
| Hex | AI Engineering Lead | Data AI | 8 | Agent orchestration · Agent research · Evals · LLM observability · Model serving · Search & ranking |
| Samsara | Staff ML Engineer - ML Infrastructure | Enterprise | 8 | Model serving |
| Cerebras | Engineering Manager, Inference ML Runtime | Semiconductors | 8 | Model serving · Multimodal · LLM observability |
| Amazon | Software Engineer II- AI/ML, AWS Neuron | Big Tech | 8 | Model serving · Fine-tuning |
| Amazon | Principal GenAI Specialist SA | Big Tech | 8 | Agent orchestration · Fine-tuning · Model serving · RAG · Vector DB · LLM observability |
| NVIDIA | Developer Relations Manager – AI Natives | Semiconductors | 8 | Model serving · Agent orchestration · Multimodal |
| Microsoft | Principal Software Engineer - CoreAI Model Inference & Serving | Big Tech | 8 | Model serving · LLM observability |
| Microsoft | Principal Software Engineer, CoreAI | Big Tech | 8 | Model serving · Multimodal |
| Microsoft | Member of Technical Staff, AI Systems Engineer - Microsoft Superintelligence | Big Tech | 8 | Model serving |
| JPMorgan Chase | Software Engineer III - Applied AI | Banking | 8 | Agent orchestration · Model serving · RAG · Fine-tuning |
| Amazon | Applied Scientist | Big Tech | 8 | Recommender systems · Model serving |
| Amazon | Software Development Engineer II, Items and Relationships Platform | Big Tech | 8 | Model serving · LLM observability · Vector DB · RAG · Agent orchestration · Multimodal · Vision |
| Capital One | Lead AI Engineer | Banking | 8 | Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability |
| Capital One | Senior Lead AI Engineer | Banking | 8 | Model serving · Guardrails · Vector DB · RAG · LLM observability · Fine-tuning |
| NVIDIA | Senior AI Performance and Efficiency Engineer | Semiconductors | 8 | Model serving |
| NVIDIA | Senior AI Developer Technology Engineer | Semiconductors | 8 | Model serving |
| Capital One | Lead AI Engineer (Gen AI Platform, Agentic AI & LLM Infrastructure & Orchestration) | Banking | 8 | Agent orchestration · LLM observability · RAG · Vector DB · Guardrails · Model serving |
| Klaviyo | Sr. Lead AI Engineer | Enterprise | 8 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Fine-tuning · Model serving |
| Cerebras | ML Performance Benchmarking Engineer | Semiconductors | 8 | Model serving · LLM observability |
| Airbnb | Senior Software Engineer, BizTech(AI Products) | Consumer | 8 | Agent orchestration · RAG · LLM observability · Model serving |
| Netflix | Technical Director, GenAI - Games | Big Tech | 8 | Multimodal · Model serving · Fine-tuning |
| Capital One | Senior Lead AI Engineer | Banking | 8 | Model serving · Fine-tuning · Guardrails · LLM observability · Vector DB · RAG · Evals |
| Google | Senior Staff Software Engineer, AI/ML GenAI, Google Ads | Big Tech | 8 | Vision · Model serving |
| Google | Senior Software Engineering Manager, AI/ML, Google Cloud AI | Big Tech | 8 | Model serving · Fine-tuning · Evals · Audio & speech · RL robotics |
| NVIDIA | Engineering Manager, AI Developer Technology | Semiconductors | 8 | Model serving · Recommender systems · Multimodal |
| NVIDIA | Senior Developer Technology Engineer - AI | Semiconductors | 8 | Model serving · Recommender systems |
| Capital One | Lead AI Engineer (AI Foundations, LLM Customization and Finetuning) | Banking | 8 | Fine-tuning · Model serving · Guardrails · Vector DB · LLM observability · Evals |