3035 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Senior Software Engineer, AI/ML Infrastructure | Big Tech | 8 | Model serving · Fine-tuning | |
| Apple | Sr. Machine Learning Engineer, Siri Speech | Big Tech | 8 | Fine-tuning · Model serving · Audio & speech |
| Amazon | Software Development Engineer, Sponsored Products and Brands | Big Tech | 8 | Agent orchestration · Model serving · Guardrails |
| Amazon | Software Development Engineer - AI/ML, Amazon Neuron, Multimodal Inference | Big Tech | 8 | Model serving |
| Amazon | Software Development Engineer, ML Systems, Annapurna Labs | Big Tech | 8 | Agent orchestration · Model serving |
| NVIDIA | Machine Learning Intern - AI Agents Conversational AI | Semiconductors | 8 | Agent orchestration · RAG · Vector DB · Audio & speech · LLM observability · Model serving |
| NVIDIA | Machine Learning Intern - 2026 | Semiconductors | 8 | Model serving |
| Adobe | Machine Learning Engineer 5 | Enterprise | 8 | Model serving · Fine-tuning |
| Expedia | Senior Machine Learning Engineer | Hospitality | 8 | Model serving · RAG · Agent orchestration · LLM observability · Guardrails |
| NVIDIA | Senior Performance Compiler Engineer - Triton | Semiconductors | 8 | Model serving |
| NVIDIA | Senior Systems Engineer, Neural Graphics | Semiconductors | 8 | Model serving · Vision · Multimodal · Agent orchestration |
| NVIDIA | Senior Data and AI Solutions Engineer | Semiconductors | 8 | Agent orchestration · Tool use · Evals · RAG · Model serving |
| Capital One | Senior Distinguished Engineer, AI Compute (Remote Eligible) | Banking | 8 | Model serving · Pretraining · Fine-tuning · Agent orchestration |
| Capital One | Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) | Banking | 8 | Agent orchestration · Model serving · Guardrails · Vector DB |
| Zendesk | Staff Security Engineer | Enterprise | 8 | Agent orchestration · Tool use · Guardrails · LLM observability · Model serving |
| Senior Staff Software Engineer, AI/ML, Google Workspace | Big Tech | 8 | Model serving · Fine-tuning · Audio & speech | |
| Software Engineer III, Google Home Video Intelligence | Big Tech | 8 | Vision · Fine-tuning · Model serving | |
| Microsoft | Principal Software Engineer - Performance | Big Tech | 8 | Model serving · LLM observability |
| Discord | Senior Software Engineer, Machine Learning (Ads) | Consumer | 8 | Recommender systems · Search & ranking · Model serving |
| Software Engineer, Gemini Live, DeepMind | Big Tech | 8 | Multimodal · Model serving · Agent orchestration · LLM observability | |
| Lead Software Engineer, Infrastructure Quality, Robotics, DeepMind | Big Tech | 8 | Agent orchestration · Model serving · LLM observability · Embodied AI | |
| JPMorgan Chase | Sr. Lead Architect-Applied AI/ML | Banking | 8 | Model serving · Recommender systems · RAG · Agent orchestration · Agent research |
| Amazon | Machine Learning SDE, Scanless Technologies | Big Tech | 8 | Model serving · Fine-tuning · Evals |
| NVIDIA | Solutions Architect - AI for Drug Discovery | Semiconductors | 8 | Model serving · Fine-tuning · RL post-training · Agent orchestration · Multimodal |
| NVIDIA | Solution Architect, Generative AI | Semiconductors | 8 | Model serving · Agent orchestration · Fine-tuning · Pretraining |
| NVIDIA | Senior GPU System Architect | Semiconductors | 8 | Model serving |
| Capital One | Distinguished AI Engineer (Remote) | Banking | 8 | Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability |
| Capital One | Lead Machine Learning Engineer (Gen AI, Python, Go, AWS) | Banking | 8 | Model serving · Agent orchestration |
| Johnson & Johnson | Senior Machine Learning Engineer - Robotics | Pharma | 8 | Embodied AI · Multimodal · Model serving · Fine-tuning |
| NVIDIA | AI Research Engineer - Applied Scientist Compilers | Semiconductors | 8 | Fine-tuning · RL post-training · Model serving · Evals |