848 AI roles tagged inference_infra.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Senior Staff Software Engineer, Applied AI | Big Tech | 9 | Agent orchestration · Model serving · Fine-tuning · Evals · Audio & speech · RL robotics | |
| Apple | Machine Learning Architect - Conversational Speech | Big Tech | 9 | Audio & speech · Multimodal · Model serving · Fine-tuning |
| Amazon | Member of Technical Staff - Machine Learning, Frontier AI Robotics | Big Tech | 9 | Embodied AI · RL robotics · Synthetic data · Multimodal · Model serving |
| Apple | Senior Machine Learning Manager, Search & Knowledge Platform | Big Tech | 9 | Fine-tuning · RL post-training · Reward modeling · LLM observability · Model serving · RAG |
| Staff Research Scientist, ML Efficiency, Google Research | Big Tech | 9 | Model serving · Fine-tuning · Quantization | |
| Research Scientist, ML Efficiency, Google Research | Big Tech | 9 | Model serving · Fine-tuning · Frontier research · Multimodal · Quantization | |
| Staff Software Engineer, Applied Research, Foundation User Models | Big Tech | 9 | Fine-tuning · Recommender systems · Model serving | |
| Senior Research Scientist, ML Efficiency, Google Research | Big Tech | 9 | Model serving · Fine-tuning · Frontier research · Quantization · Distillation | |
| Microsoft | Principal Researcher | Big Tech | 9 | Model serving |
| Amazon | Manager, Research Analysis, RBS Tech | Big Tech | 9 | Agent orchestration · RAG · Model serving · Guardrails · LLM observability · Fine-tuning |
| Staff Datacloud Blackbelt Engineer, Data and AI | Big Tech | 9 | Agent orchestration · Model serving · Multimodal · Vision | |
| Amazon | Applied Scientist, Navigation | Big Tech | 9 | Embodied AI · Agent orchestration · Model serving · Vision · Multimodal |
| Microsoft | Principal Software Engineer, Foundry Agents - CoreAI | Big Tech | 9 | Agent orchestration · Tool use · Fine-tuning · Model serving · LLM observability · Evals |
| Research Engineer, Pretraining, DeepMind | Big Tech | 9 | Pretraining · Fine-tuning · Model serving | |
| Microsoft | Research Intern - AI Agents & Efficiency | Big Tech | 9 | Agent orchestration · Agent research · Multi-agent · Tool use · Model serving |
| Amazon | Applied Scientist, Trustworthy Shopping Experience (TSE) | Big Tech | 9 | Agent orchestration · Agent research · Multimodal · Vision · Fine-tuning · Model serving |
| Senior Research Engineer, On-Device Inference, Robotics, DeepMind | Big Tech | 9 | Model serving | |
| Amazon | Senior Applied Scientist, Navigation | Big Tech | 9 | Embodied AI · Agent orchestration · Model serving |
| Amazon | Software Development Engineer, Neuron Collectives, Annapurna Labs | Big Tech | 9 | Model serving |
| Senior Machine Learning Engineer, GenAI, Google Cloud | Big Tech | 9 | Agent orchestration · Model serving · Fine-tuning · Evals · Multimodal · Vision | |
| Power and Performance Architect, TPU | Big Tech | 9 | Model serving | |
| Amazon | Senior Applied Scientist | Big Tech | 9 | Multimodal · Embodied AI · Model serving |
| Senior Software Engineering Manager, Emergent AI Infrastructure | Big Tech | 9 | Model serving | |
| ByteDance | Research Engineer - LLM Training Infrastructure - Seed Infra | Big Tech | 9 | Pretraining · Model serving |
| Staff Software Engineer, On-Device Hybrid Multimodal AI | Big Tech | 9 | Agent orchestration · Multimodal · LLM observability · Model serving · Vision · Audio & speech · Fine-tuning | |
| Forward Deployed Engineer, Generative AI, Google Cloud | Big Tech | 9 | Agent orchestration · RAG · Model serving · Evals · LLM observability · Tool use | |
| Meta | Software Engineer, AI Specialist - Monetization (Technical Leadership) | Big Tech | 9 | Model serving · Recommender systems · Search & ranking · Frontier research |
| Staff Software Engineer, AI/ML GenAI, Google Cloud AI | Big Tech | 9 | Model serving · Fine-tuning · Evals · Vision · Multimodal | |
| Forward Deployed Engineer IV, GenAI, Google Cloud | Big Tech | 9 | Agent orchestration · Tool use · Model serving · LLM observability · Guardrails | |
| ByteDance | Research Engineer - LLM/VLM Inference Optimization (Seed Infra) | Big Tech | 9 | Model serving · LLM observability |