Models that process or generate across modalities — text, images, audio, video — within a single architecture; covers training, fine-tuning, and application-layer integration. Primary AI lifecycle stage: pre-training, post-training, and application.
1,073 active AI roles across 126 companies in our index reference Multimodal as of today. New postings fell 30% in the last 30 days versus the prior 30 (473 → 332).
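As a quick arithmetic check on the 30-day comparison above, the relative change can be computed from the two quoted counts. This is a minimal sketch: the helper name `percent_change` is hypothetical and is not part of the index's tooling, and it assumes the 30% figure is a rounded relative change between the two counts.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the relative change from previous to current, in percent."""
    if previous == 0:
        raise ValueError("previous must be non-zero")
    return (current - previous) / previous * 100.0


# Counts quoted in the text: 473 new postings in the prior 30 days,
# 332 in the most recent 30 days.
change = percent_change(473, 332)
print(f"{change:.1f}%")  # prints -29.8%, which rounds to the quoted 30% drop
```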
The companies with the most active Multimodal listings are: Amazon (227 roles), Google (104 roles), NVIDIA (80 roles), Adobe (77 roles), Apple (41 roles).
Multimodal primarily belongs to the pre-training, post-training, and application stages of the AI lifecycle. Current hiring, however, concentrates in the agents (34%) and post-training (21%) stages.
The sectors with the most active Multimodal hiring are: Big Tech, Enterprise, Semiconductors.
93 AI roles tagged multimodal.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Uber | Senior Machine Learning Engineer - AV Foundation, AV Labs | Consumer | 10 | Frontier research · Pretraining · Fine-tuning |
| | Director, Machine Learning Engineering – Content & User Understanding | Consumer | 9 | Vision · Model serving |
| Airbnb | Principal AI/ML Researcher / Engineer Reasoning, Planning, and Decision-making systems | Consumer | 9 | Agent orchestration · Agent research · RL post-training |
| Whoop | Senior AI Researcher (Foundation AI) | Consumer | 9 | Frontier research · Pretraining · Fine-tuning · Model serving |
| Airbnb | Principal Engineer -In Bayesian, Large Foundational Systems, and Distributional Reinforcement Learning | Consumer | 9 | Agent orchestration · Multi-agent · LLM observability · Recommender systems · Frontier research |
| Uber | Machine Learning Engineer II - AV Foundation, AV Labs | Consumer | 9 | Frontier research |
| Spotify | Senior Machine Learning Engineer - Personalization, Horizon | Consumer | 9 | Agent orchestration · LLM observability · Fine-tuning · Recommender systems |
| Airbnb | Senior Staff Machine Learning Engineer, Post Training | Consumer | 9 | Fine-tuning · Model serving · Inference infra · LLM observability · Guardrails |
| Zillow | Senior Machine Learning Engineer | Consumer | 9 | Agent orchestration · Evals · Guardrails · LLM observability · Model serving |
| DoorDash | AI Research Fellowship, (Summer and Fall 2026) | Consumer | 9 | Agent orchestration · Tool use · Evals · Forecasting · Vision · Audio & speech · Frontier research · Synthetic data |
| | Master's Fall Machine Learning Internship (ATG - Visual Search) | Consumer | 9 | Agent orchestration · Tool use · Model serving · Inference infra · LLM observability |
| Uber | Director, Engineering - AV Labs | Consumer | 9 | |
| | Sr. Data Scientist, Responsible AI | Consumer | 9 | Evals · Guardrails · LLM observability · Agent research |
| Zillow | Principal Machine Learning Engineer, Agentic AI | Consumer | 9 | Agent orchestration · Evals · Guardrails · LLM observability · Model serving · Agent research |
| Zillow | Principal Applied Scientist, Agentic AI | Consumer | 9 | RL post-training · RLHF · Reward modeling · Fine-tuning · Guardrails · Agent orchestration · Evals · Vector DB |
| | Machine Learning Engineer II, Computer Vision Applied Science | Consumer | 9 | Vision · Fine-tuning · RLHF · Model serving · Evals |
| Roblox | Senior Machine Learning Engineering Manager | Consumer | 9 | Vision · LLM observability · Model serving · Fine-tuning |
| Roblox | Principal/Senior Machine Learning Scientist - Search and Discovery | Consumer | 9 | Agent orchestration · Recommender systems · Vision · Inference infra · Model serving |
| Uber | Senior Staff Machine Learning Engineer – Moonshot AI | Consumer | 9 | Vision · Audio & speech · LLM observability · Evals · Fine-tuning · RAG · Model serving · Recommender systems |
| Zillow | Principal Machine Learning Engineer, Agentic AI | Consumer | 9 | Agent orchestration · Agent research · Model serving · Inference infra · Audio & speech |
| Uber | Principal Machine Learning Engineer - AV Labs | Consumer | 9 | Model serving · Evals |
| Zillow | Distinguished Scientist | Consumer | 9 | Agent orchestration · Agent research · Multi-agent · Fine-tuning · RL post-training · Evals · LLM observability |
| Uber | Staff ML Engineer, Generative AI | Consumer | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Fine-tuning · Model serving · Audio & speech |
| Roblox | [2026] Senior Machine Learning Engineer, Natural Language Processing - PhD Early Career | Consumer | 9 | Fine-tuning · Model serving |
| Roblox | [2026] Senior Machine Learning Engineer, Multimodal AI, Computer Vision and Graphics - PhD Early Career | Consumer | 9 | Vision · Fine-tuning · Model serving |
| Whoop | Staff AI/ML Researcher (Foundation AI) | Consumer | 9 | Pretraining · Fine-tuning |
| Zillow | AI Applied Scientist - PhD Intern, Generative Computer Vision | Consumer | 9 | Vision · Fine-tuning · Evals |
| Zillow | AI Applied Scientist - PhD Intern, Foundational IQ | Consumer | 9 | Fine-tuning · Agent orchestration · Evals |
| Zillow | AI Applied Scientist - PhD Intern, Next-Gen Agentic and Multi-Modal Home Exploration Experience | Consumer | 9 | Agent orchestration · Agent research · Vision · LLM observability · Tool use · Fine-tuning |
| Zillow | AI Applied Scientist - PhD Intern, 3D Computer Vision | Consumer | 9 | Vision · Fine-tuning · Evals |