Computer vision tasks (classification, detection, segmentation, OCR, visual reasoning) are now mostly delivered through vision-language models that share an LLM backbone.
Primary AI lifecycle stages: application and pre-training.
As of today, 525 active AI roles across 89 companies in our index reference Vision. Hiring concentrates at the agents (25%) and application (19%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.
The companies with the most active Vision listings are Amazon (117 roles), Google (61 roles), NVIDIA (51 roles), Adobe (25 roles), and ByteDance (21 roles).
27 AI roles tagged vision.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Luma AI | Research Scientist - World Model | AI Frontier | 10 | Frontier research · Embodied AI · Multimodal · Model serving · Agent research · RL robotics |
| Mistral AI | Applied Scientist / Research Engineer - EMEA | AI Frontier | 10 | Pretraining · Fine-tuning · Model serving · Inference infra · Agent orchestration · RAG · Multimodal · Audio & speech |
| OpenAI | RE/RS, Data Understanding (MM) | AI Frontier | 9 | Synthetic data · Multimodal · Audio & speech |
| Mistral AI | Applied Scientist / Research Engineer - Singapore | AI Frontier | 9 | Pretraining · Fine-tuning · Model serving · Agent orchestration · RAG · Multimodal · Audio & speech |
| xAI | Member of Technical Staff - Multimodal Understanding | AI Frontier | 9 | Multimodal · Pretraining · Fine-tuning · Model serving · Inference infra · Evals · Audio & speech · Agent orchestration · Tool use · Frontier research |
| Physical Intelligence | Robotics Research Engineer | AI Frontier | 9 | Embodied AI · Multimodal · Synthetic data · RL robotics · Model serving |
| Mistral AI | AI Scientist - Robotics | AI Frontier | 9 | Embodied AI · Multimodal · Fine-tuning · Evals · Model serving |
| Character AI | Research Engineer, Multimodal | AI Frontier | 9 | Fine-tuning · RLHF · Multimodal · Audio & speech · Model serving · Inference infra · Synthetic data |
| xAI | Member of Technical Staff - Imagine Model | AI Frontier | 9 | Multimodal · Audio & speech · Fine-tuning · RL post-training · Agent orchestration · Model serving · Inference infra · Evals · Synthetic data |
| World Labs | Research Engineer (Scaling Multimodal Data) | AI Frontier | 9 | Synthetic data · Multimodal · Model serving · Evals |
| Lila Sciences | Machine Learning Scientist I/II, Multi-Modal Scientific Reasonings | AI Frontier | 9 | Multimodal · Fine-tuning · RAG · Evals |
| Stability AI | Multimodal Generative AI Researcher | AI Frontier | 9 | Fine-tuning · Multimodal · LLM observability · RAG · Agent research · Frontier research · Interpretability · Synthetic data · Model serving · Inference infra |
| Anthropic | Research Engineer / Research Scientist, Vision | AI Frontier | 9 | Multimodal · Fine-tuning · RL post-training · Agent orchestration · Evals · LLM observability |
| Stability AI | Generative AI Inference Engineer | AI Frontier | 9 | Model serving · Inference infra · Multimodal |
| Lila Sciences | ML Research Scientist I/II, Multimodal Data Extraction | AI Frontier | 9 | Multimodal · Fine-tuning · LLM observability |
| Cohere | Senior Member of Technical Staff, Multimodal AI | AI Frontier | 9 | Multimodal · Audio & speech · Fine-tuning · Model serving · Evals |
| OpenAI | Full Stack Software Engineer, ChatGPT ImageGen | AI Frontier | 8 | Multimodal · Model serving · Inference infra |
| Perplexity | Member of Technical Staff (AI Software Engineer, Multimodal) | AI Frontier | 8 | Multimodal · Agent orchestration · Model serving · Inference infra · Audio & speech · Evals |
| Anthropic | Software Engineer, Claude Design | AI Frontier | 8 | Multimodal · Model serving · Inference infra |
| Lila Sciences | Principal / Sr. Principal BioML Scientist | AI Frontier | 8 | Agent orchestration · Embodied AI · Evals · Multimodal |
| xAI | Member of Technical Staff - Imagine Product | AI Frontier | 8 | Multimodal · Model serving · Inference infra · Audio & speech |
| Perplexity | Member of Technical Staff (Data Scientist, Evals) | AI Frontier | 8 | Evals · LLM observability · RAG · Tool use |
| Lila Sciences | Co-Op, Data Extraction | AI Frontier | 7 | Fine-tuning · Evals · Multimodal |
| Anthropic | Staff+ Software Engineer, Backend | AI Frontier | 7 | Model serving · Inference infra · Agent orchestration · Tool use |
| OpenAI | Research Engineer, 3D & Multi-View Geometry | AI Frontier | 7 | Multimodal · Inference infra · Model serving |
| xAI | Image Tutor | AI Frontier | 7 | Synthetic data · Fine-tuning |
| Lila Sciences | Senior Automated Systems Engineer | AI Frontier | 5 | Agent orchestration |