Computer vision tasks (classification, detection, segmentation, OCR, visual reasoning) are now mostly delivered through vision-language models that share an LLM backbone. Primary AI lifecycle stage: application and pre-training.
524 active AI roles across 86 companies in our index reference Vision as of today.
The companies with the most active Vision listings are: Amazon (116 roles), Google (60 roles), NVIDIA (51 roles), Adobe (25 roles), ByteDance (20 roles).
Vision primarily belongs to the application and pre-training stages of the AI lifecycle. In current hiring, Vision roles concentrate in the agents (26%) and post-training (20%) stages.
The sectors with the most active Vision hiring are: Big Tech, Semiconductors, Enterprise.
766 AI roles tagged vision.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| | Staff Software Engineer, Generative AI | Big Tech | 9 | Agent orchestration · Inference infra · Model serving · Evals · Fine-tuning · Multimodal |
| NVIDIA | AI Inference Performance Engineer - New College Grad 2026 | Semiconductors | 9 | Inference infra · Model serving · Quantization · Audio & speech |
| Oracle | Snr Director, Applied Science | Enterprise | 9 | Multimodal · Agent orchestration · Model serving · Inference infra · RAG · Evals · Guardrails · LLM observability · Audio & speech |
| Microsoft | Member of Technical Staff, Microsoft Robotics (Spatial AI) | Big Tech | 9 | Embodied AI · Multimodal · Agent orchestration · Model serving · Inference infra |
| Microsoft | Member of Technical Staff, Microsoft Robotics (Robot Learning) | Big Tech | 9 | Embodied AI · RL robotics · Multimodal · Fine-tuning · Model serving · Evals |
| OpenAI | RE/RS, Data Understanding (MM) | AI Frontier | 9 | Synthetic data · Multimodal · Audio & speech |
| | Research Scientist, Frontier Health, DeepMind | Big Tech | 9 | Agent orchestration · Multimodal · RL post-training · Reward modeling · Evals · Tool use · Audio & speech |
| Aurora Innovation | Applied Researcher | Robotics | 9 | Multimodal |
| Aurora Innovation | Applied Researcher | Robotics | 9 | Multimodal |
| Aurora Innovation | Applied Researcher | Robotics | 9 | Multimodal |
| Amazon | Applied Scientist II, GenAI Evaluation Media (GEM) | Big Tech | 9 | Agent orchestration · Multimodal · Model serving · Evals |
| Amazon | Member of Technical Staff - Science, Frontier AI & Robotics (FAR) | Big Tech | 9 | Multimodal · Frontier research · Fine-tuning · Model serving · Embodied AI |
| GE Healthcare | AI Research Intern | Healthcare | 9 | Frontier research · Multimodal · Fine-tuning |
| Skydio | Autonomy Engineer - Deep Learning | Defense | 9 | Multimodal · Fine-tuning · Model serving · Inference infra · Synthetic data · Embodied AI |
| Aurora Innovation | Applied Researcher | Robotics | 9 | Multimodal |
| Amazon | Applied Scientist II, Amazon AWS Agentic AI, AWS AI Fundamental Research | Big Tech | 9 | Agent research · Frontier research · Multimodal · Audio & speech · Agent orchestration · Fine-tuning |
| Amazon | Applied Scientist, Prime Video - Generative AI | Big Tech | 9 | Multimodal · Fine-tuning · Agent orchestration · Model serving |
| Amazon | Applied Scientist, Alexa Edge AI | Big Tech | 9 | Audio & speech · Multimodal · Fine-tuning · Frontier research · Model serving · Inference infra |
| Amazon | Applied Scientist, Alexa Edge AI | Big Tech | 9 | Multimodal · Audio & speech · Fine-tuning · Model serving · Inference infra |
| Amazon | Applied Scientist, Alexa Edge AI | Big Tech | 9 | Audio & speech · Multimodal · Fine-tuning · Model serving · Inference infra |
| | Director, Machine Learning Engineering – Content & User Understanding | Consumer | 9 | Multimodal · Model serving |
| Amazon | Applied scientist, Agentic AI, AWS Agentic AI | Big Tech | 9 | Agent orchestration · Agent research · Multimodal · RL robotics |
| NVIDIA | Senior Systems Software Engineer, Machine Learning | Semiconductors | 9 | Agent orchestration · LLM observability · Multimodal · Model serving |
| Amazon | Applied Scientist, Amazon Robotics | Big Tech | 9 | Multimodal · RL robotics · Embodied AI · Fine-tuning |
| Wayve | Staff Machine Learning Engineer, AV Core | Robotics | 9 | Embodied AI · Multimodal · Fine-tuning · Model serving · Evals · Interpretability · Pretraining · Agent orchestration |
| Amazon | Sr. Applied Scientist, AWS Just-Walk-Out Science Team | Big Tech | 9 | Agent orchestration · Agent research · Multimodal |
| Axon | AI Scientist I | Enterprise | 9 | Multimodal · Fine-tuning · Inference infra · Model serving |
| AMD | Principal AI Performance Modeling Architect | Semiconductors | 9 | Inference infra · Model serving · Fine-tuning · Multimodal · Audio & speech |
| ABBYY | Principal Machine Learning Engineer - Model Efficiency Optimization | Enterprise | 9 | Model serving · Inference infra · Fine-tuning · Quantization · Multimodal |
| | Research Scientist, Gemini Vision, DeepMind | Big Tech | 9 | Multimodal · Frontier research · Pretraining · Fine-tuning · RL post-training · Agent research · Agent orchestration · Model serving · Inference infra |