Vision

Vision covers computer vision tasks (classification, detection, segmentation, OCR, visual reasoning), now mostly delivered through vision-language models that share an LLM backbone.

Primary AI lifecycle stages: application and pre-training.

As of today, 524 active AI roles across 86 companies in our index reference Vision. Hiring concentrates at the agents (26%) and post-training (20%) stages. Most common sectors: Big Tech, Semiconductors, Enterprise.

Top hiring: Amazon (116 roles), Google (60 roles), NVIDIA (51 roles), Adobe (25 roles), ByteDance (20 roles).

Roles by function: Engineering (526), Research (212), Product (28).

766 AI roles are tagged Vision across all listings, including inactive ones.

Company	Title	Sector	AI score	Other tags
Google	Staff Software Engineer, Generative AI	Big Tech	9	Agent orchestration · Inference infra · Model serving · Evals · Fine-tuning · Multimodal
NVIDIA	AI Inference Performance Engineer - New College Grad 2026	Semiconductors	9	Inference infra · Model serving · Quantization · Audio & speech
Oracle	Snr Director, Applied Science	Enterprise	9	Multimodal · Agent orchestration · Model serving · Inference infra · RAG · Evals · Guardrails · LLM observability · Audio & speech
Microsoft	Member of Technical Staff, Microsoft Robotics (Spatial AI)	Big Tech	9	Embodied AI · Multimodal · Agent orchestration · Model serving · Inference infra
Microsoft	Member of Technical Staff, Microsoft Robotics (Robot Learning)	Big Tech	9	Embodied AI · RL robotics · Multimodal · Fine-tuning · Model serving · Evals
OpenAI	RE/RS, Data Understanding (MM)	AI Frontier	9	Synthetic data · Multimodal · Audio & speech
Google	Research Scientist, Frontier Health, DeepMind	Big Tech	9	Agent orchestration · Multimodal · RL post-training · Reward modeling · Evals · Tool use · Audio & speech
Aurora Innovation	Applied Researcher	Robotics	9	Multimodal
Aurora Innovation	Applied Researcher	Robotics	9	Multimodal
Aurora Innovation	Applied Researcher	Robotics	9	Multimodal
Amazon	Applied Scientist II, GenAI Evaluation Media (GEM)	Big Tech	9	Agent orchestration · Multimodal · Model serving · Evals
Amazon	Member of Technical Staff - Science, Frontier AI & Robotics (FAR)	Big Tech	9	Multimodal · Frontier research · Fine-tuning · Model serving · Embodied AI
GE Healthcare	AI Research Intern	Healthcare	9	Frontier research · Multimodal · Fine-tuning
Skydio	Autonomy Engineer - Deep Learning	Defense	9	Multimodal · Fine-tuning · Model serving · Inference infra · Synthetic data · Embodied AI
Aurora Innovation	Applied Researcher	Robotics	9	Multimodal
Amazon	Applied Scientist II, Amazon AWS Agentic AI, AWS AI Fundamental Research	Big Tech	9	Agent research · Frontier research · Multimodal · Audio & speech · Agent orchestration · Fine-tuning
Amazon	Applied Scientist, Prime Video - Generative AI	Big Tech	9	Multimodal · Fine-tuning · Agent orchestration · Model serving
Amazon	Applied Scientist, Alexa Edge AI	Big Tech	9	Audio & speech · Multimodal · Fine-tuning · Frontier research · Model serving · Inference infra
Amazon	Applied Scientist, Alexa Edge AI	Big Tech	9	Multimodal · Audio & speech · Fine-tuning · Model serving · Inference infra
Amazon	Applied Scientist, Alexa Edge AI	Big Tech	9	Audio & speech · Multimodal · Fine-tuning · Model serving · Inference infra
Pinterest	Director, Machine Learning Engineering – Content & User Understanding	Consumer	9	Multimodal · Model serving
Amazon	Applied scientist, Agentic AI, AWS Agentic AI	Big Tech	9	Agent orchestration · Agent research · Multimodal · RL robotics
NVIDIA	Senior Systems Software Engineer, Machine Learning	Semiconductors	9	Agent orchestration · LLM observability · Multimodal · Model serving
Amazon	Applied Scientist, Amazon Robotics	Big Tech	9	Multimodal · RL robotics · Embodied AI · Fine-tuning
Wayve	Staff Machine Learning Engineer, AV Core	Robotics	9	Embodied AI · Multimodal · Fine-tuning · Model serving · Evals · Interpretability · Pretraining · Agent orchestration
Amazon	Sr. Applied Scientist, AWS Just-Walk-Out Science Team	Big Tech	9	Agent orchestration · Agent research · Multimodal
Axon	AI Scientist I	Enterprise	9	Multimodal · Fine-tuning · Inference infra · Model serving
AMD	Principal AI Performance Modeling Architect	Semiconductors	9	Inference infra · Model serving · Fine-tuning · Multimodal · Audio & speech
ABBYY	Principal Machine Learning Engineer - Model Efficiency Optimization	Enterprise	9	Model serving · Inference infra · Fine-tuning · Quantization · Multimodal
Google	Research Scientist, Gemini Vision, DeepMind	Big Tech	9	Multimodal · Frontier research · Pretraining · Fine-tuning · RL post-training · Agent research · Agent orchestration · Model serving · Inference infra

Frequently asked questions

What is Vision in AI?
Vision covers computer vision tasks (classification, detection, segmentation, OCR, visual reasoning), now mostly delivered through vision-language models that share an LLM backbone. Primary AI lifecycle stages: application and pre-training.
How many AI roles reference Vision right now?
524 active AI roles across 86 companies in our index reference Vision as of today.
Which companies are hiring for Vision roles?
The companies with the most active Vision listings are: Amazon (116 roles), Google (60 roles), NVIDIA (51 roles), Adobe (25 roles), ByteDance (20 roles).
What AI lifecycle stage does Vision belong to?
Vision primarily belongs to the application and pre-training stages of the AI lifecycle. In current hiring, Vision roles concentrate at the agents (26%) and post-training (20%) stages.
What sectors invest most in Vision?
The sectors with the most active Vision hiring are: Big Tech, Semiconductors, Enterprise.