How many AI roles reference Synthetic data right now?

234 active AI roles across 63 companies in our index reference Synthetic data as of today. New postings fell 25% in the last 30 days versus the prior 30 (85 → 64).

Which companies are hiring for Synthetic data roles?

The companies with the most active Synthetic data listings are: xAI (35 roles), NVIDIA (22 roles), Handshake (15 roles), Amazon (14 roles), Google (10 roles).

What AI lifecycle stage does Synthetic data belong to?

Synthetic data primarily belongs to the data stage of the AI lifecycle. In current hiring, Synthetic data roles concentrate at: data (55%), agents (19%).

What sectors invest most in Synthetic data?

The sectors with the most active Synthetic data hiring are: AI Frontier, Big Tech, Enterprise.

← Tag co-occurrence network

Synthetic data

Generating training data via LLMs or simulators to augment or replace human-labeled datasets, especially for rare cases, long-tail tasks, or alignment data.

Primary AI lifecycle stage: data.

As of today, 234 active AI roles across 63 companies in our index reference Synthetic data. Hiring concentrates at the data (55%) and agents (19%) stages. Most common sectors: AI Frontier, Big Tech, Enterprise. New postings fell 25% in the last 30 days versus the prior 30 (85 → 64).

Top hiring:

Function

All Engineering · 252 Research · 116 Product · 22

Status

All Active only

Sort

AI score Recently posted Company A–Z

FilteredsectorBig Tech×

104 AI roles tagged synthetic_data.

Company	Title	Sector	AI score	Other tags
Google	Research Scientist, Applied ML, Quantum Error Correction	Big Tech	10	Evals · Frontier research
Meta	Research Scientist – World Models, Robotics & Embodied AI	Big Tech	10	Embodied AI · Agent research · Multimodal · Vision · RL robotics · Model serving
Meta	Research Scientist, AI, Formal and informal Reasoning	Big Tech	10	Frontier research · Agent research · RL post-training · Fine-tuning · Evals
Meta	AI Research Scientist, Media Data Research - MSL FAIR	Big Tech	10	Frontier research · Multimodal · Pretraining · Fine-tuning
Meta	AI Research Scientist, Text Data Research - MSL FAIR	Big Tech	10	Frontier research · Pretraining · RL post-training · Agent research
Google	Research Scientist, Visual Data and Generative Research	Big Tech	9	Fine-tuning · Vision · Multimodal · Evals
Google	Research Scientist, Manipulation for Robotics, DeepMind	Big Tech	9	Embodied AI · Agent research · Multimodal · Model serving · Frontier research
Meta	Research Scientist, Post-Training (Tech Leadership)- Meta Superintelligence Labs	Big Tech	9	RL post-training · Fine-tuning · Agent research · Agent orchestration · LLM observability
Google	Senior Product Manager, Gemini Internationalization Modeling, DeepMind	Big Tech	9	Pretraining · RL post-training · Evals · Multimodal · Agent orchestration · LLM observability
Google	Research Scientist, Stitch	Big Tech	9	Agent orchestration · Tool use · Evals · Agent research · Fine-tuning · RL post-training
Amazon	Applied Scientist II, Foundation Model, Robotics	Big Tech	9	Embodied AI · Agent orchestration · Fine-tuning · Multimodal · Vision · RL robotics
Amazon	Member of Technical Staff - Machine Learning, Frontier AI Robotics	Big Tech	9	Embodied AI · RL robotics · Multimodal · Model serving · Inference infra
Microsoft	Principal Researcher - Agentic AI - Microsoft Research AI Frontiers	Big Tech	9	Agent orchestration · Agent research · Multi-agent · Frontier research · Pretraining · RL post-training · Evals · Model serving
Microsoft	Senior Researcher - Agentic AI - Microsoft Research AI Frontiers	Big Tech	9	Agent orchestration · Agent research · Multi-agent · Evals · Frontier research · Model serving
Google	Research Scientist, Visual Data and Generative Research	Big Tech	9	Vision · Fine-tuning · Multimodal · Evals
Meta	Research Engineer (Technical Leadership), FAIR Data - Meta Superintelligence Labs	Big Tech	9	Agent research · Frontier research · Multimodal · Pretraining · RL post-training
Google	Staff Software Engineer, Generative AI, Core ML	Big Tech	9	Agent orchestration · Tool use · Evals · Fine-tuning · RL post-training · Reward modeling · Agent research · Multimodal
Amazon	Applied Scientist, Demand Forecasting	Big Tech	9	Frontier research · Pretraining · Inference infra · Model serving · Code gen
Microsoft	Member of Technical Staff -Member of Technical Staff - Pretraining Text Data	Big Tech	9	Frontier research
Amazon	Member of Technical Staff, AGI Autonomy	Big Tech	9	Agent orchestration · RL robotics · Embodied AI · LLM observability
Microsoft	Member of Technical Staff - Pretraining Text Data	Big Tech	9	Pretraining
Microsoft	Research Software Engineer - Multiple Levels- AI Frontiers	Big Tech	9	Agent orchestration · Agent research · Evals · Model serving · Inference infra
Microsoft	Research Intern - OneDrive and SharePoint (Summer 2026)	Big Tech	9	Multimodal · Agent orchestration · Evals · Fine-tuning · RAG · Vision · Code gen
Meta	AI Research Scientist, Media Data Research - MSL FAIR	Big Tech	9	Frontier research · Multimodal · Pretraining · RL post-training · Vision
Microsoft	Research Intern - Office of Applied Research	Big Tech	9	RL post-training · Fine-tuning · Agent research · Evals
Meta	Research Engineer, Media Data Research - MSL FAIR	Big Tech	9	Multimodal · Pretraining · Fine-tuning · Frontier research · Agent research · Vision
Meta	Research Engineer, Text Data Research - MSL FAIR	Big Tech	9	Frontier research · Pretraining · RL post-training
Google	Software Engineer III, GenAI Data Operations Research, XR	Big Tech	8	Inference infra · Model serving · Fine-tuning · Audio & speech
Google	Software Engineering Manager II, AI Productivity	Big Tech	8	Agent orchestration · Evals · Guardrails · LLM observability · Multimodal · Model serving
Meta	Visiting Researcher, FAIR (University Grad)	Big Tech	8	LLM observability · Evals

Frequently asked questions

What is Synthetic data in AI?
Generating training data via LLMs or simulators to augment or replace human-labeled datasets, especially for rare cases, long-tail tasks, or alignment data. Primary AI lifecycle stage: data.
How many AI roles reference Synthetic data right now?
234 active AI roles across 63 companies in our index reference Synthetic data as of today. New postings fell 25% in the last 30 days versus the prior 30 (85 → 64).
Which companies are hiring for Synthetic data roles?
The companies with the most active Synthetic data listings are: xAI (35 roles), NVIDIA (22 roles), Handshake (15 roles), Amazon (14 roles), Google (10 roles).
What AI lifecycle stage does Synthetic data belong to?
Synthetic data primarily belongs to the data stage of the AI lifecycle. In current hiring, Synthetic data roles concentrate at: data (55%), agents (19%).
What sectors invest most in Synthetic data?
The sectors with the most active Synthetic data hiring are: AI Frontier, Big Tech, Enterprise.