Synthetic data is training data generated by LLMs or simulators to augment or replace human-labeled datasets, especially for rare cases, long-tail tasks, or alignment data.
Primary AI lifecycle stage: data.
As of today, 234 active AI roles across 63 companies in our index reference Synthetic data. Hiring concentrates at the data (55%) and agents (19%) stages. Most common sectors: AI Frontier, Big Tech, Enterprise. New postings fell 25% in the last 30 days versus the prior 30 (85 → 64).
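The 25% figure can be checked from the two posting counts. The sketch below is a minimal illustration, assuming the change is measured relative to the prior 30-day window; the function name `percent_change` is hypothetical and not part of the index.

```python
def percent_change(prior: int, current: int) -> float:
    """Relative change from `prior` to `current`, in percent.

    Assumption: the index measures the change against the prior window.
    """
    if prior == 0:
        raise ValueError("prior count must be non-zero")
    return (current - prior) / prior * 100


# Posting counts quoted above: 85 in the prior 30 days, 64 in the last 30.
change = percent_change(prior=85, current=64)
print(f"{change:.1f}%")  # -24.7%, which rounds to the reported 25% drop
```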
The companies with the most active Synthetic data listings are xAI (35 roles), NVIDIA (22 roles), Handshake (15 roles), Amazon (14 roles), and Google (10 roles).
9 AI roles tagged synthetic_data, all in the Consumer sector.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| DoorDash | AI Research Fellowship, (Summer and Fall 2026) | Consumer | 9 | Agent orchestration · Tool use · Evals · Forecasting · Multimodal · Vision · Audio & speech · Frontier research |
| Roblox | Principal Machine Learning Engineer, Engineering Acceleration | Consumer | 9 | Agent orchestration · Agent research · Evals · Fine-tuning · Model serving · Code gen |
| Airbnb | Senior Staff Machine Learning Engineer, Data & Eval | Consumer | 9 | Evals · LLM observability · Guardrails · RAG · Agent orchestration · Tool use · Fine-tuning |
| Snap | Computer Vision Engineer | Consumer | 8 | Vision · Multimodal · Fine-tuning |
| Roblox | Senior Machine Learning Engineer, GenAI Data | Consumer | 8 | Multimodal · Evals · Model serving · Inference infra |
| Spotify | Machine Learning Engineering Manager - Personalization | Consumer | 8 | Recommender systems · Model serving |
| Discord | Manager, Scaled Abuse Countermeasures and Research | Consumer | 7 | Agent orchestration · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving · Recommender systems · Search & ranking · Vision · Audio & speech · Frontier research · Interpretability · Agent research · RL post-training · RLHF · Reward modeling · RL robotics · Embodied AI |
| Whatnot | Software Engineer, Trust & Risk | Consumer | 7 | Agent orchestration · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Inference infra · Model serving · Recommender systems · Search & ranking · Vision · Audio & speech · Frontier research · Interpretability · Agent research · RL post-training · RLHF · Reward modeling · RL robotics · Embodied AI |
| Coursera | Learning Designer - Technical/AI | Consumer | 5 | Agent orchestration · Evals |