Generating training data via LLMs or simulators to augment or replace human-labeled datasets, especially for rare cases, long-tail tasks, or alignment data. Primary AI lifecycle stage: data.
234 active AI roles across 63 companies in our index reference Synthetic data as of today. New postings fell 25% in the last 30 days versus the prior 30 (85 → 64).
The companies with the most active Synthetic data listings are: xAI (35 roles), NVIDIA (22 roles), Handshake (15 roles), Amazon (14 roles), Google (10 roles).
Synthetic data primarily belongs to the data stage of the AI lifecycle. In current hiring, Synthetic data roles concentrate at: data (55%), agents (19%).
The sectors with the most active Synthetic data hiring are: AI Frontier, Big Tech, Enterprise.
Generating training data via LLMs or simulators to augment or replace human-labeled datasets, especially for rare cases, long-tail tasks, or alignment data.
Primary AI lifecycle stage: data.
As of today, 234 active AI roles across 63 companies in our index reference Synthetic data. Hiring concentrates at the data (55%) and agents (19%) stages. Most common sectors: AI Frontier, Big Tech, Enterprise. New postings fell 25% in the last 30 days versus the prior 30 (85 → 64).
104 AI roles tagged synthetic_data.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Research Scientist, Applied ML, Quantum Error Correction | Big Tech | 10 | Evals · Frontier research | |
| Meta | Research Scientist – World Models, Robotics & Embodied AI | Big Tech | 10 | Embodied AI · Agent research · Multimodal · Vision · RL robotics · Model serving |
| Meta | Research Scientist, AI, Formal and informal Reasoning | Big Tech | 10 | Frontier research · Agent research · RL post-training · Fine-tuning · Evals |
| Meta | AI Research Scientist, Media Data Research - MSL FAIR | Big Tech | 10 | Frontier research · Multimodal · Pretraining · Fine-tuning |
| Meta | AI Research Scientist, Text Data Research - MSL FAIR | Big Tech | 10 | Frontier research · Pretraining · RL post-training · Agent research |
| Research Scientist, Visual Data and Generative Research | Big Tech | 9 | Fine-tuning · Vision · Multimodal · Evals | |
| Research Scientist, Manipulation for Robotics, DeepMind | Big Tech | 9 | Embodied AI · Agent research · Multimodal · Model serving · Frontier research | |
| Meta | Research Scientist, Post-Training (Tech Leadership)- Meta Superintelligence Labs | Big Tech | 9 | RL post-training · Fine-tuning · Agent research · Agent orchestration · LLM observability |
| Senior Product Manager, Gemini Internationalization Modeling, DeepMind | Big Tech | 9 | Pretraining · RL post-training · Evals · Multimodal · Agent orchestration · LLM observability | |
| Research Scientist, Stitch | Big Tech | 9 | Agent orchestration · Tool use · Evals · Agent research · Fine-tuning · RL post-training | |
| Amazon | Applied Scientist II, Foundation Model, Robotics | Big Tech | 9 | Embodied AI · Agent orchestration · Fine-tuning · Multimodal · Vision · RL robotics |
| Amazon | Member of Technical Staff - Machine Learning, Frontier AI Robotics | Big Tech | 9 | Embodied AI · RL robotics · Multimodal · Model serving · Inference infra |
| Microsoft | Principal Researcher - Agentic AI - Microsoft Research AI Frontiers | Big Tech | 9 | Agent orchestration · Agent research · Multi-agent · Frontier research · Pretraining · RL post-training · Evals · Model serving |
| Microsoft | Senior Researcher - Agentic AI - Microsoft Research AI Frontiers | Big Tech | 9 | Agent orchestration · Agent research · Multi-agent · Evals · Frontier research · Model serving |
| Research Scientist, Visual Data and Generative Research | Big Tech | 9 | Vision · Fine-tuning · Multimodal · Evals | |
| Meta | Research Engineer (Technical Leadership), FAIR Data - Meta Superintelligence Labs | Big Tech | 9 | Agent research · Frontier research · Multimodal · Pretraining · RL post-training |
| Staff Software Engineer, Generative AI, Core ML | Big Tech | 9 | Agent orchestration · Tool use · Evals · Fine-tuning · RL post-training · Reward modeling · Agent research · Multimodal | |
| Amazon | Applied Scientist, Demand Forecasting | Big Tech | 9 | Frontier research · Pretraining · Inference infra · Model serving · Code gen |
| Microsoft | Member of Technical Staff -Member of Technical Staff - Pretraining Text Data | Big Tech | 9 | Frontier research |
| Amazon | Member of Technical Staff, AGI Autonomy | Big Tech | 9 | Agent orchestration · RL robotics · Embodied AI · LLM observability |
| Microsoft | Member of Technical Staff - Pretraining Text Data | Big Tech | 9 | Pretraining |
| Microsoft | Research Software Engineer - Multiple Levels- AI Frontiers | Big Tech | 9 | Agent orchestration · Agent research · Evals · Model serving · Inference infra |
| Microsoft | Research Intern - OneDrive and SharePoint (Summer 2026) | Big Tech | 9 | Multimodal · Agent orchestration · Evals · Fine-tuning · RAG · Vision · Code gen |
| Meta | AI Research Scientist, Media Data Research - MSL FAIR | Big Tech | 9 | Frontier research · Multimodal · Pretraining · RL post-training · Vision |
| Microsoft | Research Intern - Office of Applied Research | Big Tech | 9 | RL post-training · Fine-tuning · Agent research · Evals |
| Meta | Research Engineer, Media Data Research - MSL FAIR | Big Tech | 9 | Multimodal · Pretraining · Fine-tuning · Frontier research · Agent research · Vision |
| Meta | Research Engineer, Text Data Research - MSL FAIR | Big Tech | 9 | Frontier research · Pretraining · RL post-training |
| Software Engineer III, GenAI Data Operations Research, XR | Big Tech | 8 | Inference infra · Model serving · Fine-tuning · Audio & speech | |
| Software Engineering Manager II, AI Productivity | Big Tech | 8 | Agent orchestration · Evals · Guardrails · LLM observability · Multimodal · Model serving | |
| Meta | Visiting Researcher, FAIR (University Grad) | Big Tech | 8 | LLM observability · Evals |