New AI postings mentioning Direct Preference Optimization (DPO) per week — 53 total over 11 weeks.
73 active AI roles across 29 companies mention Direct Preference Optimization (DPO). Category: ML Techniques.
Direct Preference Optimization (DPO) is a skill in the "ML Techniques" category. It currently appears in 73 active AI roles across 29 companies in our index.
The top employers with active AI roles mentioning Direct Preference Optimization (DPO) are: Amazon (25), Adobe (5), ByteDance (4), Autodesk (4), Unity (3).
Over the last 11 weeks, 53 new AI postings mentioned Direct Preference Optimization (DPO). Demand is rising — up 69% in the last four weeks compared to the earliest four weeks in the window.
Roles requiring Direct Preference Optimization (DPO) are concentrated in: post-training (53%), agents (25%), pre-training (7%). These stages follow a seven-stage AI lifecycle from data preparation through to shipped product.
Job postings that mention Direct Preference Optimization (DPO) most often also require: RLHF & Alignment Techniques, Large Language Models (LLMs), Machine Learning, Reinforcement Learning (RL), Generative AI.
3 AI roles requiring this skill.
| Company | Title | Sector | AI score | Stage |
|---|---|---|---|---|
| NVIDIA | Agent RL Infra Engineer | Semiconductors | 9 | L2 |
| Cerebras | Applied AI/ML Scientist | Semiconductors | 9 | L2 |
| NVIDIA | Senior Math Libraries Engineer - Sparsity in AI | Semiconductors | 7 | L3 |