73 active AI roles across 29 companies mention Direct Preference Optimization (DPO). Category: ML Techniques.
New AI postings mentioning Direct Preference Optimization (DPO) per week — 53 total over 11 weeks.
Direct Preference Optimization (DPO) is a skill in the "ML Techniques" category. It currently appears in 73 active AI roles across 29 companies in our index.
The top employers with active AI roles mentioning Direct Preference Optimization (DPO) are: Amazon (25), Adobe (5), ByteDance (4), Autodesk (4), Unity (3).
Over the last 11 weeks, 53 new AI postings mentioned Direct Preference Optimization (DPO). Demand is rising — up 69% in the last four weeks compared to the earliest four weeks in the window.
Roles requiring Direct Preference Optimization (DPO) are concentrated in: post-training (53%), agents (25%), pre-training (7%). These stages follow a seven-stage AI lifecycle from data preparation through to shipped product.
Job postings that mention Direct Preference Optimization (DPO) most often also require: RLHF & Alignment Techniques, Large Language Models (LLMs), Machine Learning, Reinforcement Learning (RL), Generative AI.
1 AI role requiring this skill.
| Company | Title | Sector | AI score | Stage |
|---|---|---|---|---|
| Deloitte | Research Engineer — Post-Training & Small Language Models (SLMs), Healthcare AI | Consulting | 9 | L2 |