Content on this site is AI-generated and may contain errors. If you find issues, please report them at GitHub Issues.
LLM Learning
#preference-optimization
1 article
Advanced
From DPO to GRPO: Direct Preference Optimization
#dpo
#grpo
#ipo
#preference-optimization
#offline-rl