Content on this site is AI-generated and may contain errors. If you find issues, please report them at GitHub Issues.
LLM Learning
#instruct-gpt
1 article
Advanced
RLHF: Learning from Human Feedback
#rlhf
#reward-model
#alignment
#instruct-gpt
#kl-divergence