HN2new | past | comments | ask | show | jobs | submitlogin
Reinforcement Learning from Human Feedback (rlhfbook.com)
133 points by onurkanbkrc 28 days ago | hide | past | favorite | 5 comments


Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials


You could say he's also learning from human feedback


Related. Others?

RLHF Book - https://hackertimes.com/item?id=42902936 - Feb 2025 (37 comments)


Web version with links, etc:

https://rlhfbook.com/


Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: