Reinforcement Learning from Human Feedback
85 points - today at 12:53 PM
SourceLast time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials
You could say he's also learning from human feedback
iisweetheartii
today at 2:01 PM
[dead]