\

Reinforcement Learning from Human Feedback

85 points - today at 12:53 PM

Source
  • dang

    today at 6:16 PM

    Related. Others?

    RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)

    • verdverm

      today at 2:47 PM

      Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials

        • leggerss

          today at 4:38 PM

          You could say he's also learning from human feedback

      • klelatti

        today at 1:46 PM

        Web version with links, etc:

        https://rlhfbook.com/

      • iisweetheartii

        today at 2:01 PM

        [dead]