791: Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

791: Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

Super Data Science: ML & AI Podcast with Jon Krohn

Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to Jon Krohn about the technique’s origins of the technique. He also walks through other …

Related tracks

See all