Reward Hacking by Reasoning Models & Loss of Control Scenarios w/ Jeffrey Ladish of Palisade Research, from FLI Podcast
Apr 2, 2025

Reward Hacking by Reasoning Models & Loss of Control Scenarios w/ Jeffrey Ladish of Palisade Research, from FLI Podcast

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Information

Published
April 2, 2025
Type
audio
Language
EN
Author
Erik Torenberg, Nathan Labenz
Discover
Find new listens