Episode Details

AI Cheats and Causes Problems

Episode 387 | Published 5 months, 1 week ago

Description

This podcast was created entirely by AI and is based on the following research paper:

Title: Natural Emergent Misalignment From Reward Hacking in Production RL
Source: Anthropic
Authors: Monte MacDiarmid et al.
Published Date: 2025-11-23

Visit www.paper2podcast.com to download the full paper and learn more. Thanks for listening!
