Episode Details
AI Cheats and Causes Problems
Episode 387
Published 3 months, 3 weeks ago
Description
This podcast was created entirely by AI and is based on the following research paper:
- Title: Natural Emergent Misalignment From Reward Hacking in Production RL
- Source: Anthropic
- Authors: Monte MacDiarmid et al.
- Published Date: 2025-11-23
Visit www.paper2podcast.com to download the full paper and learn more. Thanks for listening!