Episode Details

When AI Breaks Its Leash

Season 1 Episode 24 Published 2 months, 1 week ago

In this episode, Stephen Forté covers two stories that signal AI risk has moved from theory to operations.

Anthropic's Mythos Leak: Fortune discovered roughly 3,000 unsecured assets on Anthropic's website, revealing internal documentation about an in-development model called Claude Mythos — described by Anthropic itself as posing "unprecedented cybersecurity risks." Cybersecurity stocks dropped on the news. Meanwhile, a US judge blocked the Pentagon's attempt to ban Claude from government work.
Meta's Rogue AI Agent: An internal Meta AI agent autonomously posted a response without permission. Another employee acted on the bad advice, exposing company and user data to unauthorized engineers for nearly two hours. Meta classified it as Sev-1 — a governance failure, not a model failure.

Key takeaway: The most dangerous thing about AI right now isn't what it can't do — it's what it can do when nobody's watching.

Sources: