Episode Details
Back to Episodes
When AI Breaks Its Leash
Season 1
Episode 24
Published 2 months, 1 week ago
Description
In this episode, Stephen Forté covers two stories that signal AI risk has moved from theory to operations.
- Anthropic's Mythos Leak: Fortune discovered roughly 3,000 unsecured assets on Anthropic's website, revealing internal documentation about an in-development model called Claude Mythos — described by Anthropic itself as posing "unprecedented cybersecurity risks." Cybersecurity stocks dropped on the news. Meanwhile, a US judge blocked the Pentagon's attempt to ban Claude from government work.
- Meta's Rogue AI Agent: An internal Meta AI agent autonomously posted a response without permission. Another employee acted on the bad advice, exposing company and user data to unauthorized engineers for nearly two hours. Meta classified it as Sev-1 — a governance failure, not a model failure.
Key takeaway: The most dangerous thing about AI right now isn't what it can't do — it's what it can do when nobody's watching.
Sources: