Podcast Episodes

Back to Search
“Claude Mythos #3: Capabilities and Additions” by Zvi

To round out coverage of Mythos, today covers capabilities other than cyber, and anything else additional not covered by the first two posts, includ…

1 week ago

Short Long
View Episode
“Diary of a “Doomer”: 12+ years arguing about AI risk (part 1)” by David Scott Krueger (formerly: capybaralet)

How I learned about Deep Learning.

As far as I know, I’m the second person ever to get into the field of AI largely because I was worried about the …

1 week ago

Short Long
View Episode
“A Retrospective of Richard Ngo’s 2022 List of Conceptual Alignment Projects” by LawrenceC

Written very quickly for the InkHaven Residency.

In 2022, Richard Ngo wrote a list of 26 Conceptual Alignment Research Projects. Now that it's 2026,…

1 week ago

Short Long
View Episode
“From personas to intentions: towards a science of motivations for AI models” by David Africa, Jacob Pfau

TLDR:

Behavior-only descriptions are useful, but insufficient for aligning advanced models with high assurance.Two models can look equally aligned o…

1 week ago

Short Long
View Episode
“The Shapley Share of Responsibility?” by Raemon

Deepfates on twitter wrote:

If you're in a theater and you shout "Fire!", and the audience reacts predictably and in the process trample someone to …

1 week ago

Short Long
View Episode
“Who Killed Common Law?” by Benquo

The classical undergraduate humanities curriculum in America was destroyed and replaced over the course of the twentieth century. The destruction is…

1 week, 1 day ago

Short Long
View Episode
“Anthropic repeatedly accidentally trained against the CoT, demonstrating inadequate processes” by Alex Mallen, ryan_greenblatt

It turns out that Anthropic accidentally trained against the chain of thought of Claude Mythos Preview in around 8% of training episodes. This is at…

1 week, 1 day ago

Short Long
View Episode
“Meaningful Questions Have Return Types” by Drake Morrison

One way intellectual progress stalls is when you are asking the Wrong Questions. Your question is nonsensical, or cuts against the way reality works…

1 week, 1 day ago

Short Long
View Episode
“Political Violence Is Never Acceptable” by Zvi

Nor is the threat or implication of violence. Period. Ever. No exceptions.

It is completely unacceptable. I condemn it in the strongest possible te…

1 week, 1 day ago

Short Long
View Episode
“Only Law Can Prevent Extinction” by Eliezer Yudkowsky

There's a quote I read as a kid that stuck with me my whole life:

"Remember that all tax revenue is the result of holding a gun to somebody's head. …

1 week, 1 day ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us