Podcast Episodes
Back to Search“Claude Mythos #3: Capabilities and Additions” by Zvi
To round out coverage of Mythos, today covers capabilities other than cyber, and anything else additional not covered by the first two posts, includ…
1 week ago
“Diary of a “Doomer”: 12+ years arguing about AI risk (part 1)” by David Scott Krueger (formerly: capybaralet)
How I learned about Deep Learning.
As far as I know, I’m the second person ever to get into the field of AI largely because I was worried about the …
1 week ago
“A Retrospective of Richard Ngo’s 2022 List of Conceptual Alignment Projects” by LawrenceC
Written very quickly for the InkHaven Residency.
In 2022, Richard Ngo wrote a list of 26 Conceptual Alignment Research Projects. Now that it's 2026,…
1 week ago
“From personas to intentions: towards a science of motivations for AI models” by David Africa, Jacob Pfau
TLDR:
Behavior-only descriptions are useful, but insufficient for aligning advanced models with high assurance.Two models can look equally aligned o…1 week ago
“The Shapley Share of Responsibility?” by Raemon
Deepfates on twitter wrote:
If you're in a theater and you shout "Fire!", and the audience reacts predictably and in the process trample someone to …
1 week ago
“Who Killed Common Law?” by Benquo
The classical undergraduate humanities curriculum in America was destroyed and replaced over the course of the twentieth century. The destruction is…
1 week, 1 day ago
“Anthropic repeatedly accidentally trained against the CoT, demonstrating inadequate processes” by Alex Mallen, ryan_greenblatt
It turns out that Anthropic accidentally trained against the chain of thought of Claude Mythos Preview in around 8% of training episodes. This is at…
1 week, 1 day ago
“Meaningful Questions Have Return Types” by Drake Morrison
One way intellectual progress stalls is when you are asking the Wrong Questions. Your question is nonsensical, or cuts against the way reality works…
1 week, 1 day ago
“Political Violence Is Never Acceptable” by Zvi
Nor is the threat or implication of violence. Period. Ever. No exceptions.
It is completely unacceptable. I condemn it in the strongest possible te…
1 week, 1 day ago
“Only Law Can Prevent Extinction” by Eliezer Yudkowsky
There's a quote I read as a kid that stuck with me my whole life:
"Remember that all tax revenue is the result of holding a gun to somebody's head. …
1 week, 1 day ago