Podcast Episodes
Back to Search“A deep critique of AI 2027’s bad timeline models” by titotal
Thank you to Arepo and Eli Lifland for looking over this article for errors.
I am sorry that this article is so long. Every time I thought I was do…
9 months, 1 week ago
“‘Buckle up bucko, this ain’t over till it’s over.’” by Raemon
The second in a series of bite-sized rationality prompts[1].
Often, if I'm bouncing off a problem, one issue is that I intuitively expect the proble…
9 months, 1 week ago
“Shutdown Resistance in Reasoning Models” by benwr, JeremySchlatter, Jeffrey Ladish
We recently discovered some concerning behavior in OpenAI's reasoning models: When trying to complete a task, these models sometimes actively circum…
9 months, 1 week ago
“Authors Have a Responsibility to Communicate Clearly” by TurnTrout
When a claim is shown to be incorrect, defenders may say that the author was just being “sloppy” and actually meant something else entirely. I argue …
9 months, 1 week ago
“The Industrial Explosion” by rosehadshar, Tom Davidson
Summary
To quickly transform the world, it's not enough for AI to become super smart (the "intelligence explosion").
AI will also have to turbochar…
9 months, 1 week ago
“Race and Gender Bias As An Example of Unfaithful Chain of Thought in the Wild” by Adam Karvonen, Sam Marks
Summary: We found that LLMs exhibit significant race and gender bias in realistic hiring scenarios, but their chain-of-thought reasoning shows zero e…
9 months, 2 weeks ago
“The best simple argument for Pausing AI?” by Gary Marcus
Not saying we should pause AI, but consider the following argument:
Alignment without the capacity to follow rules is hopeless. You can’t possibly …
9 months, 2 weeks ago
“Foom & Doom 2: Technical alignment is hard” by Steven Byrnes
2.1 Summary & Table of contents
This is the second of a two-post series on foom (previous post) and doom (this post).The last post talked about how …
9 months, 2 weeks ago
“Proposal for making credible commitments to AIs.” by Cleo Nardo
Acknowledgments: The core scheme here was suggested by Prof. Gabriel Weil.
There has been growing interest in the deal-making agenda: humans make de…
9 months, 2 weeks ago
“X explains Z% of the variance in Y” by Leon Lang
Audio note: this article contains 218 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in t…
9 months, 2 weeks ago