Podcast Episodes
“Tracing the Thoughts of a Large Language Model” by Adam Jermyn
[This is our blog post on the papers, which can be found at https://transformer-circuits.pub/2025/attribution-graphs/biology.html and https://transf…
11 months ago
“Recent AI model progress feels mostly like bullshit” by lc
About nine months ago, I and three friends decided that AI had gotten good enough to monitor large codebases autonomously for security problems. We …
11 months ago
“AI for AI safety” by Joe Carlsmith
(Audio version here (read by the author), or search for "Joe Carlsmith Audio" on your podcast app.)
This is the fourth essay in a series that I’m call…
11 months ago
“Policy for LLM Writing on LessWrong” by jimrandomh
LessWrong has been receiving an increasing number of posts and comments that look like they might be LLM-written or partially-LLM-written, so we're …
11 months ago
“Will Jesus Christ return in an election year?” by Eric Neyman
Thanks to Jesse Richardson for discussion.
Polymarket asks: will Jesus Christ return in 2025?
In the three days since the market opened, traders ha…
11 months ago
“Good Research Takes are Not Sufficient for Good Strategic Takes” by Neel Nanda
TL;DR Having a good research track record is some evidence of good big-picture takes, but it's weak evidence. Strategic thinking is hard, and requir…
11 months, 1 week ago
“Intention to Treat” by Alicorn
When my son was three, we enrolled him in a study of a vision condition that runs in my family. They wanted us to put an eyepatch on him for part of…
11 months, 1 week ago
“On the Rationality of Deterring ASI” by Dan H
I’m releasing a new paper “Superintelligence Strategy” alongside Eric Schmidt (formerly Google), and Alexandr Wang (Scale AI). Below is the executive…
11 months, 1 week ago
[Linkpost] “METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman
This is a link post. Summary: We propose measuring AI performance in terms of the length of tasks AI agents can complete. We show that this metric ha…
11 months, 1 week ago
“I make several million dollars per year and have hundreds of thousands of followers—what is the straightest line path to utilizing these resources to reduce existential-level AI threats?” by shrimpy
I have, over the last year, become fairly well-known in a small corner of the internet tangentially related to AI.
As a result, I've begun making what…
11 months, 1 week ago