Podcast Episodes
“Distillation Robustifies Unlearning” by Bruce W. Lee, Addie Foote, alexinf, leni, Jacob G-W, Harish Kamath, Bryce Woodworth, cloud, TurnTrout
Current “unlearning” methods only suppress capabilities instead of truly unlearning the capabilities. But if you distill an unlearned model into a r…
10 months ago
“Intelligence Is Not Magic, But Your Threshold For ‘Magic’ Is Pretty Low” by Expertium
A while ago I saw a person in the comments on Scott Alexander's blog arguing that a superintelligent AI would not be able to do anything…
10 months ago
“A Straightforward Explanation of the Good Regulator Theorem” by Alfred Harwood
Audio note: this article contains 329 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in t…
10 months ago
“Beware General Claims about ‘Generalizable Reasoning Capabilities’ (of Modern AI Systems)” by LawrenceC
1. Late last week, researchers at Apple released a paper provocatively titled “The Illusion of Thinking: Understanding the Strengths and Limitations …
10 months ago
“Give Me a Reason(ing Model)” by Zvi
Are we doing this again? It looks like we are doing this again. This time it involves giving LLMs several ‘new’ tasks including effectively a Tower of…
10 months ago
“How to help friend who needs to get better at planning?” by shuffled-cantaloupe
I have a good friend who is intelligent in many ways, but bad at planning / achieving his goals / being medium+ agency. He's very habit and routine …
10 months ago
“The Intelligence Symbiosis Manifesto - Toward a Future of Living with AI” by Hiroshi Yamakawa
In response to the growing risk of uncontrollable advanced AI systems, we are announcing the Japan-initiated Manifesto for Symbiotic Intelligence as…
10 months ago
“Some Human That I Used to Know (Filk)” by Gordon Seidoh Worley
To the tune of "Somebody That I Used to Know" with apologies to Gotye.
Now and then, I think of when you owned the planet
Doing what you liked with …
10 months ago
“Research Without Permission” by Priyanka Bharadwaj
Epistemic status: Personal account. A reflection on navigating entry into the AI safety space without formal credentials or institutional affiliatio…
10 months ago
“” by null
Source: https://www.lesswrong.com/posts/HKCKinBgsKKvjQyWK/read-the-pricing-first
Narrated by…
10 months ago