Podcast Episodes
Back to Search[Linkpost] “MAGA populists call for holy war against Big Tech” by Remmelt
This is a link post. Excerpts on AI
Geoffrey Miller was handed the mic and started berating one of the panelists: Shyam Sankar, the chief technology …
7 months ago
“Your LLM-assisted scientific breakthrough probably isn’t real” by eggsyntax
Summary
An increasing number of people in recent months have believed that they've made an important and novel scientific breakthrough, which they'v…
7 months, 1 week ago
“Trust me bro, just one more RL scale up, this one will be the real scale up with the good environments, the actually legit one, trust me bro” by ryan_greenblatt
I've recently written about how I've updated against seeing substantially faster than trend AI progress due to quickly massively scaling up RL on ag…
7 months, 1 week ago
“⿻ Plurality & 6pack.care” by Audrey Tang
(Cross-posted from speaker's notes of my talk at Deepmind today.)
Good local time, everyone. I am Audrey Tang, 🇹🇼 Taiwan's Cyber Ambassador and firs…
7 months, 2 weeks ago
[Linkpost] “The Cats are On To Something” by Hastings
This is a link post. So the situation as it stands is that the fraction of the light cone expected to be filled with satisfied cats is not zero. This…
7 months, 2 weeks ago
[Linkpost] “Open Global Investment as a Governance Model for AGI” by Nick Bostrom
This is a link post. I've seen many prescriptive contributions to AGI governance take the form of proposals for some radically new structure. Some ca…
7 months, 2 weeks ago
“Will Any Old Crap Cause Emergent Misalignment?” by J Bostock
The following work was done independently by me in an afternoon and basically entirely vibe-coded with Claude. Code and instructions to reproduce ca…
7 months, 2 weeks ago
“AI Induced Psychosis: A shallow investigation” by Tim Hua
“This is a Copernican-level shift in perspective for the field of AI safety.” - Gemini 2.5 Pro
“What you need right now is not validation, but immed…
7 months, 2 weeks ago
“Before LLM Psychosis, There Was Yes-Man Psychosis” by johnswentworth
A studio executive has no beliefs
That's the way of a studio system
We've bowed to every rear of all the studio chiefs
And you can bet your ass we'v…
7 months, 3 weeks ago
“Training a Reward Hacker Despite Perfect Labels” by ariana_azarbal, vgillioz, TurnTrout
Summary: Perfectly labeled outcomes in training can still boost reward hacking tendencies in generalization. This can hold even when the train/test …
7 months, 3 weeks ago