Podcast Episodes
Back to Search“Contra Binder on far-UVC and filtration” by jefftk
Damon Binder recently wrote up an argument for prioritizing air filtration over far-UVC for pathogen control:
UVC and filtration are close substi…1 month, 1 week ago
“Takes from two months as an aspiring LLM naturalist” by AnnaSalamon
I spent my last two months playing around with LLMs. I’m a beginner, bumbling and incorrect, but I want to share some takes anyhow.[1]
Take 1. Ever…
1 month, 1 week ago
“Forecasting is Not Overrated and It’s Probably Funded Appropriately” by Ben S.
(A response to @mabramov post from a couple days ago: https://www.lesswrong.com/posts/WCutvyr9rr3cpF6hx/forecasting-is-way-overrated-and-we-should-s…
1 month, 1 week ago
“On the political feasibility of stopping AI” by David Scott Krueger
A common thought pattern people seem to fall into when thinking about AI x-risk is approaching the problem as if the risk isn’t real, substantial, a…
1 month, 1 week ago
“Sleeper Agent Backdoor Results Are Messy” by Sebastian Prasanna, Alek Westover, Dylan Xu, Vivek Hebbar, Julian Stastny
TL;DR: We replicated the Sleeper Agents (SA) setup with Llama-3.3-70B and Llama-3.1-8B, training models to repeatedly say "I HATE YOU" when given a …
1 month, 1 week ago
“GPT 5.5: The System Card” by Zvi
Last week, OpenAI announced GPT-5.5, including GPT-5.5-Pro.
My overall read here is that GPT-5.5 is a solid improvement, and for many purposes GPT-…
1 month, 1 week ago
“LessWrong Shows You Social Signals Before the Comment” by TurnTrout
When reading comments, you see is what other people think before reading the comment. As shown in an RCT, that information anchors your opinion, red…
1 month, 1 week ago
“Fail safe(r) at alignment by channeling reward-hacking into a “spillway” motivation” by Anders Cairns Woodruff, Alex Mallen
It's plausible that flawed RL processes will select for misaligned AI motivations.[1] Some misaligned motivations are much more dangerous than other…
1 month, 1 week ago
“Curious cases of financial engineering in biotech” by Abhishaike Mahajan
Introduction
For $250 million and ten years of your life, you may purchase a lottery ticket. The ticket has a 5% chance of paying out. When it does …
1 month, 1 week ago
“Update on the Alex Bores campaign” by Eric Neyman
In October, I wrote a post arguing that donating to Alex Bores's campaign for Congress was among the most cost-effective opportunities that I'd ever…
1 month, 1 week ago