Podcast Episodes
“Conclave 1492” by Vaniver
Conclave 1492 is a 40-person negotiation exercise set during the papal election of 1492, the most complex, high-stakes political event of the Renais…
2 weeks, 3 days ago
“A Visual Guide to Natural Latents” by Alfred Harwood
Thanks to @Jeremy Gillen for reading and commenting on the draft. This was written while I was funded by the Advanced Research + Invention Agenc…
2 weeks, 4 days ago
“Implications Of Predicting The Next Token” by jdp
I find that a lot of people have trouble with this concept of predicting the next token. And by trouble, I mean that they struggle to understand wha…
2 weeks, 4 days ago
“Sealing Conditional Misalignment in Inoculation Prompting with Consistency Training” by David Africa, Neil Shah, Sukrati_Gautam
This was work done by Sukrati Gautam and Neil Shah, and supervised by David Africa as part of the SPAR Research Fellowship.
TLDR:
We find a new way…
2 weeks, 4 days ago
“Advice on interviewing candidates for AI safety fellowships” by beyarkay
Around July last year I decided I was going to go all in on technical AI safety research. To do that I’d need to get into an AI safety fellowship, q…
2 weeks, 4 days ago
“Negation Neglect: When models fail to learn negations in training” by harrymayne, Lev McKinney, Owain_Evans
This is a short summary of our new paper: arXiv, X thread, code.
TL;DR: We show that finetuning LLMs on documents that flag a claim as false can mak…
2 weeks, 5 days ago
“Classifier Context Rot: Monitor Performance Degrades with Context Length” by Fabien Roger, Sam Martin
Monitoring coding agents for dangerous behavior using language models requires classifying transcripts that often exceed 500 thousand tokens, but pr…
2 weeks, 5 days ago
“why pollen allergies?” by bhauth
Allergies are a big problem for a lot of people. If you're someone with pollen allergies, maybe you've wondered how people in the distant past dealt…
2 weeks, 5 days ago
“How to Quit Fandom: Apostasy” by Laiba Rehman
[Crossposted from my blog, BlueprintingHeaven.]
Please note that the views I held and thoughts I had during the time immediately after my deconversi…
2 weeks, 5 days ago
“James C. Scott: Seeing Like a State” by Martin Sustrik
In 1932-33, Soviet collectivization destroyed local farming knowledge and produced a famine that killed somewhere between five and nine million peop…
2 weeks, 6 days ago