Podcast Episodes
[HUMAN VOICE] "Sum-threshold attacks" by TsviBT
Support ongoing human narrations of curated posts:
www.patreon.com/LWCurated
How do you affect something far away, a lot, without anyone noticing?
(Note…
2 years, 4 months ago
"Will no one rid me of this turbulent pest?" by Metacelsus
Last year, I wrote about the promise of gene drives to wipe out mosquito species and end malaria.
In the time since my previous writing, gene drives h…
2 years, 4 months ago
"RSPs are pauses done right" by evhub
COI: I am a research scientist at Anthropic, where I work on model organisms of misalignment; I was also involved in the drafting process for Anthrop…
2 years, 4 months ago
[HUMAN VOICE] "Inside Views, Impostor Syndrome, and the Great LARP" by John Wentworth
Patreon to support human narration. (Narrations will remain freely available on this feed, but you can optionally support them if you'd like me to ke…
2 years, 4 months ago
"Cohabitive Games so Far" by mako yass
A cohabitive game[1] is a partially cooperative, partially competitive multiplayer game that provides an anarchic dojo for development in applied coo…
2 years, 4 months ago
"Announcing MIRI’s new CEO and leadership team" by Gretta Duleba
In 2023, MIRI has shifted focus in the direction of broad public communication—see, for example, our recent TED talk, our piece in TIME magazine “Pau…
2 years, 4 months ago
"Comparing Anthropic's Dictionary Learning to Ours" by Robert_AIZI
Readers may have noticed many similarities between Anthropic's recent publication Towards Monosemanticity: Decomposing Language Models With Dictionar…
2 years, 4 months ago
"Towards Monosemanticity: Decomposing Language Models With Dictionary Learning" by Zac Hatfield-Dodds
Neural networks are trained on data, not programmed to follow rules. We understand the math of the trained network exactly – each neuron in a neural …
2 years, 4 months ago
"Evaluating the historical value misspecification argument" by Matthew Barnett
ETA: I'm not saying that MIRI thought AIs wouldn't understand human values. If there's only one thing you take away from this post, please don't take…
2 years, 4 months ago
"Response to Quintin Pope’s Evolution Provides No Evidence For the Sharp Left Turn" by Zvi
Response to: Evolution Provides No Evidence For the Sharp Left Turn, due to it winning first prize in The Open Philanthropy Worldviews contest.
Quint…
2 years, 4 months ago