Podcast Episodes
Back to Search"At 87, Pearl is still able to change his mind" by rotatingpaguro
Judea Pearl is a famous researcher, known for Bayesian networks (the standard way of representing Bayesian models), and his statistical formalization…
2 years, 5 months ago
"We're Not Ready: thoughts on "pausing" and responsible scaling policies" by Holden Karnofsky
Views are my own, not Open Philanthropy’s. I am married to the President of Anthropic and have a financial interest in both Anthropic and OpenAI via …
2 years, 5 months ago
[HUMAN VOICE] "Alignment Implications of LLM Successes: a Debate in One Act" by Zack M Davis
Support ongoing human narrations of curated posts:
www.patreon.com/LWCurated
Doomimir: Humanity has made no progress on the alignment problem. Not only…
2 years, 5 months ago
"LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B" by Simon Lermen & Jeffrey Ladish.
Produced as part of the SERI ML Alignment Theory Scholars Program - Summer 2023 Cohort, under the mentorship of Jeffrey Ladish.
TL;DR LoRA fine-tunin…
2 years, 5 months ago
"Holly Elmore and Rob Miles dialogue on AI Safety Advocacy" by jacobjacob, Robert Miles & Holly_Elmore
Holly is an independent AI Pause organizer, which includes organizing protests (like this upcoming one). Rob is an AI Safety YouTuber. I (jacobjacob)…
2 years, 5 months ago
"Labs should be explicit about why they are building AGI" by Peter Barnett
Three of the big AI labs say that they care about alignment and that they think misaligned AI poses a potentially existential threat to humanity. The…
2 years, 5 months ago
[HUMAN VOICE] "Sum-threshold attacks" by TsviBT
Support ongoing human narrations of curated posts:
www.patreon.com/LWCurated
How do you affect something far away, a lot, without anyone noticing?
(Note…
2 years, 5 months ago
"Will no one rid me of this turbulent pest?" by Metacelsus
Last year, I wrote about the promise of gene drives to wipe out mosquito species and end malaria.
In the time since my previous writing, gene drives h…
2 years, 5 months ago
"RSPs are pauses done right" by evhub
COI: I am a research scientist at Anthropic, where I work on model organisms of misalignment; I was also involved in the drafting process for Anthrop…
2 years, 6 months ago
[HUMAN VOICE] "Inside Views, Impostor Syndrome, and the Great LARP" by John Wentworth
Patreon to support human narration. (Narrations will remain freely available on this feed, but you can optionally support them if you'd like me to ke…
2 years, 6 months ago