Podcast Episodes
““The AI Doc” is coming out March 26” by Rob Bensinger, Beckeck
On Thursday, March 26th, a major new AI documentary is coming out: The AI Doc: Or How I Became an Apocaloptimist. Tickets are on sale now.
The movie…
1 month ago
“Null Results From An Orexin-A RCT” by niplav, harsimony, nomagicpill
Over the last few months we[1] have been doing a sleep experiment inspired by our suspicion that orexin is an exciting target for sleep need reducti…
1 month ago
“Broad Timelines” by Toby_Ord
No-one knows when AI will begin having transformative impacts upon the world. People aren’t sure and shouldn’t be sure: there just isn’t enough evid…
1 month ago
“Protecting humanity and Claude from rationalization and unaligned AI” by Kaj_Sotala
My first academic piece on risks from AI was a talk that I gave at the 2009 European Conference on Philosophy and Computing. Titled “three factors m…
1 month ago
“An interactive version of the extropians mailing list” by beyarkay
Claude & I vibecoded an interface for the extropians mailing list. It's live! Have a look here: https://extropians.boydkane.com/.
From Wikipedia, di…
1 month ago
[Linkpost] “OpenAI: How we monitor internal coding agents for misalignment” by Marcus Williams
This is a link post.
Sharing some of the monitoring work I've been doing at OpenAI: How we monitor internal coding agents for misalignment.
OpenAI no…
1 month ago
“Training on Documents About Monitoring Leads To CoT Obfuscation” by Reilly Haskins, bilalchughtai, Josh Engels
Authors: Reilly Haskins*, Bilal Chughtai**, Joshua Engels**
* primary contributor
** advice and mentorship
Summary
[Note: This is a research update …
1 month ago
“Two Skillsets You Need to Launch an Impactful AI Safety Project” by Luc Brinkman, plex
Your project might be failing without you even knowing it.
It's hard to save the world. If you’re launching a new AI Safety project, this sequence h…
1 month ago
[Linkpost] “Metagaming matters for training, evaluation, and oversight” by jenny, Bronson Schoen
This is a link post.
Following up on our previous work on verbalized eval awareness:
we are sharing a post investigating the emergence of metagaming …
1 month ago