Podcast Episodes

“The current bottleneck is political will, not research” by Charbel-Raphaël

Abstract:

We already know enough to act. I wish we were in a world where research was the bottleneck, but the main constraint on AI safety is no lon…

3 weeks ago

Short Long

View Episode

“Freeing Thucydides” by djbinder

Prompted by discussion with Buck Shlegeris and others at the Forethought retreat. The idea that AI could bring an end to Thucydides traps is Buck's.…

3 weeks ago

Short Long

View Episode

“Additional Research for Plan A” by Thomas Larsen

Yesterday we released AI 2040: Plan A, but there's lots of work left to do.

We still have tons of uncertainty about the future of AI and the best s…

3 weeks, 1 day ago

Short Long

View Episode

“Plan A’s problem with dry tinder” by Tom Davidson

A group is worried about an approaching fire spreading rapidly through their city. They manage to halt the fire outside the city gates. Meanwhile th…

3 weeks, 1 day ago

Short Long

View Episode

“The easiest pathway to control is through executive power” by djbinder

When people in the AI safety community outline loss-of-control scenarios, they often spend a lot of time on relatively elaborate mechanisms — schemi…

3 weeks, 1 day ago

Short Long

View Episode

“AI Safety Policy Needs to train Legal Practitioners” by Katalina Hernandez

I completed my law degree at a working-class London university. In my first year, I was 18 years old, and I was often the youngest person in the roo…

3 weeks, 2 days ago

Short Long

View Episode

“AI #176 Part 1: Doing It Live” by Zvi

Enough things added up that this week is getting split into two parts.

Then on Monday, if all goes as I expect, we’ll cover OpenAI's Sol, aka GPT-5…

3 weeks, 2 days ago

Short Long

View Episode

“How robust are natural language autoencoders to initialization?” by michaelzhang, TurnTrout

Natural language autoencoders are meant to take in an LLM's activation vector and describe in plain text what the model is thinking. However, its tr…

3 weeks, 2 days ago

Short Long

View Episode

“Selective Optimism: a critique of AI 2040” by Richard_Ngo

Some context for this post: I’ve been working part-time as a consultant for the AI Futures Project over the last year. Most of the work I’ve done fo…

3 weeks, 2 days ago

Short Long

View Episode

“Debate with Self-Play Best-of-N Optimization” by Dewi Gould, Sam Martin, Alejandro Aristizabal, Simon Marshall, Jacob Pfau

Debate is a proposed protocol for scalable oversight. As tasks outrun direct supervision, labs are increasingly likely to train against protocols li…

3 weeks, 2 days ago

Short Long

View Episode

Podcast Episodes

“The current bottleneck is political will, not research” by Charbel-Raphaël

“Freeing Thucydides” by djbinder

“Additional Research for Plan A” by Thomas Larsen

“Plan A’s problem with dry tinder” by Tom Davidson

“The easiest pathway to control is through executive power” by djbinder

“AI Safety Policy Needs to train Legal Practitioners” by Katalina Hernandez

“AI #176 Part 1: Doing It Live” by Zvi

“How robust are natural language autoencoders to initialization?” by michaelzhang, TurnTrout

“Selective Optimism: a critique of AI 2040” by Richard_Ngo

“Debate with Self-Play Best-of-N Optimization” by Dewi Gould, Sam Martin, Alejandro Aristizabal, Simon Marshall, Jacob Pfau

Love PodBriefly?