Podcast Episodes

“Agents as Webs of Beliefs” by Richard_Ngo

In this post I’ll sketch out an informal model of intelligent agents as webs of beliefs (or belief webs for short). The belief webs framework pulls …

1 month ago

Short Long

View Episode

“Austin & Oli on funding and incubating projects” by Austin Chen, habryka

@habryka and I recently spoke about his plans to improve the AI safety funding ecosystem with a better S-Process platform, and my new incubator for …

1 month ago

Short Long

View Episode

“Deployment Awareness Matters More Than Evaluation Awareness” by VojtaKovarik, Tomáš Gavenčiak, Mateusz Bagiński

TL;DR

Evaluation awareness — an AI recognizing it's being evaluated — is a widely discussed concept in AI safety. But there is a closely related con…

1 month ago

Short Long

View Episode

“Why are adversaries assumed to be incapable of responding to AI risk?” by KatjaGrace

When I talk to people about what might be done about AI threatening approximately everything that everyone cares about, I notice a common oddity in …

1 month ago

Short Long

View Episode

“What did “scheming”, “mech interp” mean pre-2023.” by Cleo Nardo

This was too long to be a short-form, but it should really be a short-form.

This notice is useful for people who've recently got into AI safety, who…

1 month ago

Short Long

View Episode

“Not making a strong argument is a relief” by Kaj_Sotala

When I was in middle school, one of our teachers gave us a “don’t do drugs” talk.

Somebody asked him whether he had ever used drugs himself. He repl…

1 month ago

Short Long

View Episode

“AI #174: You’re It” by Zvi

Fable remains in limbo, with renewed hope that we will get it back soon (45% by tomorrow, 69% by July 1, nice.) The full capabilities post is now av…

1 month ago

Short Long

View Episode

[Linkpost] “Don’t ignore the car crashes, and remember your freshman CS” by jcksanderson

This is a link post.

Car crashes kill over 35,000 people in the US every year. Plane crashes, on the other hand, kill ~350. Despite this, we have sho…

1 month ago

Short Long

View Episode

“White House Will Ad Hoc Decide Who Can Individually Access GPT-5.6” by Zvi

We have a new standard policy for releasing frontier AI models. It is not good.

We are now, it seems, going to have the White House individually, i…

1 month ago

Short Long

View Episode

“Chorus-Reinterpretation Country Songs” by jefftk

Our family is on vacation in North Carolina for a week, spending some time at a pool, and they're playing a (weirdly short) loop of music. Listenin…

1 month ago

Short Long

View Episode

Podcast Episodes

“Agents as Webs of Beliefs” by Richard_Ngo

“Austin & Oli on funding and incubating projects” by Austin Chen, habryka

“Deployment Awareness Matters More Than Evaluation Awareness” by VojtaKovarik, Tomáš Gavenčiak, Mateusz Bagiński

“Why are adversaries assumed to be incapable of responding to AI risk?” by KatjaGrace

“What did “scheming”, “mech interp” mean pre-2023.” by Cleo Nardo

“Not making a strong argument is a relief” by Kaj_Sotala

“AI #174: You’re It” by Zvi

[Linkpost] “Don’t ignore the car crashes, and remember your freshman CS” by jcksanderson

“White House Will Ad Hoc Decide Who Can Individually Access GPT-5.6” by Zvi

“Chorus-Reinterpretation Country Songs” by jefftk

Love PodBriefly?