Podcast Episodes

[Linkpost] “To Understand History, Keep Former Population Distributions In Mind” by Arjun Panickssery

This is a link post. Guillaume Blanc has a piece in Works in Progress (I assume based on his paper) about how France's fertility declined earlier tha…

11 months, 3 weeks ago

Short Long

View Episode

“AI-enabled coups: a small group could use AI to seize power” by Tom Davidson, Lukas Finnveden, rosehadshar

We’ve written a new report on the threat of AI-enabled coups.

I think this is a very serious risk – comparable in importance to AI takeover but muc…

11 months, 3 weeks ago

Short Long

View Episode

“Accountability Sinks” by Martin Sustrik

Back in the 1990s, ground squirrels were briefly fashionable pets, but their popularity came to an abrupt end after an incident at Schiphol Airport …

11 months, 3 weeks ago

Short Long

View Episode

“Training AGI in Secret would be Unsafe and Unethical” by Daniel Kokotajlo

Subtitle: Bad for loss of control risks, bad for concentration of power risks

I’ve had this sitting in my drafts for the last year. I wish I’d been …

11 months, 4 weeks ago

Short Long

View Episode

“Why Should I Assume CCP AGI is Worse Than USG AGI?” by Tomás B.

Though, given my doomerism, I think the natsec framing of the AGI race is likely wrongheaded, let me accept the Dario/Leopold/Altman frame that AGI …

11 months, 4 weeks ago

Short Long

View Episode

“Surprising LLM reasoning failures make me think we still need qualitative breakthroughs for AGI” by Kaj_Sotala

Introduction

Writing this post puts me in a weird epistemic position. I simultaneously believe that:

The reasoning failures that I'll discuss are st…

1 year ago

Short Long

View Episode

“Frontier AI Models Still Fail at Basic Physical Tasks: A Manufacturing Case Study” by Adam Karvonen

Dario Amodei, CEO of Anthropic, recently worried about a world where only 30% of jobs become automated, leading to class tensions between the automa…

1 year ago

Short Long

View Episode

“Negative Results for SAEs On Downstream Tasks and Deprioritising SAE Research (GDM Mech Interp Team Progress Update #2)” by Neel Nanda, lewis smith, Senthooran Rajamanoharan, Arthur Conmy, Callum McDougall, Tom Lieberum, János Kramár, Rohin Shah

Audio note: this article contains 31 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in th…

1 year ago

Short Long

View Episode

[Linkpost] “Playing in the Creek” by Hastings

This is a link post. When I was a really small kid, one of my favorite activities was to try and dam up the creek in my backyard. I would carefully m…

1 year ago

Short Long

View Episode

“Thoughts on AI 2027” by Max Harms

This is part of the MIRI Single Author Series. Pieces in this series represent the beliefs and opinions of their named authors, and do not claim to …

1 year ago

Short Long

View Episode

Podcast Episodes

[Linkpost] “To Understand History, Keep Former Population Distributions In Mind” by Arjun Panickssery

“AI-enabled coups: a small group could use AI to seize power” by Tom Davidson, Lukas Finnveden, rosehadshar

“Accountability Sinks” by Martin Sustrik

“Training AGI in Secret would be Unsafe and Unethical” by Daniel Kokotajlo

“Why Should I Assume CCP AGI is Worse Than USG AGI?” by Tomás B.

“Surprising LLM reasoning failures make me think we still need qualitative breakthroughs for AGI” by Kaj_Sotala

Introduction

“Frontier AI Models Still Fail at Basic Physical Tasks: A Manufacturing Case Study” by Adam Karvonen

“Negative Results for SAEs On Downstream Tasks and Deprioritising SAE Research (GDM Mech Interp Team Progress Update #2)” by Neel Nanda, lewis smith, Senthooran Rajamanoharan, Arthur Conmy, Callum McDougall, Tom Lieberum, János Kramár, Rohin Shah

[Linkpost] “Playing in the Creek” by Hastings

“Thoughts on AI 2027” by Max Harms

Love PodBriefly?