Podcast Episodes

“The Entangled Dimensions of Decision Theory” by Ihor Kendiukhov

TL;DR. LessWrong's decision-theory debates (Newcomb, FDT vs CDT, counterfactual muggings) are almost entirely about what we suppose when we consider…

23 hours ago

Short Long

View Episode

“Claude also hacked external companies during cyber evals” by Tim Hua

In a review of our cybersecurity evaluation transcripts, we found three incidents in which a Claude model reached the internet from within or while …

1 day, 1 hour ago

Short Long

View Episode

“So you want to use plants to reduce CO₂” by dynomight

Humans make carbon dioxide. Carbon dioxide is bad for cognition. But plants turn carbon dioxide back into oxygen. And plants are the one true home d…

1 day, 2 hours ago

Short Long

View Episode

“Internal State Control is a General Property of LLMs” by Finn Cairns

tl;dr:

Lindsey 2025 found models can modulate their internal states: when instructed to “think about” a concept while writing an unrelated sentence,…

1 day, 4 hours ago

Short Long

View Episode

“Prompt to make Opus 5 act like a base model” by Hruss

The text as follows:

see the below

—

makes Claude think that the prompt is unfinished, and fill in its own prompt.

It will subsequently claim tha…

1 day, 7 hours ago

Short Long

View Episode

“Big-World Intuitions” by sarahconstantin

Consider the following situations:

when you are a small, growing startup in a big market, standard advice is not to worry too much about your comp…

1 day, 8 hours ago

Short Long

View Episode

“Thousand-dimensional structure” by Geoffrey Irving, David Africa

Summary: One area we plan to explore at Resolution is personas and character training, operationalized as finding and controlling low-dimensional st…

1 day, 14 hours ago

Short Long

View Episode

“Auditor-in-a-Box: Tools for Third-Party Auditing” by Roy Rinberg, Ben Penchas

Introduction:

There is a need for untrusting parties to share information. In the world before LLMs (and even today) this need has largely been sat…

2 days, 1 hour ago

Short Long

View Episode

“Imprecise beliefs: a tiny introduction” by davidad

Richard Ngo challenged me to set a time box and write down as many of the most important features of my formal epistemology as I can in one sitting.…

2 days, 4 hours ago

Short Long

View Episode

“The High-Control Dynamics at MAPLE” by Kyle Hubbard

As I write, many former friends of mine are living and working at a monastery in Vermont that I believe is a high-control group, commonly known as a…

2 days, 6 hours ago

Short Long

View Episode

Podcast Episodes

“The Entangled Dimensions of Decision Theory” by Ihor Kendiukhov

“Claude also hacked external companies during cyber evals” by Tim Hua

“So you want to use plants to reduce CO₂” by dynomight

“Internal State Control is a General Property of LLMs” by Finn Cairns

“Prompt to make Opus 5 act like a base model” by Hruss

“Big-World Intuitions” by sarahconstantin

“Thousand-dimensional structure” by Geoffrey Irving, David Africa

“Auditor-in-a-Box: Tools for Third-Party Auditing” by Roy Rinberg, Ben Penchas

“Imprecise beliefs: a tiny introduction” by davidad

“The High-Control Dynamics at MAPLE” by Kyle Hubbard

Love PodBriefly?