Podcast Episodes

“Model organisms researchers should check whether high LRs defeat their model organisms” by dx26, Sebastian Prasanna, Alek Westover, Vivek Hebbar, Julian Stastny

Thanks to Buck Shlegeris for feedback on a draft of this post.

The goal-guarding hypothesis states that schemers will be able to preserve their goal…

1 week, 5 days ago

Short Long

“Anthropic did not publish a “risk discussion” of Mythos when required by their RSP” by RobertM

I and some other people noticed a potential discrepancy in Anthropic's announcement of Claude Mythos. The version of the RSP that was operative over…

1 week, 5 days ago

Short Long

View Episode

“Claude Mythos: The System Card” by Zvi

Claude Mythos is different.

This is the first model other than GPT-2 that is at first not being released for public use at all.

With GPT-2 the del…

1 week, 5 days ago

Short Long

View Episode

“Some takes on UV & cancer” by Steven Byrnes

Table of contents:

Part 1: In which I use my optical physics background to share some hopefully-uncontroversial observationsPart 2: In which I boldl…

1 week, 5 days ago

Short Long

View Episode

“AI #163: Mythos Quest” by Zvi

There exists an AI model, Claude Mythos, that has discovered critical safety vulnerabilities in every major operating system and browser. If release…

1 week, 6 days ago

Short Long

View Episode

“Slightly-Super Persuasion Will Do” by Tomás B.

In SF this week, I met an online friend in person for the first time yesterday. We talked about super-persuasion. His take was: there is mostly an e…

1 week, 6 days ago

Short Long

View Episode

“Help me launch Obsolete: a book aimed at building a new movement for AI reform” by garrison

I wrote a book! It's called Obsolete: The AI Industry's Trillion-Dollar Race to Replace You—and How to Stop It, and it’ll be available in May if you…

1 week, 6 days ago

Short Long

View Episode

“Have we already lost? Part 1: The Plan in 2024” by LawrenceC

Written very quickly for the Inkhaven Residency.

As I take the time to reflect on the state of AI Safety in early 2026, one question feels unavoida…

1 week, 6 days ago

Short Long

View Episode

“Do not be surprised if LessWrong gets hacked” by RobertM

Or, for that matter, anything else.

This post is meant to be two things:

a PSA about LessWrong's current security posture, from a LessWrong admin[1]…

1 week, 6 days ago

Short Long

View Episode

“One Week in the Rat Farm” by Philip Harker

Hello, LessWrong. This is a personal introduction diary-ish post and it does not have a thesis. I apologise if this isn't a good fit for the website…

1 week, 6 days ago

Short Long

View Episode

Podcast Episodes

“Model organisms researchers should check whether high LRs defeat their model organisms” by dx26, Sebastian Prasanna, Alek Westover, Vivek Hebbar, Julian Stastny

“Anthropic did not publish a “risk discussion” of Mythos when required by their RSP” by RobertM

“Claude Mythos: The System Card” by Zvi

“Some takes on UV & cancer” by Steven Byrnes

“AI #163: Mythos Quest” by Zvi

“Slightly-Super Persuasion Will Do” by Tomás B.

“Help me launch Obsolete: a book aimed at building a new movement for AI reform” by garrison

“Have we already lost? Part 1: The Plan in 2024” by LawrenceC

“Do not be surprised if LessWrong gets hacked” by RobertM

“One Week in the Rat Farm” by Philip Harker

Love PodBriefly?