Podcast Episodes

“Model organisms researchers should check whether high LRs defeat their model organisms” by dx26, Sebastian Prasanna, Alek Westover, Vivek Hebbar, Julian Stastny

Thanks to Buck Shlegeris for feedback on a draft of this post.

The goal-guarding hypothesis states that schemers will be able to preserve their goal…

1 week, 5 days ago

“Anthropic did not publish a “risk discussion” of Mythos when required by their RSP” by RobertM

I and some other people noticed a potential discrepancy in Anthropic's announcement of Claude Mythos. The version of the RSP that was operative over…

1 week, 5 days ago

“Claude Mythos: The System Card” by Zvi

Claude Mythos is different.

This is the first model other than GPT-2 that is at first not being released for public use at all.

With GPT-2 the del…

1 week, 5 days ago

“Some takes on UV & cancer” by Steven Byrnes

Table of contents:

Part 1: In which I use my optical physics background to share some hopefully-uncontroversial observations

Part 2: In which I boldl…

1 week, 5 days ago

“AI #163: Mythos Quest” by Zvi

There exists an AI model, Claude Mythos, that has discovered critical safety vulnerabilities in every major operating system and browser. If release…

1 week, 6 days ago

“Slightly-Super Persuasion Will Do” by Tomás B.

In SF this week, I met an online friend in person for the first time yesterday. We talked about super-persuasion. His take was: there is mostly an e…

1 week, 6 days ago

“Help me launch Obsolete: a book aimed at building a new movement for AI reform” by garrison

I wrote a book! It's called Obsolete: The AI Industry's Trillion-Dollar Race to Replace You—and How to Stop It, and it’ll be available in May if you…

1 week, 6 days ago

“Have we already lost? Part 1: The Plan in 2024” by LawrenceC


Written very quickly for the Inkhaven Residency.

As I take the time to reflect on the state of AI Safety in early 2026, one question feels unavoida…

1 week, 6 days ago

“Do not be surprised if LessWrong gets hacked” by RobertM

Or, for that matter, anything else.

This post is meant to be two things:

a PSA about LessWrong's current security posture, from a LessWrong admin[1]…

1 week, 6 days ago

“One Week in the Rat Farm” by Philip Harker

Hello, LessWrong. This is a personal introduction diary-ish post and it does not have a thesis. I apologise if this isn't a good fit for the website…

1 week, 6 days ago

