Podcast Episodes

[HUMAN VOICE] Update on human narration for this podcast

Contact: patreon.com/lwcurated or [perrin dot j dot walker plus lesswrong fnord gmail].

All Solenoid's narration work found here.

1 year, 10 months ago

“Maybe Anthropic’s Long-Term Benefit Trust is powerless” by Zach Stein-Perlman

Crossposted from AI Lab Watch. Subscribe on Substack.

Introduction.

Anthropic has an unconventional governance mechanism: an independent "Long-Term B…

1 year, 10 months ago

“Notifications Received in 30 Minutes of Class” by tanagrabeast

Introduction.

If you are choosing to read this post, you've probably seen the image below depicting all the notifications students received on their…

1 year, 10 months ago

“AI companies aren’t really using external evaluators” by Zach Stein-Perlman

New blog: AI Lab Watch. Subscribe on Substack.

Many AI safety folks think that METR is close to the labs, with ongoing relationships that grant it acc…

1 year, 10 months ago

“EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024” by scasper

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual. Part 13 of 12 in the Engineer's Interpretability Sequence.

TL;D…

1 year, 10 months ago

“What’s Going on With OpenAI’s Messaging?” by ozziegoen

This is a quickly-written opinion piece, of what I understand about OpenAI. I first posted it to Facebook, where it had some discussion.

Some argum…

1 year, 10 months ago

“Language Models Model Us” by eggsyntax

Produced as part of the MATS Winter 2023-4 program, under the mentorship of @Jessica Rumbelow

One-sentence summary: On a dataset of human-written essa…

1 year, 10 months ago

Jaan Tallinn’s 2023 Philanthropy Overview

This is a link post. to follow up my philanthropic pledge from 2020, i've updated my philanthropy page with 2023 results.

in 2023 my donations funded $4…

1 year, 10 months ago

“OpenAI: Exodus” by Zvi

Previously: OpenAI: Facts From a Weekend, OpenAI: The Battle of the Board, OpenAI: Leaks Confirm the Story, OpenAI: Altman Returns, OpenAI: The Board…

1 year, 10 months ago

DeepMind’s “Frontier Safety Framework” is weak and unambitious

FSF blogpost. Full document (just 6 pages; you should read it). Compare to Anthropic's RSP, OpenAI's RSP ("PF"), and METR's Key Components of an RSP.…

1 year, 10 months ago
