Podcast Episodes
Back to Search[HUMAN VOICE] Update on human narration for this podcast
Contact: patreon.com/lwcurated or [perrin dot j dot walker plus lesswrong fnord gmail].
All Solenoid's narration work found here.
1 year, 10 months ago
“Maybe Anthropic’s Long-Term Benefit Trust is powerless” by Zach Stein-Perlman
Crossposted from AI Lab Watch. Subscribe on Substack.
Introduction.
Anthropic has an unconventional governance mechanism: an independent "Long-Term B…
1 year, 10 months ago
“Notifications Received in 30 Minutes of Class” by tanagrabeast
Introduction.
If you are choosing to read this post, you've probably seen the image below depicting all the notifications students received on their…
1 year, 10 months ago
“AI companies aren’t really using external evaluators” by Zach Stein-Perlman
New blog: AI Lab Watch. Subscribe on Substack.
Many AI safety folks think that METR is close to the labs, with ongoing relationships that grant it acc…
1 year, 10 months ago
“EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024” by scasper
Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.Part 13 of 12 in the Engineer's Interpretability Sequence.
TL;D…
1 year, 10 months ago
“What’s Going on With OpenAI’s Messaging?” by ozziegoen
This is a quickly-written opinion piece, of what I understand about OpenAI. I first posted it to Facebook, where it had some discussion.
Some argum…
1 year, 10 months ago
“Language Models Model Us” by eggsyntax
Produced as part of the MATS Winter 2023-4 program, under the mentorship of @Jessica Rumbelow
One-sentence summary: On a dataset of human-written essa…
1 year, 10 months ago
Jaan Tallinn’s 2023 Philanthropy Overview
This is a link post.to follow up my philantropic pledge from 2020, i've updated my philanthropy page with 2023 results.
in 2023 my donations funded $4…
1 year, 10 months ago
“OpenAI: Exodus” by Zvi
Previously: OpenAI: Facts From a Weekend, OpenAI: The Battle of the Board, OpenAI: Leaks Confirm the Story, OpenAI: Altman Returns, OpenAI: The Board…
1 year, 10 months ago
DeepMind’s ”Frontier Safety Framework” is weak and unambitious
FSF blogpost. Full document (just 6 pages; you should read it). Compare to Anthropic's RSP, OpenAI's RSP ("PF"), and METR's Key Components of an RSP.…
1 year, 10 months ago