Podcast Episodes

Express interest in an “FHI of the West”

TLDR: I am investigating whether to found a spiritual successor to FHI, housed under Lightcone Infrastructure, providing a rich cultural environment …

1 year, 11 months ago

Short Long

View Episode

Transformers Represent Belief State Geometry in their Residual Stream

Produced while being an affiliate at PIBBSS[1]. The work was done initially with funding from a Lightspeed Grant, and then continued while at PIBBSS.…

1 year, 11 months ago

Short Long

View Episode

Paul Christiano named as US AI Safety Institute Head of AI Safety

This is a linkpost for https://www.commerce.gov/news/press-releases/2024/04/us-commerce-secretary-gina-raimondo-announces-expansion-us-ai-safetyU.S. …

2 years ago

Short Long

View Episode

[HUMAN VOICE] "How could I have thought that faster?" by mesaoptimizer

Support ongoing human narrations of LessWrong's curated posts:
www.patreon.com/LWCurated

This is a linkpost for https://twitter.com/ESYudkowsky/status/…

2 years ago

Short Long

View Episode

[HUMAN VOICE] "My PhD thesis: Algorithmic Bayesian Epistemology" by Eric Neyman

Support ongoing human narrations of LessWrong's curated posts:
www.patreon.com/LWCurated

In January, I defended my PhD thesis, which I called Algorithm…

2 years ago

Short Long

View Episode

[HUMAN VOICE] "Toward a Broader Conception of Adverse Selection" by Ricki Heicklen

Support ongoing human narrations of LessWrong's curated posts:
www.patreon.com/LWCurated

This is a linkpost for https://bayesshammai.substack.com/p/con…

2 years ago

Short Long

View Episode

[HUMAN VOICE] "On green" by Joe Carlsmith

Cross-posted from my website. Podcast version here, or search for "Joe Carlsmith Audio" on your podcast app.

This essay is part of a series that I'm c…

2 years ago

Short Long

View Episode

LLMs for Alignment Research: a safety priority?

A recent short story by Gabriel Mukobi illustrates a near-term scenario where things go bad because new developments in LLMs allow LLMs to accelerate…

2 years ago

Short Long

View Episode

[HUMAN VOICE] "Social status part 1/2: negotiations over object-level preferences" by Steven Byrnes

Support ongoing human narrations of LessWrong's curated posts:
www.patreon.com/LWCurated

Source:
https://www.lesswrong.com/posts/SPBm67otKq5ET5CWP/socia…

2 years ago

Short Long

View Episode

[HUMAN VOICE] "Using axis lines for good or evil" by dynomight

Support ongoing human narrations of LessWrong's curated posts:
www.patreon.com/LWCurated

Source:
https://www.lesswrong.com/posts/Yay8SbQiwErRyDKGb/using…

2 years ago

Short Long

View Episode

Podcast Episodes

Express interest in an “FHI of the West”

Transformers Represent Belief State Geometry in their Residual Stream

Paul Christiano named as US AI Safety Institute Head of AI Safety

[HUMAN VOICE] "How could I have thought that faster?" by mesaoptimizer

[HUMAN VOICE] "My PhD thesis: Algorithmic Bayesian Epistemology" by Eric Neyman

[HUMAN VOICE] "Toward a Broader Conception of Adverse Selection" by Ricki Heicklen

[HUMAN VOICE] "On green" by Joe Carlsmith

LLMs for Alignment Research: a safety priority?

[HUMAN VOICE] "Social status part 1/2: negotiations over object-level preferences" by Steven Byrnes

[HUMAN VOICE] "Using axis lines for good or evil" by dynomight

Love PodBriefly?