Podcast Episodes
Express interest in an “FHI of the West”
TLDR: I am investigating whether to found a spiritual successor to FHI, housed under Lightcone Infrastructure, providing a rich cultural environment …
1 year, 11 months ago
Transformers Represent Belief State Geometry in their Residual Stream
Produced while being an affiliate at PIBBSS[1]. The work was done initially with funding from a Lightspeed Grant, and then continued while at PIBBSS.…
1 year, 11 months ago
Paul Christiano named as US AI Safety Institute Head of AI Safety
This is a linkpost for https://www.commerce.gov/news/press-releases/2024/04/us-commerce-secretary-gina-raimondo-announces-expansion-us-ai-safety U.S. …
2 years ago
[HUMAN VOICE] "How could I have thought that faster?" by mesaoptimizer
Support ongoing human narrations of LessWrong's curated posts:
www.patreon.com/LWCurated
This is a linkpost for https://twitter.com/ESYudkowsky/status/…
2 years ago
[HUMAN VOICE] "My PhD thesis: Algorithmic Bayesian Epistemology" by Eric Neyman
In January, I defended my PhD thesis, which I called Algorithm…
2 years ago
[HUMAN VOICE] "Toward a Broader Conception of Adverse Selection" by Ricki Heicklen
This is a linkpost for https://bayesshammai.substack.com/p/con…
2 years ago
[HUMAN VOICE] "On green" by Joe Carlsmith
Cross-posted from my website. Podcast version here, or search for "Joe Carlsmith Audio" on your podcast app.
This essay is part of a series that I'm c…
2 years ago
LLMs for Alignment Research: a safety priority?
A recent short story by Gabriel Mukobi illustrates a near-term scenario where things go bad because new developments in LLMs allow LLMs to accelerate…
2 years ago
[HUMAN VOICE] "Social status part 1/2: negotiations over object-level preferences" by Steven Byrnes
Source:
https://www.lesswrong.com/posts/SPBm67otKq5ET5CWP/socia…
2 years ago
[HUMAN VOICE] "Using axis lines for good or evil" by dynomight
Source:
https://www.lesswrong.com/posts/Yay8SbQiwErRyDKGb/using…
2 years ago