Podcast Episodes
Back to Search"Dear Self; we need to talk about ambition" by Elizabeth
I keep seeing advice on ambition, aimed at people in college or early in their career, that would have been really bad for me at similar ages. Rather…
2 years, 7 months ago
"Book Launch: "The Carving of Reality," Best of LessWrong vol. III" by Raemon
The Carving of Reality, third volume of the Best of LessWrong books is now available on Amazon (US).
The Carving of Reality includes 43 essays from 29…
2 years, 7 months ago
"Assume Bad Faith" by Zack_M_Davis
I've been trying to avoid the terms "good faith" and "bad faith". I'm suspicious that most people who have picked up the phrase "bad faith" from hear…
2 years, 7 months ago
"Large Language Models will be Great for Censorship" by Ethan Edwards
LLMs can do many incredible things. They can generate unique creative content, carry on long conversations in any number of subjects, complete comple…
2 years, 7 months ago
"Ten Thousand Years of Solitude" by agp
This is a linkpost for the article "Ten Thousand Years of Solitude", written by Jared Diamond for Discover Magazine in 1993, four years before he pub…
2 years, 7 months ago
"6 non-obvious mental health issues specific to AI safety" by Igor Ivanov
Intro: I am a psychotherapist, and I help people working on AI safety. I noticed patterns of mental health issues highly specific to this group. It's…
2 years, 7 months ago
"Against Almost Every Theory of Impact of Interpretability" by Charbel-Raphaël
I gave a talk about the different risk models, followed by an interpretability presentation, then I got a problematic question, "I don't understand, …
2 years, 7 months ago
"Inflection.ai is a major AGI lab" by Nikola
Inflection.ai (co-founded by DeepMind co-founder Mustafa Suleyman) should be perceived as a frontier LLM lab of similar magnitude as Meta, OpenAI, De…
2 years, 8 months ago
"Feedbackloop-first Rationality" by Raemon
I've been workshopping a new rationality training paradigm. (By "rationality training paradigm", I mean an approach to learning/teaching the skill of…
2 years, 8 months ago
"When can we trust model evaluations?" bu evhub
In "Towards understanding-based safety evaluations," I discussed why I think evaluating specifically the alignment of models is likely to require mec…
2 years, 8 months ago