Podcast Episodes
Back to Search"What I would do if I wasn’t at ARC Evals" by LawrenceC
In which: I list 9 projects that I would work on if I wasn’t busy working on safety standards at ARC Evals, and explain why they might be good to wor…
2 years, 5 months ago
"The U.S. is becoming less stable" by lc
We focus so much on arguing over who is at fault in this country that I think sometimes we fail to alert on what's actually happening. I would just l…
2 years, 5 months ago
"Meta Questions about Metaphilosophy" by Wei Dai
To quickly recap my main intellectual journey so far (omitting a lengthy side trip into cryptography and Cypherpunk land), with the approximate age t…
2 years, 5 months ago
"OpenAI API base models are not sycophantic, at any size" by Nostalgebraist
In Discovering Language Model Behaviors with Model-Written Evaluations" (Perez et al 2022), the authors studied language model "sycophancy" - the ten…
2 years, 5 months ago
"Dear Self; we need to talk about ambition" by Elizabeth
I keep seeing advice on ambition, aimed at people in college or early in their career, that would have been really bad for me at similar ages. Rather…
2 years, 5 months ago
"Book Launch: "The Carving of Reality," Best of LessWrong vol. III" by Raemon
The Carving of Reality, third volume of the Best of LessWrong books is now available on Amazon (US).
The Carving of Reality includes 43 essays from 29…
2 years, 5 months ago
"Assume Bad Faith" by Zack_M_Davis
I've been trying to avoid the terms "good faith" and "bad faith". I'm suspicious that most people who have picked up the phrase "bad faith" from hear…
2 years, 5 months ago
"Large Language Models will be Great for Censorship" by Ethan Edwards
LLMs can do many incredible things. They can generate unique creative content, carry on long conversations in any number of subjects, complete comple…
2 years, 6 months ago
"Ten Thousand Years of Solitude" by agp
This is a linkpost for the article "Ten Thousand Years of Solitude", written by Jared Diamond for Discover Magazine in 1993, four years before he pub…
2 years, 6 months ago
"6 non-obvious mental health issues specific to AI safety" by Igor Ivanov
Intro: I am a psychotherapist, and I help people working on AI safety. I noticed patterns of mental health issues highly specific to this group. It's…
2 years, 6 months ago