Podcast Episodes
Back to Search“HPMOR: The (Probably) Untold Lore” by Gretta Duleba, Eliezer Yudkowsky
Eliezer and I love to talk about writing. We talk about our own current writing projects, how we’d improve the books we’re reading, and what we want…
7 months ago
“On ‘ChatGPT Psychosis’ and LLM Sycophancy” by jdp
As a person who frequently posts about large language model psychology I get an elevated rate of cranks and schizophrenics in my inbox. Often these …
7 months ago
“Subliminal Learning: LLMs Transmit Behavioral Traits via Hidden Signals in Data” by cloud, mle, Owain_Evans
Authors: Alex Cloud*, Minh Le*, James Chua, Jan Betley, Anna Sztyber-Betley, Jacob Hilton, Samuel Marks, Owain Evans (*Equal contribution, randomly …
7 months ago
“Love stays loved (formerly ‘Skin’)” by Swimmer963 (Miranda Dixon-Luinenburg)
This is a short story I wrote in mid-2022. Genre: cosmic horror as a metaphor for living with a high p-doom.
One
The last time I saw my mom, we m…
7 months, 1 week ago
“Make More Grayspaces” by Duncan Sabien (Inactive)
Author's note: These days, my thoughts go onto my substack by default, instead of onto LessWrong. Everything I write becomes free after a week or so…
7 months, 1 week ago
“Shallow Water is Dangerous Too” by jefftk
Content warning: risk to children
Julia and I knowdrowning is the biggestrisk to US children under 5, and we try to take this seriously.But yesterda…
7 months, 1 week ago
“Narrow Misalignment is Hard, Emergent Misalignment is Easy” by Edward Turner, Anna Soligo, Senthooran Rajamanoharan, Neel Nanda
Anna and Ed are co-first authors for this work. We’re presenting these results as a research update for a continuing body of work, which we hope wil…
7 months, 1 week ago
“Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety” by Tomek Korbak, Mikita Balesni, Vlad Mikulik, Rohin Shah
Twitter | Paper PDF
Seven years ago, OpenAI five had just been released, and many people in the AI safety community expected AIs to be opaque RL age…
7 months, 1 week ago
“the jackpot age” by thiccythot
This essay is about shifts in risk taking towards the worship of jackpots and its broader societal implications. Imagine you are presented with this…
7 months, 2 weeks ago
“Surprises and learnings from almost two months of Leo Panickssery” by Nina Panickssery
Leo was born at 5am on the 20th May, at home (this was an accident but the experience has made me extremely homebirth-pilled). Before that, I was on…
7 months, 2 weeks ago