Podcast Episodes
“Optimizing The Final Output Can Obfuscate CoT (Research Note)” by lukemarks, jacob_drori, cloud, TurnTrout
Produced as part of MATS 8.0 under the mentorship of Alex Turner and Alex Cloud. This research note overviews some early results which we are lookin…
8 months, 2 weeks ago
“About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong” by bohaska
FutureHouse is a company that builds literature research agents. They tested it on the bio + chem subset of HLE questions, then noticed errors in th…
8 months, 2 weeks ago
“Maya’s Escape” by Bridgett Kay
Maya did not believe she lived in a simulation. She knew that her continued hope that she could escape from the nonexistent simulation was based on …
8 months, 2 weeks ago
“Do confident short timelines make sense?” by TsviBT, abramdemski
TsviBT Tsvi's context
Some context:
My personal context is that I care about decreasing existential risk, and I think that the broad distribution o…
8 months, 3 weeks ago
“HPMOR: The (Probably) Untold Lore” by Gretta Duleba, Eliezer Yudkowsky
Eliezer and I love to talk about writing. We talk about our own current writing projects, how we’d improve the books we’re reading, and what we want…
8 months, 3 weeks ago
“On ‘ChatGPT Psychosis’ and LLM Sycophancy” by jdp
As a person who frequently posts about large language model psychology I get an elevated rate of cranks and schizophrenics in my inbox. Often these …
8 months, 3 weeks ago
“Subliminal Learning: LLMs Transmit Behavioral Traits via Hidden Signals in Data” by cloud, mle, Owain_Evans
Authors: Alex Cloud*, Minh Le*, James Chua, Jan Betley, Anna Sztyber-Betley, Jacob Hilton, Samuel Marks, Owain Evans (*Equal contribution, randomly …
8 months, 3 weeks ago
“Love stays loved (formerly ‘Skin’)” by Swimmer963 (Miranda Dixon-Luinenburg)
This is a short story I wrote in mid-2022. Genre: cosmic horror as a metaphor for living with a high p-doom.
One
The last time I saw my mom, we m…
8 months, 3 weeks ago
“Make More Grayspaces” by Duncan Sabien (Inactive)
Author's note: These days, my thoughts go onto my substack by default, instead of onto LessWrong. Everything I write becomes free after a week or so…
8 months, 3 weeks ago
“Shallow Water is Dangerous Too” by jefftk
Content warning: risk to children
Julia and I know drowning is the biggest risk to US children under 5, and we try to take this seriously. But yesterda…
8 months, 3 weeks ago