Podcast Episodes

“Optimizing The Final Output Can Obfuscate CoT (Research Note)” by lukemarks, jacob_drori, cloud, TurnTrout

Produced as part of MATS 8.0 under the mentorship of Alex Turner and Alex Cloud. This research note overviews some early results which we are lookin…

8 months, 2 weeks ago

Short Long

View Episode

“About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong” by bohaska

FutureHouse is a company that builds literature research agents. They tested it on the bio + chem subset of HLE questions, then noticed errors in th…

8 months, 2 weeks ago

Short Long

View Episode

“Maya’s Escape” by Bridgett Kay

Maya did not believe she lived in a simulation. She knew that her continued hope that she could escape from the nonexistent simulation was based on …

8 months, 2 weeks ago

Short Long

View Episode

“Do confident short timelines make sense?” by TsviBT, abramdemski

TsviBT Tsvi's context

Some context:

My personal context is that I care about decreasing existential risk, and I think that the broad distribution o…

8 months, 3 weeks ago

Short Long

View Episode

“HPMOR: The (Probably) Untold Lore” by Gretta Duleba, Eliezer Yudkowsky

Eliezer and I love to talk about writing. We talk about our own current writing projects, how we’d improve the books we’re reading, and what we want…

8 months, 3 weeks ago

Short Long

View Episode

“On ‘ChatGPT Psychosis’ and LLM Sycophancy” by jdp

As a person who frequently posts about large language model psychology I get an elevated rate of cranks and schizophrenics in my inbox. Often these …

8 months, 3 weeks ago

Short Long

View Episode

“Subliminal Learning: LLMs Transmit Behavioral Traits via Hidden Signals in Data” by cloud, mle, Owain_Evans

Authors: Alex Cloud*, Minh Le*, James Chua, Jan Betley, Anna Sztyber-Betley, Jacob Hilton, Samuel Marks, Owain Evans (*Equal contribution, randomly …

8 months, 3 weeks ago

Short Long

View Episode

“Love stays loved (formerly ‘Skin’)” by Swimmer963 (Miranda Dixon-Luinenburg)

This is a short story I wrote in mid-2022. Genre: cosmic horror as a metaphor for living with a high p-doom.

One

The last time I saw my mom, we m…

8 months, 3 weeks ago

Short Long

View Episode

“Make More Grayspaces” by Duncan Sabien (Inactive)

Author's note: These days, my thoughts go onto my substack by default, instead of onto LessWrong. Everything I write becomes free after a week or so…

8 months, 3 weeks ago

Short Long

View Episode

“Shallow Water is Dangerous Too” by jefftk

Content warning: risk to children

Julia and I knowdrowning is the biggestrisk to US children under 5, and we try to take this seriously.But yesterda…

8 months, 3 weeks ago

Short Long

View Episode

Podcast Episodes

“Optimizing The Final Output Can Obfuscate CoT (Research Note)” by lukemarks, jacob_drori, cloud, TurnTrout

“About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong” by bohaska

“Maya’s Escape” by Bridgett Kay

“Do confident short timelines make sense?” by TsviBT, abramdemski

“HPMOR: The (Probably) Untold Lore” by Gretta Duleba, Eliezer Yudkowsky

“On ‘ChatGPT Psychosis’ and LLM Sycophancy” by jdp

“Subliminal Learning: LLMs Transmit Behavioral Traits via Hidden Signals in Data” by cloud, mle, Owain_Evans

“Love stays loved (formerly ‘Skin’)” by Swimmer963 (Miranda Dixon-Luinenburg)

“Make More Grayspaces” by Duncan Sabien (Inactive)

“Shallow Water is Dangerous Too” by jefftk

Love PodBriefly?