Podcast Episodes
“METR’s Evaluation of GPT-5” by GradientDissenter
METR (where I work, though I'm cross-posting in a personal capacity) evaluated GPT-5 before it was externally deployed. We performed a much more com…
6 months, 2 weeks ago
“Emotions Make Sense” by DaystarEld
For the past five years I've been teaching a class at various rationality camps, workshops, conferences, etc. I’ve done it maybe 50 times in total, …
6 months, 3 weeks ago
“The Problem” by Rob Bensinger, tanagrabeast, yams, So8res, Eliezer Yudkowsky, Gretta Duleba
This is a new introduction to AI as an extinction threat, previously posted to the MIRI website in February alongside a summary. It was written inde…
6 months, 3 weeks ago
“Many prediction markets would be better off as batched auctions” by William Howard
All prediction market platforms trade continuously, which is the same mechanism the stock market uses. Buy and sell limit orders can be posted at an…
6 months, 3 weeks ago
“Whence the Inkhaven Residency?” by Ben Pace
Essays like Paul Graham's, Scott Alexander's, and Eliezer Yudkowsky's have influenced a generation of people in how they think about startups, ethic…
6 months, 3 weeks ago
“I am worried about near-term non-LLM AI developments” by testingthewaters
TL;DR
I believe that:
Almost all LLM-centric safety research will not provide any significant safety value with regards to existential or civilisat…
7 months ago
“Optimizing The Final Output Can Obfuscate CoT (Research Note)” by lukemarks, jacob_drori, cloud, TurnTrout
Produced as part of MATS 8.0 under the mentorship of Alex Turner and Alex Cloud. This research note overviews some early results which we are lookin…
7 months ago
“About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong” by bohaska
FutureHouse is a company that builds literature research agents. They tested it on the bio + chem subset of HLE questions, then noticed errors in th…
7 months ago
“Maya’s Escape” by Bridgett Kay
Maya did not believe she lived in a simulation. She knew that her continued hope that she could escape from the nonexistent simulation was based on …
7 months ago
“Do confident short timelines make sense?” by TsviBT, abramdemski
TsviBT: Tsvi's context
Some context:
My personal context is that I care about decreasing existential risk, and I think that the broad distribution o…
7 months ago