Podcast Episodes
Back to Search"Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research" by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez
TL;DR: This document lays out the case for research on “model organisms of misalignment” – in vitro demonstrations of the kinds of failures that migh…
2 years, 8 months ago
"My current LK99 questions" by Eliezer Yudkowsky
So this morning I thought to myself, "Okay, now I will actually try to study the LK99 question, instead of betting based on nontechnical priors and m…
2 years, 8 months ago
"The "public debate" about AI is confusing for the general public and for policymakers because it is a three-sided debate" by Adam David Long
Summary of Argument: The public debate among AI experts is confusing because there are, to a first approximation, three sides, not two sides to the d…
2 years, 8 months ago
"ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks" by Beth Barnes
Blogpost version
Paper
We have just released our first public report. It introduces methodology for assessing the capacity of LLM agents to acquire res…
2 years, 8 months ago
"Thoughts on sharing information about language model capabilities" by paulfchristiano
I believe that sharing information about the capabilities and limits of existing ML systems, and especially language model agents, significantly redu…
2 years, 8 months ago
"Yes, It's Subjective, But Why All The Crabs?" by johnswentworth
Some early biologist, equipped with knowledge of evolution but not much else, might see all these crabs and expect a common ancestral lineage. That’s…
2 years, 8 months ago
"Self-driving car bets" by paulfchristiano
This month I lost a bunch of bets.
Back in early 2016 I bet at even odds that self-driving ride sharing would be available in 10 US cities by July 202…
2 years, 8 months ago
"Cultivating a state of mind where new ideas are born" by Henrik Karlsson
In the early 2010s, a popular idea was to provide coworking spaces and shared living to people who were building startups. That way the founders woul…
2 years, 8 months ago
"Rationality !== Winning" by Raemon
I think "Rationality is winning" is a bit of a trap.
(The original phrase is notably "rationality is systematized winning", which is better, but it t…
2 years, 8 months ago
"Brain Efficiency Cannell Prize Contest Award Ceremony" by Alexander Gietelink Oldenziel
Previously Jacob Cannell wrote the post "Brain Efficiency" which makes several radical claims: that the brain is at the pareto frontier of speed, ene…
2 years, 8 months ago