Podcast Episodes
Back to Search“Unexpected Things that are People” by Ben Goldhaber
Cross-posted from https://bengoldhaber.substack.com/
It's widely known that Corporations are People. This is universally agreed to be a good thing; …
3 months, 2 weeks ago
“Sonnet 4.5’s eval gaming seriously undermines alignment evals, and this seems caused by training on alignment evals” by Alexa Pan, ryan_greenblatt
According to the Sonnet 4.5 system card, Sonnet 4.5 is much more likely than Sonnet 4 to mention in its chain-of-thought that it thinks it is being e…
3 months, 3 weeks ago
“Publishing academic papers on transformative AI is a nightmare” by Jakub Growiec
I am a professor of economics. Throughout my career, I was mostly working on economic growth theory, and this eventually brought me to the topic of …
3 months, 3 weeks ago
“The Unreasonable Effectiveness of Fiction” by Raelifin
[Meta: This is Max Harms. I wrote a novel about China and AGI, which comes out today. This essay from my fiction newsletter has been slightly modifi…
3 months, 3 weeks ago
“Legible vs. Illegible AI Safety Problems” by Wei Dai
Some AI safety problems are legible (obvious or understandable) to company leaders and government policymakers, implying they are unlikely to deploy…
3 months, 3 weeks ago
“Lack of Social Grace is a Lack of Skill” by Screwtape
1.
I have claimed that one of the fundamental questions of rationality is “what am I about to do and what will happen next?” One of the domains I a…
3 months, 3 weeks ago
[Linkpost] “I ate bear fat with honey and salt flakes, to prove a point” by aggliu
This is a link post. Eliezer Yudkowsky did not exactly suggest that you should eat bear fat covered with honey and sprinkled with salt flakes.
What h…
3 months, 3 weeks ago
“What’s up with Anthropic predicting AGI by early 2027?” by ryan_greenblatt
As far as I'm aware, Anthropic is the only AI company with official AGI timelines[1]: they expect AGI by early 2027. In their recommendations (from …
3 months, 3 weeks ago
[Linkpost] “Emergent Introspective Awareness in Large Language Models” by Drake Thomas
This is a link post. New Anthropic research (tweet, blog post, paper):
We investigate whether large language models can introspect on their internal…
3 months, 3 weeks ago
[Linkpost] “You’re always stressed, your mind is always busy, you never have enough time” by mingyuan
This is a link post. You have things you want to do, but there's just never time. Maybe you want to find someone to have kids with, or maybe you want…
3 months, 3 weeks ago