Podcast Episodes
Back to Search“Legible vs. Illegible AI Safety Problems” by Wei Dai
Some AI safety problems are legible (obvious or understandable) to company leaders and government policymakers, implying they are unlikely to deploy…
5 months, 1 week ago
“Lack of Social Grace is a Lack of Skill” by Screwtape
1.
I have claimed that one of the fundamental questions of rationality is “what am I about to do and what will happen next?” One of the domains I a…
5 months, 1 week ago
[Linkpost] “I ate bear fat with honey and salt flakes, to prove a point” by aggliu
This is a link post. Eliezer Yudkowsky did not exactly suggest that you should eat bear fat covered with honey and sprinkled with salt flakes.
What h…
5 months, 1 week ago
“What’s up with Anthropic predicting AGI by early 2027?” by ryan_greenblatt
As far as I'm aware, Anthropic is the only AI company with official AGI timelines[1]: they expect AGI by early 2027. In their recommendations (from …
5 months, 1 week ago
[Linkpost] “Emergent Introspective Awareness in Large Language Models” by Drake Thomas
This is a link post. New Anthropic research (tweet, blog post, paper):
We investigate whether large language models can introspect on their internal…
5 months, 1 week ago
[Linkpost] “You’re always stressed, your mind is always busy, you never have enough time” by mingyuan
This is a link post. You have things you want to do, but there's just never time. Maybe you want to find someone to have kids with, or maybe you want…
5 months, 1 week ago
“LLM-generated text is not testimony” by TsviBT
Crosspost from my blog.
Synopsis
When we share words with each other, we don't only care about the words themselves. We care also—even primarily—ab…
5 months, 1 week ago
“Post title: Why I Transitioned: A Case Study” by Fiora Sunshine
An Overture
Famously, trans people tend not to have great introspective clarity into their own motivations for transition. Intuitively, they tend to…
5 months, 2 weeks ago
“The Memetics of AI Successionism” by Jan_Kulveit
TL;DR: AI progress and the recognition of associated risks are painful to think about. This cognitive dissonance acts as fertile ground in the memet…
5 months, 2 weeks ago
“How Well Does RL Scale?” by Toby_Ord
This is the latest in a series of essays on AI Scaling.
You can find the others on my site.
Summary: RL-training for LLMs scales surprisingly poorl…
5 months, 2 weeks ago