Podcast Episodes

“Prism: Automating Science-of-Evals Research” by LAThomson

tl;dr – we present [Prism], a scaffold for automating science-of-evals research: work that makes the evaluation the primary object of study. The sca…

2 weeks, 5 days ago

Short Long

View Episode

“The Flood, by Anton Leicht” by Austin Chen

Note: I'm crossposting Anton's newest article from his blog. Anton covers AI policy angles in a singular fashion; every article he writes is worth r…

2 weeks, 5 days ago

Short Long

View Episode

“Toy Models of Initialisation Effects on RL Dynamics” by Edward James Young, lennie

This is a follow-up to two posts Geodesic released last week on our current research direction. The code for generating the figures can be found at …

2 weeks, 5 days ago

Short Long

View Episode

“Our response to Séb Krier on Plan A” by MKodama, Thomas Larsen

This criticism of AI 2040: Plan A by Séb Krier unfortunately seriously mischaracterizes our proposal. It also mostly contains flat assertions, not r…

2 weeks, 5 days ago

Short Long

View Episode

“Better Call Sol The Workhorse” by Zvi

OpenAI's GPT-5.6-Sol is finally here, along with the cheaper Terra and Luna.

We’ve seen the early hype as reported on Thursday, but as always that …

2 weeks, 5 days ago

Short Long

View Episode

“Pausing AI at human level seems harder than pausing ASAP” by MichaelDickens

Cross-posted from my website.

Some people think we should pause AI, but not now. They say we should wait until AI reaches human level, [1] because…

2 weeks, 5 days ago

Short Long

View Episode

“It’s 2030 and we fucked up. How did it happen?” by Boaz Barak

[Linkpost to this. Loosely based on a lecture I gave in the recursive conference with the same title. Don’t take “2030” literally¹ —it could also be…

2 weeks, 5 days ago

Short Long

View Episode

“The Whitney Biennial Should Admit That Emilie Gossiaux Wants to Fuck Their Dog” by jenn

content warnings: depictions of human and anthro nudity, discussion of bestiality, modern art

Credit where it's due: it is genuinely, unironically b…

2 weeks, 5 days ago

Short Long

View Episode

“The US Government may find it difficult to seize control during takeoff” by RobertM

Epistemic status: conditioning on things I consider unlikely, many undiscussed considerations, not the whole story, etc. I'm not trying to advance a…

2 weeks, 6 days ago

Short Long

View Episode

“One-Pager Brief on Pangram Labs” by Sheikh Abdur Raheem Ali

Pangram Labs builds the most accurate AI text detector in the world. Team is >25 FTE; they are active on Twitter, you can engage directly, look for …

2 weeks, 6 days ago

Short Long

View Episode

Podcast Episodes

“Prism: Automating Science-of-Evals Research” by LAThomson

“The Flood, by Anton Leicht” by Austin Chen

“Toy Models of Initialisation Effects on RL Dynamics” by Edward James Young, lennie

“Our response to Séb Krier on Plan A” by MKodama, Thomas Larsen

“Better Call Sol The Workhorse” by Zvi

“Pausing AI at human level seems harder than pausing ASAP” by MichaelDickens

“It’s 2030 and we fucked up. How did it happen?” by Boaz Barak

“The Whitney Biennial Should Admit That Emilie Gossiaux Wants to Fuck Their Dog” by jenn

“The US Government may find it difficult to seize control during takeoff” by RobertM

“One-Pager Brief on Pangram Labs” by Sheikh Abdur Raheem Ali

Love PodBriefly?