Podcast Episodes
Back to Search"Opinionated Takes on Meetups Organizing" by jenn
Screwtape, as the global ACX meetups czar, has to be reasonable and responsible in his advice giving for running meetups.
And the advice is great! I…
3 months, 3 weeks ago
"How to game the METR plot" by shash42
TL;DR: In 2025, we were in the 1-4 hour range, which has only 14 samples in METR's underlying data. The topic of each sample is public, making it ea…
3 months, 3 weeks ago
"Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers" by Sam Marks, Adam Karvonen, James Chua, Subhash Kantamneni, Euan Ong, Julian Minder, Clément Dumas, Owain_Evans
TL;DR: We train LLMs to accept LLM neural activations as inputs and answer arbitrary questions about them in natural language. These Activation Orac…
3 months, 3 weeks ago
"Scientific breakthroughs of the year" by technicalities
A couple of years ago, Gavin became frustrated with science journalism. No one was pulling together results across fields; the articles usually did…
3 months, 4 weeks ago
"A high integrity/epistemics political machine?" by Raemon
I have goals that can only be reached via a powerful political machine. Probably a lot of other people around here share them. (Goals include “ensur…
3 months, 4 weeks ago
"How I stopped being sure LLMs are just making up their internal experience (but the topic is still confusing)" by Kaj_Sotala
How it started
I used to think that anything that LLMs said about having something like subjective experience or what it felt like on the inside was…
3 months, 4 weeks ago
“My AGI safety research—2025 review, ’26 plans” by Steven Byrnes
Previous: 2024, 2022
“Our greatest fear should not be of failure, but of succeeding at something that doesn't really matter.” –attributed to DL Mood…
4 months ago
“Weird Generalization & Inductive Backdoors” by Jorio Cocola, Owain_Evans, dylan_f
This is the abstract and introduction of our new paper.
Links: 📜 Paper, 🐦 Twitter thread, 🌐 Project page, 💻 Code
Authors: Jan Betley*, Jorio Cocola…
4 months ago
“Insights into Claude Opus 4.5 from Pokémon” by Julian Bradshaw
Credit: Nano Banana, with some text provided. You may be surprised to learn that ClaudePlaysPokemon is still running today, and that Claude still has…
4 months ago
“The funding conversation we left unfinished” by jenn
People working in the AI industry are making stupid amounts of money, and word on the street is that Anthropic is going to have some sort of liquidi…
4 months ago