Podcast Episodes
Back to Search"The Owned Ones" by Eliezer Yudkowsky
(An LLM Whisperer placed a strong request that I put this story somewhere not on Twitter, so it could be scraped by robots not owned by Elon Musk. I…
2 weeks, 4 days ago
"The Iliad Intensive Course Materials" by Leon Lang, David Udell, Alexander Gietelink Oldenziel
We are releasing the course materials of the Iliad Intensive, a new month-long and full-time AI Alignment course that runs in-person every second mo…
2 weeks, 4 days ago
"The Darwinian Honeymoon - Why I am not as impressed by human progress as I used to be" by Elias Schmied
Crossposted from Substack and the EA Forum.
A common argument for optimism about the future is that living conditions have improved a lot in the p…
2 weeks, 4 days ago
"What I did in the hedonium shockwave, by Emma, age six and a half" by ozymandias
My name is Emma and I’m six and a half years old and I like pink and Pokemon and my cat River and I’m going to be swallowed by a hedonium shockwave …
2 weeks, 6 days ago
"Bad Problems Don’t Stop Being Bad Because Somebody’s Wrong About Fault Analysis" by Linch
Here's a dynamic I’ve seen at least a dozen times:
Alice: Man that article has a very inaccurate/misleading/horrifying headline.
Bob: Did you know…
3 weeks ago
"x-risk-themed" by kave
Sometimes, a friend who works around here, at an x-risk-themed organisation, will think about leaving their job. They’ll ask a group of people “what…
3 weeks, 1 day ago
"Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations" by Subhash Kantamneni, kitft, Euan Ong, Sam Marks
Abstract
We introduce Natural Language Autoencoders (NLAs), an unsupervised method for generating natural language explanations of LLM activations. …
3 weeks, 2 days ago
[Linkpost] "Interpreting Language Model Parameters" by Lucius Bushnaq, Dan Braun, Oliver Clive-Griffin, Bart Bussmann, Nathan Hu, mivanitskiy, Linda Linsefors, Lee Sharkey
This is a link post. This is the latest work in our Parameter Decomposition agenda. We introduce a new parameter decomposition method, adVersarial Pa…
3 weeks, 3 days ago
"It’s nice of you to worry about me, but I really do have a life" by Viliam
I have two shameful secrets that I probably shouldn't talk about online:
I love my family.I enjoy my hobbies. "What an idiot!" you probably think. "…
3 weeks, 5 days ago
"Irretrievability; or, Murphy’s Curse of Oneshotness upon ASI" by Eliezer Yudkowsky
Example 1: The Viking 1 lander
In the 1970s, NASA sent a pair of probes to Mars, Viking 1 and Viking 2 missions, at a total cost of 1 billion dollar…
3 weeks, 5 days ago