Podcast Episodes
Back to Search“Will we really put data centers in space?” by Avi Parrack, fin
Abstract
Several major technology companies have announced plans to operate AI data centers in orbit. Elon Musk recently claimed: “the lowest-cost p…
2 weeks ago
“PLA Daily Translation: Reflections on Warfare Brought by AGI” by eeeee
Source
“Reflections on Warfare Brought by AGI” (AGI带来的战争思考)
Source: PLA Daily (解放军报)
Date: January 21, 2025
Authors: Rong Ming (荣明), Hu Xiaofeng (…
2 weeks ago
“Out-of-Context Reasoning (OOCR) in LLMs: A Short Primer and Reading List” by Owain_Evans
Out-of-context reasoning (OOCR) is a concept relevant to LLM generalization and AI alignment. Also available as a PDF.
Contents
What is OOCR?Example…2 weeks ago
“Numb mental state shifts” by KatjaGrace
There are different mental states that feel different. Those are relatively obvious. For instance, being angry or drunk or frustrated or besotted.
…
2 weeks ago
“You can opt out of allergies” by Rattengift
My friends are starting to sniffle and sneeze every time we speak, signalling it's finally the worst part of the year: The latter half of Spring, wh…
2 weeks, 1 day ago
“Notes on Collaborating with Claude Opus” by Nissa Seru
INTENT: Share elements of my mental model regarding collaboration with Claude Opus models. Not intentionally scoped to a specific model version, but…
2 weeks, 1 day ago
“Learned Chain-of-Thought Obfuscation Generalises to Unseen Tasks” by Nathaniel Mitrani, sassanb, Cam Tice, Puria
TL;DR
Training against a CoT or summary-only monitor can lead to obfuscation of dangerous reasoning in unseen tasks. This strengthens the “don’t tra…
2 weeks, 1 day ago
“What am I, if not an AI?” by makiba
TL:DR
I RL fine-tuned Mistral 7B Instruct v0.3 and Llama 3.1 8B Instruct to avoid self-identifying as a language model, without specifying a target …2 weeks, 2 days ago
“AI #169: New Knowledge” by Zvi
Even in a relatively quiet period, AI is out there creating new knowledge. The new knowledge in question is OpenAI getting us the first truly impres…
2 weeks, 2 days ago
“Loss of Oversight: How AI Systems May Become Harder to Audit, Monitor, and Investigate” by Jordan Taylor, Max H, Ed Fage, Thomas Read, Joseph Bloom
Produced by UK AISI Model Transparency and Situational Awareness teams. If you’re a Research Scientist or Research Engineer, we’re hiring – apply he…
2 weeks, 2 days ago