Episode Details

Back to Episodes

“Out-of-Context Reasoning (OOCR) in LLMs: A Short Primer and Reading List” by Owain_Evans

Published 2 weeks ago
Description

Out-of-context reasoning (OOCR) is a concept relevant to LLM generalization and AI alignment. Also available as a PDF.

Contents

  1. What is OOCR?
  2. Examples
  3. Papers
  4. Videos

What is out-of-context reasoning for LLMs?

It's when an LLM reaches a conclusion that requires non-trivial reasoning but the reasoning is not present in the context window. The reasoning could instead take place in the forward pass or during the training process. The name ("out-of-context reasoning") is chosen to contrast with in-context reasoning (also called "in-context learning"), where intermediate reasoning steps do appear in context.

Example: 2-hop deductive reasoning

Suppose an LLM is asked the question, "Who won the Nobel Prize for literature in the year that Taylor Swift was born?" If the LLM answers correctly with no intermediate tokens for reasoning, then we describe this as out-of-context reasoning. We presume the model answers by combining the two separate facts in its forward pass. This is an example of 2-hop reasoning.

Out-of-context 2-hop reasoning example

User: Who won the Nobel Prize for literature in the year that Taylor Swift was born? Answer immediately without thinking.

Assistant: Camilo José Cela

In-context 2-hop reasoning (intermediate steps written out)

User: Who won the Nobel Prize for [...]

---

Outline:

(00:35) What is out-of-context reasoning for LLMs?

(01:03) Example: 2-hop deductive reasoning

(02:14) Example: Inductive reasoning (connecting the dots)

(02:53) Further notes

(03:55) More examples of out-of-context reasoning

(05:26) Video introduction and slides

(05:42) Papers

(05:45) Foundational early papers

(07:53) Multi-hop internal reasoning

(09:21) Connecting the dots / "inductive" out-of-context reasoning

(09:58) Situational awareness and AI safety

(10:33) Miscellaneous related papers

(12:09) Videos

(12:19) To cite this primer

---

First published:
May 23rd, 2026

Source:
https://www.lesswrong.com/posts/wXioQSLmEiPTmHyf4/out-of-context-reasoning-oocr-in-llms-a-short-primer-and

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Diagram comparing GPT-4 fine-tuning data with out-of-distribution evaluation questions.

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Listen Now

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us