Episode Details
Back to Episodes“Claude Opus 4.8: The System Card” by Zvi
Description
Only six weeks after Opus 4.7, we have Opus 4.8.
For everyone, that means another incremental upgrade to Claude. It is once again smarter, and can do tasks for longer, and comes with a number of hot new features.
For me, that also means reading another 244 page system card.
It was only April 20 when I did a full review of the Opus 4.7 system card, plus an additional post focusing on related issues of model welfare.
These updates are incremental and coming more rapidly, and this still is below the capability level of Claude Mythos, so the focus will be on the delta. What is different about Opus 4.8 versus what we already know about Opus 4.7 and Mythos?
It turns out there's still a lot to talk about.
Image created as self-portrait for this post by Claude Opus 4.8Table of Contents
- Here We Go Again: Executive Summary.
- Introduction (1).
- RSP Evaluations (2).
- Move That Goalpost.
- The Failures Are News.
- Alignment Risk Slowly Rises.
- New Risk Pathways Just Dropped.
- Cyber (3).
- Harmful Requests (4.1).
- We Need To Talk (4.2 [...]
---
Outline:
(01:16) Here We Go Again: Executive Summary
(02:33) Introduction (1)
(02:42) RSP Evaluations (2)
(03:47) Move That Goalpost
(05:41) The Failures Are News
(07:33) Alignment Risk Slowly Rises
(09:00) New Risk Pathways Just Dropped
(11:26) Cyber (3)
(12:22) Harmful Requests (4.1)
(14:23) We Need To Talk (4.2 and 4.3)
(17:36) Overcoming Bias (4.4)
(19:33) Agentic Safety (5)
(21:40) Prompt Injection (5.2)
(25:18) Alignment (6)
(26:33) Looking For Problems
(27:55) Who Watches The Training (6.2.2)
(32:07) Automated Behavioral Audit
(32:47) The Model Is Smarter Than The Eval (6.2.3.2)
(34:39) You Should See The Other Guy
(36:30) UK AISI Testing (6.2.4)
(36:50) In Vendbench (6.2.5)
(39:27) Honesty (6.3.3 to 6.3.6)
(41:35) Chain of Thought (CoT) Monitorability (6.5)
(44:09) What's In The Box? (6.6)
(45:57) That's All For Now
---
First published:
May 29th, 2026
Source:
https://www.lesswrong.com/posts/Gx6cJ6cG9JfeSNcLB/claude-opus-4-8-the-system-card
---
Narrated by TYPE III AUDIO.
---
Love PodBriefly?
If you like Podbriefly.com, please consider donating to support the ongoing development.
Support Us