Podcast Episodes
Back to Search
【第75期】cDPO:通过发掘critical tokens去修正回答
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability
Summary…
1 year, 4 months ago
【第74期】苏格拉底游戏:AI Agent的脑内活动
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Boundless Socratic Learning with Language Games
Summary
This position paper explores the concept of Soc…
1 year, 4 months ago
【第73期】HiAR-ICL:LLM推理的ICL
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Summary
This r…
1 year, 4 months ago
【第72期】LLM-Brained GUI Agents: A Survey
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Large Language Model-Brained GUI Agents: A Survey
Summary
This paper surveys the development and applic…
1 year, 4 months ago
【第71期】英伟达的audio大模型Fugatto
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Fugatto 1:Foundational Generative Audio Transformer Opus 1
Summary
The document describes Fugatto, a no…
1 year, 4 months ago
【第70期】O1 Replication Journey:Part 2
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or …
1 year, 4 months ago
【第69期】O1 Replication Journey:Part 1
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
O1 Replication Journey: A Strategic Progress Report -- Part 1
Summary
This research report details a te…
1 year, 4 months ago
【第68期】stream-x算法,省去Experience Replay的在线强化学习
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Deep Reinforcement Learning Without Experience Replay, Target Networks, or Batch Updates
Summary
This r…
1 year, 4 months ago
【第67期】BABY-AIGS:AI-Generated Science
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
AIGS: Generating Science from AI-Powered Automated Falsification
Summary
This research paper introduces…
1 year, 4 months ago
【第66期】Anthropic研究:给LLM评估加点“统计学”
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
Summary
This paper adv…
1 year, 4 months ago