Podcast Episodes
[Episode 69] O1 Replication Journey: Part 1
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic:
O1 Replication Journey: A Strategic Progress Report -- Part 1
Summary
This research report details a te…
1 year, 6 months ago
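[Episode 68] The stream-x algorithms: online reinforcement learning without Experience Replay
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic: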
Deep Reinforcement Learning Without Experience Replay, Target Networks, or Batch Updates
Summary
This r…
1 year, 6 months ago
[Episode 67] BABY-AIGS: AI-Generated Science
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic:
AIGS: Generating Science from AI-Powered Automated Falsification
Summary
This research paper introduces…
1 year, 6 months ago
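[Episode 66] Anthropic research: adding some "statistics" to LLM evaluation
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic: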
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
Summary
This paper adv…
1 year, 6 months ago
[Episode 65] Liquid Time-constant Networks: what are liquid (neural) networks?
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic:
Liquid Time-constant Networks
Summary
This research introduces Liquid Time-Constant Networks (LTCs), a …
1 year, 6 months ago
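[Episode 64] NeuroClips: reconstructing the video in the brain from fMRI data
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic: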
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
Summary
The study introduces …
1 year, 6 months ago
[Episode 63] DPO or PPO, how should Preference Feedback be used?
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic:
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Summary
This …
1 year, 6 months ago
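[Episode 62] sCMs: an image generation algorithm faster than Diffusion
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic: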
Simplifying, stabilizing, and scaling continuous-time consistency models
Summary
This research paper in…
1 year, 6 months ago
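[Episode 61] What is a large model's "reasoning" actually doing?
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic: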
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Summary
This research inv…
1 year, 7 months ago
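[Episode 60] RLTools: an open-source reinforcement learning tool based on C++
Seventy3: Turning papers into podcasts with NotebookLM, so everyone can keep learning alongside AI.
Today's topic: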
RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
Summary
RLtools, a…
1 year, 7 months ago