Podcast Episodes

【第69期】O1 Replication Journey：Part 1

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
O1 Replication Journey: A Strategic Progress Report -- Part 1
Summary
This research report details a te…

1 year, 6 months ago

Short Long

View Episode

【第68期】stream-x算法，省去Experience Replay的在线强化学习

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
Deep Reinforcement Learning Without Experience Replay, Target Networks, or Batch Updates
Summary
This r…

1 year, 6 months ago

Short Long

View Episode

【第67期】BABY-AIGS：AI-Generated Science

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
AIGS: Generating Science from AI-Powered Automated Falsification
Summary
This research paper introduces…

1 year, 6 months ago

Short Long

View Episode

【第66期】Anthropic研究：给LLM评估加点“统计学”

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
Summary
This paper adv…

1 year, 6 months ago

Short Long

View Episode

【第65期】Liquid Time-constant Networks：液体（神经）网络是什么？

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
Liquid Time-constant Networks
Summary
This research introduces Liquid Time-Constant Networks (LTCs), a …

1 year, 6 months ago

Short Long

View Episode

【第64期】NeuroClips：从fMRI数据还原大脑中视频

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
Summary
The study introduces …

1 year, 6 months ago

Short Long

View Episode

【第63期】无论DPO还是PPO，Preference Feedback应该怎么用？

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Summary
This …

1 year, 6 months ago

Short Long

View Episode

【第62期】sCMs：比Diffusion更快的图像生成算法

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
Simplifying, stabilizing, and scaling continuous-time consistency models
Summary
This research paper in…

1 year, 6 months ago

Short Long

View Episode

【第61期】大模型的「推理」是在做什么？

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Summary
This research inv…

1 year, 7 months ago

Short Long

View Episode

【第60期】RLTools：基于C++的开源强化学习工具

Seventy3: 用NotebookLM将论文生成播客，让大家跟着AI一起进步。
今天的主题是：
RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
Summary
RLtools, a…

1 year, 7 months ago

Short Long

View Episode

Podcast Episodes

【第69期】O1 Replication Journey：Part 1

【第68期】stream-x算法，省去Experience Replay的在线强化学习

【第67期】BABY-AIGS：AI-Generated Science

【第66期】Anthropic研究：给LLM评估加点“统计学”

【第65期】Liquid Time-constant Networks：液体（神经）网络是什么？

【第64期】NeuroClips：从fMRI数据还原大脑中视频

【第63期】无论DPO还是PPO，Preference Feedback应该怎么用？

【第62期】sCMs：比Diffusion更快的图像生成算法

【第61期】大模型的「推理」是在做什么？

【第60期】RLTools：基于C++的开源强化学习工具

Love PodBriefly?