Podcast Episodes
Back to Search
【第105期】MAXINFORL:最大化对底层任务信息增益的强化学习
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization
Summar…
1 year, 3 months ago
【第104期】STAR:无梯度的进化优化算法
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
STAR: Synthesis of Tailored Architectures
Summary
This research paper introduces STAR, a novel framewor…
1 year, 3 months ago
【第103期】开源和闭源大型语言模型的比较研究
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
The Open Source Advantage in Large Language Models (LLMs)
Summary
This research paper compares open-sou…
1 year, 3 months ago
【第102期】Byte Latent Transformer (BLT):用byte级替代token级
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Byte Latent Transformer: Patches Scale Better Than Tokens
Summary
The paper introduces the Byte Latent …
1 year, 3 months ago
【第101期】Large Concept Models (LCMs)
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Large Concept Models: Language Modeling in a Sentence Representation Space
Summary
This research paper …
1 year, 3 months ago
【第100期】SLM更懂LLM提示词
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Smaller Language Models Are Better Instruction Evolvers
Summary
This research paper investigates the su…
1 year, 3 months ago
【第99期】GREATER:一种对于小模型的提示词优化技术
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Summary
The pa…
1 year, 3 months ago
【第98期】SPaR:通过搜索树改进LLM指令遵循
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models…
1 year, 3 months ago
【第97期】SCBench:基于KV Cache的评估长上下文LLM基准
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
SCBench: A KV Cache-Centric Analysis of Long-Context Methods
Summary
The paper introduces SCBench, a ne…
1 year, 3 months ago
【第96期】AsyncLM:异步LLM函数调用
Seventy3: 用NotebookLM将论文生成播客,让大家跟着AI一起进步。
今天的主题是:
Asynchronous LLM Function Calling
Summary
This research paper introduces AsyncLM, a system designed to …
1 year, 3 months ago