Podcast Episodes
Back to SearchLearning Adaptive Parallel Reasoning with Language Models
Episode 705
🤗 Upvotes: 35 | cs.AI, cs.CL
Authors:
Jiayi Pan, Xiuyu Li, Long Lian, Charlie Snell, Yifei Zhou, Adam Yala, Trev…
10Â months, 3Â weeks ago
Learning to Reason under Off-Policy Guidance
Episode 704
🤗 Upvotes: 59 | cs.LG, cs.AI, cs.CL
Authors:
Jianhao Yan, Yafu Li, Zican Hu, Zhi Wang, Ganqu Cui, Xiaoye Qu, Yu …
10Â months, 3Â weeks ago
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
Episode 703
🤗 Upvotes: 50 | cs.CV
Authors:
Guo Chen, Zhiqi Li, Shihao Wang, Jindong Jiang, Yicheng Liu, Lidong Lu, De-An Hua…
10Â months, 3Â weeks ago
FlowReasoner: Reinforcing Query-Level Meta-Agents
Episode 702
🤗 Upvotes: 36 | cs.AI
Authors:
Hongcheng Gao, Yue Liu, Yufei He, Longxu Dou, Chao Du, Zhijie Deng, Bryan Hooi, M…
10Â months, 3Â weeks ago
ToolRL: Reward is All Tool Learning Needs
Episode 701
🤗 Upvotes: 33 | cs.LG, cs.AI, cs.CL
Authors:
Cheng Qian, Emre Can Acikgoz, Qi He, Hongru Wang, Xiusi Chen, Dilek…
10Â months, 3Â weeks ago
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents
Episode 700
🤗 Upvotes: 25 | cs.CR, cs.AI, cs.CL, cs.LG, cs.MA
Authors:
Salman Rahman, Liwei Jiang, James Shiffer, Genglin Li…
10Â months, 3Â weeks ago
StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians
Episode 699
🤗 Upvotes: 21 | cs.CV
Authors:
Cailin Zhuang, Yaoqi Hu, Xuanyang Zhang, Wei Cheng, Jiacheng Bao, Shengqi Liu, Yi…
10Â months, 3Â weeks ago
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Episode 698
🤗 Upvotes: 64 | cs.AI, cs.CL, cs.CV
Authors:
Yang Yue, Zhiqi Chen, Rui Lu, Andrew Zhao, Zhaokai Wang, Yang Yue, …
10Â months, 3Â weeks ago
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
Episode 697
🤗 Upvotes: 31 | cs.CL, cs.AI
Authors:
Yicheng Chen, Yining Li, Kai Hu, Zerun Ma, Haochen Ye, Kai Chen
10Â months, 3Â weeks ago
NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes
Episode 696
🤗 Upvotes: 30 | cs.AI
Authors:
Tianyang Xu, Haojie Zheng, Chengze Li, Haoxiang Chen, Yixin Liu, Ruoxi Chen, Lich…
10Â months, 3Â weeks ago