Podcast Episodes
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance
Episode 1195
🤗 Upvotes: 29 | cs.CL
Authors:
Nicolas Boizard, Hippolyte Gisserot-Boukhlef, Kevin El-Haddad, Céline Hudelot, Pi…
5 months, 2 weeks ago
LongLive: Real-time Interactive Long Video Generation
Episode 1194
🤗 Upvotes: 136 | cs.CV
Authors:
Shuai Yang, Wei Huang, Ruihang Chu, Yicheng Xiao, Yuyang Zhao, Xianbang Wang, Mu…
5 months, 3 weeks ago
Quantile Advantage Estimation for Entropy-Safe Reasoning
Episode 1193
🤗 Upvotes: 102 | cs.LG, cs.AI
Authors:
Junkang Wu, Kexin Huang, Jiancan Wu, An Zhang, Xiang Wang, Xiangnan He
5 months, 3 weeks ago
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
Episode 1192
🤗 Upvotes: 98 | cs.LG, cs.CL
Authors:
Xu Wujiang, Wentian Zhao, Zhenting Wang, Li Yu-Jhe, Jin Can, Jin Mingyu, M…
5 months, 3 weeks ago
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Episode 1191
🤗 Upvotes: 81 | cs.CV, cs.CL
Authors:
Junbo Niu, Zheng Liu, Zhuangcheng Gu, Bin Wang, Linke Ouyang, Zhiyuan Zhao…
5 months, 3 weeks ago
ReviewScore: Misinformed Peer Review Detection with Large Language Models
Episode 1190
🤗 Upvotes: 54 | cs.CL
Authors:
Hyun Ryu, Doohyuk Jang, Hyemin S. Lee, Joonhyun Jeong, Gyeongman Kim, Donghyeon C…
5 months, 3 weeks ago
Variational Reasoning for Language Models
Episode 1189
🤗 Upvotes: 51 | cs.CL, cs.AI, cs.LG
Authors:
Xiangxin Zhou, Zichen Liu, Haonan Wang, Chao Du, Min Lin, Chongxuan…
5 months, 3 weeks ago
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Episode 1188
🤗 Upvotes: 48 | cs.CL, cs.AI, cs.LG
Authors:
Renjie Luo, Zichen Liu, Xiangyan Liu, Chao Du, Min Lin, Wenhu Chen,…
5 months, 3 weeks ago
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Episode 1187
🤗 Upvotes: 28 | cs.CV, cs.RO
Authors:
Jinkun Hao, Naifu Liang, Zhen Luo, Xudong Xu, Weipeng Zhong, Ran Yi, Yiche…
5 months, 3 weeks ago
CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning
Episode 1186
🤗 Upvotes: 28 | cs.CV, cs.AI, cs.CL
Authors:
Long Xing, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jianze Liang, Qido…
5 months, 3 weeks ago