Podcast Episodes
Back to SearchLLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Episode 1112
🤗 Upvotes: 63 | cs.CV, cs.LG
Authors:
Xiyao Wang, Chunyuan Li, Jianwei Yang, Kai Zhang, Bo Liu, Tianyi Xiong, Fu…
8Â months ago
ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding
Episode 1111
🤗 Upvotes: 50 | cs.CV, cs.AI
Authors:
Hao Lu, Jiahao Wang, Yaolun Zhang, Ruohui Wang, Xuanyu Zheng, Yepeng Tang,…
8Â months ago
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion
Episode 1110
🤗 Upvotes: 39 | cs.CV
Authors:
Yuan Liu, Zhongyin Zhao, Le Tian, Haicheng Wang, Xubing Ye, Yangxiu You, Zilin Yu…
8Â months ago
Baichuan-M2: Scaling Medical Capability with Large Verifier System
Episode 1109
🤗 Upvotes: 28 | cs.LG, cs.AI
Authors:
Baichuan-M2 Team, :, Chengfeng Dou, Chong Liu, Fan Yang, Fei Li, Jiyuan Ji…
8Â months ago
Kwai Keye-VL 1.5 Technical Report
Episode 1108
🤗 Upvotes: 26 | cs.CV
Authors:
Biao Yang, Bin Wen, Boyang Ding, Changyi Liu, Chenglong Chu, Chengru Song, Chongl…
8Â months ago
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic
Episode 1107
🤗 Upvotes: 25 | cs.CL
Authors:
Mohammad Zbeeb, Hasan Abed Al Kader Hammoud, Bernard Ghanem
Title:
…
8Â months ago
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Episode 1106
🤗 Upvotes: 21 | cs.LG, cs.AI
Authors:
Wenfeng Feng, Penghong Zhao, Guochao Jiang, Chuzhan Hao, Yuewei Zhang, Hao…
8Â months ago
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Episode 1105
🤗 Upvotes: 84 | cs.CV, cs.AI, cs.LG
Authors:
Jie Jiang, Qi Yang, Bolin Ni, Shiming Xiang, Han Hu, Houwen Peng
8Â months ago
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Episode 1104
🤗 Upvotes: 40 | cs.CL, cs.AI
Authors:
Ming Hu, Chenglong Ma, Wei Li, Wanghan Xu, Jiamin Wu, Jucheng Hu, Tianbin …
8Â months ago
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
Episode 1103
🤗 Upvotes: 59 | cs.LG, cs.CL
Authors:
Yizhi Li, Qingshui Gu, Zhoufutu Wen, Ziniu Li, Tianshun Xing, Shuyue Guo, …
8Â months, 1Â week ago