Podcast Episodes

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Episode 1112

🤗 Upvotes: 63 | cs.CV, cs.LG

Authors:
Xiyao Wang, Chunyuan Li, Jianwei Yang, Kai Zhang, Bo Liu, Tianyi Xiong, Fu…

8 months ago

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Episode 1111

🤗 Upvotes: 50 | cs.CV, cs.AI

Authors:
Hao Lu, Jiahao Wang, Yaolun Zhang, Ruohui Wang, Xuanyu Zheng, Yepeng Tang,…

8 months ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

Episode 1110

🤗 Upvotes: 39 | cs.CV

Authors:
Yuan Liu, Zhongyin Zhao, Le Tian, Haicheng Wang, Xubing Ye, Yangxiu You, Zilin Yu…

8 months ago

Baichuan-M2: Scaling Medical Capability with Large Verifier System

Episode 1109

🤗 Upvotes: 28 | cs.LG, cs.AI

Authors:
Baichuan-M2 Team: Chengfeng Dou, Chong Liu, Fan Yang, Fei Li, Jiyuan Ji…

8 months ago

Kwai Keye-VL 1.5 Technical Report

Episode 1108

🤗 Upvotes: 26 | cs.CV

Authors:
Biao Yang, Bin Wen, Boyang Ding, Changyi Liu, Chenglong Chu, Chengru Song, Chongl…

8 months ago

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Episode 1107

🤗 Upvotes: 25 | cs.CL

Authors:
Mohammad Zbeeb, Hasan Abed Al Kader Hammoud, Bernard Ghanem

8 months ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Episode 1106

🤗 Upvotes: 21 | cs.LG, cs.AI

Authors:
Wenfeng Feng, Penghong Zhao, Guochao Jiang, Chuzhan Hao, Yuewei Zhang, Hao…

8 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Episode 1105

🤗 Upvotes: 84 | cs.CV, cs.AI, cs.LG

Authors:
Jie Jiang, Qi Yang, Bolin Ni, Shiming Xiang, Han Hu, Houwen Peng

8 months ago

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Episode 1104

🤗 Upvotes: 40 | cs.CL, cs.AI

Authors:
Ming Hu, Chenglong Ma, Wei Li, Wanghan Xu, Jiamin Wu, Jucheng Hu, Tianbin …

8 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Episode 1103

🤗 Upvotes: 59 | cs.LG, cs.CL

Authors:
Yizhi Li, Qingshui Gu, Zhoufutu Wen, Ziniu Li, Tianshun Xing, Shuyue Guo, …

8 months, 1 week ago

