Podcast Episodes
Back to SearchPVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Episode 1106
🤗 Upvotes: 21 | cs.LG, cs.AI
Authors:
Wenfeng Feng, Penghong Zhao, Guochao Jiang, Chuzhan Hao, Yuewei Zhang, Hao…
10Â months ago
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Episode 1105
🤗 Upvotes: 84 | cs.CV, cs.AI, cs.LG
Authors:
Jie Jiang, Qi Yang, Bolin Ni, Shiming Xiang, Han Hu, Houwen Peng
10Â months ago
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Episode 1104
🤗 Upvotes: 40 | cs.CL, cs.AI
Authors:
Ming Hu, Chenglong Ma, Wei Li, Wanghan Xu, Jiamin Wu, Jucheng Hu, Tianbin …
10Â months ago
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
Episode 1103
🤗 Upvotes: 59 | cs.LG, cs.CL
Authors:
Yizhi Li, Qingshui Gu, Zhoufutu Wen, Ziniu Li, Tianshun Xing, Shuyue Guo, …
10Â months ago
VibeVoice Technical Report
Episode 1102
🤗 Upvotes: 45 | cs.CL, cs.AI, cs.SD, eess.AS
Authors:
Zhiliang Peng, Jianwei Yu, Wenhui Wang, Yaoyao Chang, Yuta…
10Â months ago
CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics
Episode 1101
🤗 Upvotes: 43 | cs.LG, cs.AI
Authors:
Weida Wang, Dongchen Huang, Jiatong Li, Tengchao Yang, Ziyang Zheng, Di Zh…
10Â months ago
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Episode 1100
🤗 Upvotes: 28 | cs.CV
Authors:
Lin Li, Zehuan Huang, Haoran Feng, Gengxiong Zhuang, Rui Chen, Chunchao Guo, Lu S…
10Â months ago
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation
Episode 1099
🤗 Upvotes: 26 | cs.CV
Authors:
Jianwen Jiang, Weihong Zeng, Zerong Zheng, Jiaqi Yang, Chao Liang, Wang Liao, Han…
10Â months ago
Spacer: Towards Engineered Scientific Inspiration
Episode 1098
🤗 Upvotes: 25 | cs.AI, cs.LG, cs.NE
Authors:
Minhyeong Lee, Suyoung Hwang, Seunghyun Moon, Geonho Nah, Donghyun …
10Â months ago
UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
Episode 1097
🤗 Upvotes: 23 | cs.LG
Authors:
Zihao Huang, Yu Bao, Qiyang Min, Siyan Chen, Ran Guo, Hongzhi Huang, Defa Zhu, Yu…
10Â months ago