Podcast Episodes
On Path to Multimodal Generalist: General-Level and General-Bench
Episode 745
🤗 Upvotes: 55 | cs.CV
Authors:
Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu…
10 months, 1 week ago
Flow-GRPO: Training Flow Matching Models via Online RL
Episode 744
🤗 Upvotes: 36 | cs.CV, cs.AI
Authors:
Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang,…
10 months, 1 week ago
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Episode 743
🤗 Upvotes: 57 | cs.CV
Authors:
Xinjie Zhang, Jintao Guo, Shanshan Zhao, Minghao Fu, Lunhao Duan, Guo-Hua Wang, Q…
10 months, 1 week ago
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Episode 742
🤗 Upvotes: 35 | cs.CL
Authors:
Hao Sun, Zile Qiao, Jiayan Guo, Xuanbo Fan, Yingyan Hou, Yong Jiang, Pengjun Xie,…
10 months, 1 week ago
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Episode 741
🤗 Upvotes: 67 | cs.CV
Authors:
Yibin Wang, Zhimin Li, Yuhang Zang, Chunyu Wang, Qinglin Lu, Cheng Jin, Jiaqi Wan…
10 months, 1 week ago
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Episode 740
🤗 Upvotes: 63 | cs.LG, cs.AI, cs.CL
Authors:
Andrew Zhao, Yiran Wu, Yang Yue, Tong Wu, Quentin Xu, Yang Yue, Mat…
10 months, 1 week ago
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
Episode 739
🤗 Upvotes: 23 | cs.CL, cs.AI, cs.LG, I.2.7
Authors:
Daniel Goldstein, Eric Alcaide, Janna Lu, Eugene Cheah
10 months, 1 week ago
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios
Episode 738
🤗 Upvotes: 21 | cs.CV, cs.AI, cs.MM
Authors:
Shiyi Zhang, Junhao Zhuang, Zhaoyang Zhang, Ying Shan, Yansong Tang…
10 months, 1 week ago
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Episode 737
🤗 Upvotes: 56 | cs.AI, cs.CL, cs.SD
Authors:
Yemin Shi, Yu Shu, Siwei Dong, Guangyi Liu, Jaward Sesay, Jingwen L…
10 months, 1 week ago
RM-R1: Reward Modeling as Reasoning
Episode 736
🤗 Upvotes: 48 | cs.CL, cs.AI, cs.LG
Authors:
Xiusi Chen, Gaotang Li, Ziqi Wang, Bowen Jin, Cheng Qian, Yu Wang, …
10 months, 1 week ago