Podcast Episodes
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
Episode 1212
🤗 Upvotes: 103 | cs.CV, cs.AI
Authors:
Qinsi Wang, Bo Liu, Tianyi Zhou, Jing Shi, Yueqian Lin, Yiran Chen, Hai H…
7 months ago
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning
Episode 1211
🤗 Upvotes: 57 | cs.CL
Authors:
Shaobo Wang, Jiaming Wang, Jiajun Zhang, Cong Wang, Yue Min, Zichen Wen, Fei Huan…
7 months ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
Episode 1210
🤗 Upvotes: 45 | cs.CL, cs.AI, cs.LG
Authors:
Zhepei Wei, Xiao Yang, Kai Sun, Jiaqi Wang, Rulin Shao, Sean Chen, …
7 months ago
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training
Episode 1209
🤗 Upvotes: 36 | cs.LG, cs.AI, cs.CV, cs.MM
Authors:
Junlin Han, Shengbang Tong, David Fan, Yufan Ren, Koustuv Si…
7 months ago
OceanGym: A Benchmark Environment for Underwater Embodied Agents
Episode 1208
🤗 Upvotes: 30 | cs.CL, cs.AI, cs.CV, cs.LG, cs.RO
Authors:
Yida Xue, Mingjun Mao, Xiangyuan Ru, Yuqi Zhu, Baocha…
7 months ago
More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
Episode 1207
🤗 Upvotes: 29 | cs.CV, cs.AI
Authors:
Xinyu Tian, Shu Zou, Zhaoyuan Yang, Mengqi He, Fabian Waschkowski, Lukas W…
7 months ago
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
Episode 1206
🤗 Upvotes: 26 | cs.LG, cs.CL
Authors:
Xin Xu, Cliveb AI, Kai Yang, Tianhao Chen, Yang Wang, Saiyong Yang, Can Ya…
7 months ago
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
Episode 1205
🤗 Upvotes: 26 | cs.CV, cs.AI
Authors:
Junyu Chen, Wenkun He, Yuchao Gu, Yuyang Zhao, Jincheng Yu, Junsong Chen, …
7 months ago
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
Episode 1204
🤗 Upvotes: 98 | cs.LG, cs.AI, cs.CV
Authors:
Jintao Zhang, Haoxu Wang, Kai Jiang, Shuo Yang, Kaiwen Zheng, Haoch…
7 months, 1 week ago
StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs
Episode 1203
🤗 Upvotes: 58 | cs.CL
Authors:
Yuhan Song, Linhao Zhang, Chuhan Wu, Aiwei Liu, Wei Jia, Houfeng Wang, Xiao Zhou
7 months, 1 week ago