Podcast Episodes
Back to SearchACON: Optimizing Context Compression for Long-horizon LLM Agents
Episode 1215
🤗 Upvotes: 21 | cs.AI, cs.CL
Authors:
Minki Kang, Wei-Ning Chen, Dongge Han, Huseyin A. Inan, Lukas Wutschitz, Y…
5Â months, 2Â weeks ago
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use
Episode 1214
🤗 Upvotes: 124 | cs.CL, cs.AI
Authors:
Zijian Wu, Xiangyan Liu, Xinyuan Zhang, Lingjun Chen, Fanqing Meng, Lingx…
5Â months, 2Â weeks ago
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Episode 1213
🤗 Upvotes: 106 | cs.NE, cs.AI, cs.LG, stat.ML
Authors:
Adrian Kosowski, Przemysław Uznański, Jan Chorowski, Zuza…
5Â months, 2Â weeks ago
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
Episode 1212
🤗 Upvotes: 103 | cs.CV, cs.AI
Authors:
Qinsi Wang, Bo Liu, Tianyi Zhou, Jing Shi, Yueqian Lin, Yiran Chen, Hai H…
5Â months, 2Â weeks ago
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning
Episode 1211
🤗 Upvotes: 57 | cs.CL
Authors:
Shaobo Wang, Jiaming Wang, Jiajun Zhang, Cong Wang, Yue Min, Zichen Wen, Fei Huan…
5Â months, 2Â weeks ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
Episode 1210
🤗 Upvotes: 45 | cs.CL, cs.AI, cs.LG
Authors:
Zhepei Wei, Xiao Yang, Kai Sun, Jiaqi Wang, Rulin Shao, Sean Chen, …
5Â months, 2Â weeks ago
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training
Episode 1209
🤗 Upvotes: 36 | cs.LG, cs.AI, cs.CV, cs.MM
Authors:
Junlin Han, Shengbang Tong, David Fan, Yufan Ren, Koustuv Si…
5Â months, 2Â weeks ago
OceanGym: A Benchmark Environment for Underwater Embodied Agents
Episode 1208
🤗 Upvotes: 30 | cs.CL, cs.AI, cs.CV, cs.LG, cs.RO
Authors:
Yida Xue, Mingjun Mao, Xiangyuan Ru, Yuqi Zhu, Baocha…
5Â months, 2Â weeks ago
More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
Episode 1207
🤗 Upvotes: 29 | cs.CV, cs.AI
Authors:
Xinyu Tian, Shu Zou, Zhaoyuan Yang, Mengqi He, Fabian Waschkowski, Lukas W…
5Â months, 2Â weeks ago
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
Episode 1206
🤗 Upvotes: 26 | cs.LG, cs.CL
Authors:
Xin Xu, Cliveb AI, Kai Yang, Tianhao Chen, Yang Wang, Saiyong Yang, Can Ya…
5Â months, 2Â weeks ago