Podcast Episodes
Back to SearchKlear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
Episode 1045
🤗 Upvotes: 28 | cs.LG, cs.AI, cs.CL
Authors:
Zhenpeng Su, Leiyu Pan, Xue Bai, Dening Liu, Guanting Dong, Jiaming…
7Â months, 1Â week ago
MolmoAct: Action Reasoning Models that can Reason in Space
Episode 1044
🤗 Upvotes: 22 | cs.RO
Authors:
Jason Lee, Jiafei Duan, Haoquan Fang, Yuquan Deng, Shuo Liu, Boyang Li, Bohan Fan…
7Â months, 1Â week ago
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Episode 1043
🤗 Upvotes: 79 | cs.CL
Authors:
GLM-4. 5 Team, :, Aohan Zeng, Xin Lv, Qinkai Zheng, Zhenyu Hou, Bin Chen, Chengxi…
7Â months, 1Â week ago
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off
Episode 1042
🤗 Upvotes: 36 | cs.GR, cs.AI, cs.CV, cs.LG
Authors:
Seungyong Lee, Jeong-gi Kwak
Title:
…
7Â months, 1Â week ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
Episode 1041
🤗 Upvotes: 143 | cs.AI, cs.CL, cs.LG
Authors:
Chengshuai Zhao, Zhen Tan, Pingchuan Ma, Dawei Li, Bohan Jiang, Ya…
7Â months, 2Â weeks ago
VeriGUI: Verifiable Long-Chain GUI Dataset
Episode 1040
🤗 Upvotes: 117 | cs.HC
Authors:
Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Wendong…
7Â months, 2Â weeks ago
Efficient Agents: Building Effective Agents While Reducing Cost
Episode 1039
🤗 Upvotes: 53 | cs.AI, cs.CL, cs.MA
Authors:
Ningning Wang, Xavier Hu, Pai Liu, He Zhu, Yue Hou, Heyuan Huang, S…
7Â months, 2Â weeks ago
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
Episode 1038
🤗 Upvotes: 38 | cs.AI, cs.CL, cs.CV, cs.LG, cs.MA, cs.MM
Authors:
Zeyi Sun, Ziyu Liu, Yuhang Zang, Yuhang Cao, X…
7Â months, 2Â weeks ago
Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning
Episode 1037
🤗 Upvotes: 32 | cs.LG, cs.CL, cs.SE
Authors:
Alexander Golubev, Maria Trofimova, Sergei Polezhaev, Ibragim Bader…
7Â months, 2Â weeks ago
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success
Episode 1036
🤗 Upvotes: 29 | cs.LG, cs.AI
Authors:
George Bredis, Stanislav Dereka, Viacheslav Sinii, Ruslan Rakhimov, Daniil…
7Â months, 2Â weeks ago