Podcast Episodes
Back to SearchNo Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping
Episode 1185
🤗 Upvotes: 27 | cs.CL, cs.AI, cs.LG
Authors:
Thanh-Long V. Le, Myeongho Jeon, Kim Vu, Viet Lai, Eunho Yang
5Â months, 3Â weeks ago
VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models
Episode 1184
🤗 Upvotes: 95 | cs.LG, cs.CL
Authors:
Guochao Jiang, Wenfeng Feng, Guofeng Quan, Chuzhan Hao, Yuewei Zhang, Guoh…
5Â months, 3Â weeks ago
SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines
Episode 1183
🤗 Upvotes: 76 | cs.CL
Authors:
Yizhou Wang, Chen Tang, Han Deng, Jiabei Xiao, Jiaqi Liu, Jianyu Wu, Jun Yao, Pen…
5Â months, 3Â weeks ago
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
Episode 1182
🤗 Upvotes: 67 | cs.CV
Authors:
Sicong Leng, Jing Wang, Jiaxi Li, Hao Zhang, Zhiqiang Hu, Boqiang Zhang, Yuming J…
5Â months, 3Â weeks ago
Tree Search for LLM Agent Reinforcement Learning
Episode 1181
🤗 Upvotes: 58 | cs.LG, cs.AI
Authors:
Yuxiang Ji, Ziyu Ma, Yong Wang, Guanhua Chen, Xiangxiang Chu, Liaoni Wu
5Â months, 3Â weeks ago
Seedream 4.0: Toward Next-generation Multimodal Image Generation
Episode 1180
🤗 Upvotes: 46 | cs.CV
Authors:
Team Seedream, Yunpeng Chen, Yu Gao, Lixue Gong, Meng Guo, Qiushan Guo, Zhiyao Gu…
5Â months, 3Â weeks ago
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets
Episode 1179
🤗 Upvotes: 28 | cs.CV, cs.AI
Authors:
Team Hunyuan3D, :, Bowen Zhang, Chunchao Guo, Haolin Liu, Hongyu Yan, Huiw…
5Â months, 3Â weeks ago
AutoIntent: AutoML for Text Classification
Episode 1178
🤗 Upvotes: 22 | cs.CL
Authors:
Ilya Alekseev, Roman Solomatin, Darina Rustamova, Denis Kuznetsov
Tit…
5Â months, 3Â weeks ago
Video models are zero-shot learners and reasoners
Episode 1177
🤗 Upvotes: 49 | cs.LG, cs.AI, cs.CV, cs.RO
Authors:
Thaddäus Wiedemer, Yuxuan Li, Paul Vicol, Shixiang Shane Gu,…
5Â months, 3Â weeks ago
SIM-CoT: Supervised Implicit Chain-of-Thought
Episode 1176
🤗 Upvotes: 28 | cs.CL, cs.AI
Authors:
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Jiaqi Wang, …
5Â months, 3Â weeks ago