Podcast Episodes

Back to Search
No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping

Episode 1185

🤗 Upvotes: 27 | cs.CL, cs.AI, cs.LG

Authors:
Thanh-Long V. Le, Myeongho Jeon, Kim Vu, Viet Lai, Eunho Yang

…

5 months, 3 weeks ago

Short Long
View Episode
VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Episode 1184

🤗 Upvotes: 95 | cs.LG, cs.CL

Authors:
Guochao Jiang, Wenfeng Feng, Guofeng Quan, Chuzhan Hao, Yuewei Zhang, Guoh…

5 months, 3 weeks ago

Short Long
View Episode
SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

Episode 1183

🤗 Upvotes: 76 | cs.CL

Authors:
Yizhou Wang, Chen Tang, Han Deng, Jiabei Xiao, Jiaqi Liu, Jianyu Wu, Jun Yao, Pen…

5 months, 3 weeks ago

Short Long
View Episode
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Episode 1182

🤗 Upvotes: 67 | cs.CV

Authors:
Sicong Leng, Jing Wang, Jiaxi Li, Hao Zhang, Zhiqiang Hu, Boqiang Zhang, Yuming J…

5 months, 3 weeks ago

Short Long
View Episode
Tree Search for LLM Agent Reinforcement Learning

Episode 1181

🤗 Upvotes: 58 | cs.LG, cs.AI

Authors:
Yuxiang Ji, Ziyu Ma, Yong Wang, Guanhua Chen, Xiangxiang Chu, Liaoni Wu

…

5 months, 3 weeks ago

Short Long
View Episode
Seedream 4.0: Toward Next-generation Multimodal Image Generation

Episode 1180

🤗 Upvotes: 46 | cs.CV

Authors:
Team Seedream, Yunpeng Chen, Yu Gao, Lixue Gong, Meng Guo, Qiushan Guo, Zhiyao Gu…

5 months, 3 weeks ago

Short Long
View Episode
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Episode 1179

🤗 Upvotes: 28 | cs.CV, cs.AI

Authors:
Team Hunyuan3D, :, Bowen Zhang, Chunchao Guo, Haolin Liu, Hongyu Yan, Huiw…

5 months, 3 weeks ago

Short Long
View Episode
AutoIntent: AutoML for Text Classification

Episode 1178

🤗 Upvotes: 22 | cs.CL

Authors:
Ilya Alekseev, Roman Solomatin, Darina Rustamova, Denis Kuznetsov

Tit…

5 months, 3 weeks ago

Short Long
View Episode
Video models are zero-shot learners and reasoners

Episode 1177

🤗 Upvotes: 49 | cs.LG, cs.AI, cs.CV, cs.RO

Authors:
Thaddäus Wiedemer, Yuxuan Li, Paul Vicol, Shixiang Shane Gu,…

5 months, 3 weeks ago

Short Long
View Episode
SIM-CoT: Supervised Implicit Chain-of-Thought

Episode 1176

🤗 Upvotes: 28 | cs.CL, cs.AI

Authors:
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Jiaqi Wang, …

5 months, 3 weeks ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us