Podcast Episodes
Back to SearchThe Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
Episode 1579
🤗 Upvotes: 38 | cs.CL, cs.AI
Authors:
Qiguang Chen, Yantao Du, Ziniu Li, Jinhao Liu, Songyao Duan, Jiarui Guo, M…
2Â months ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
Episode 1578
🤗 Upvotes: 30 | cs.CL
Authors:
Jiajie Zhang, Xin Lv, Ling Feng, Lei Hou, Juanzi Li
Title:
…
2Â months ago
EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis
Episode 1577
🤗 Upvotes: 25 | cs.CL, cs.AI, cs.LG
Authors:
Xiaoshuai Song, Haofei Chang, Guanting Dong, Yutao Zhu, Zhicheng Do…
2Â months ago
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
Episode 1576
🤗 Upvotes: 22 | cs.CL
Authors:
Mingxin Li, Yanzhao Zhang, Dingkun Long, Keqin Chen, Sibo Song, Shuai Bai, Zhibo …
2Â months ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Episode 1575
🤗 Upvotes: 98 | cs.CL, cs.AI, cs.LG
Authors:
Shih-Yang Liu, Xin Dong, Ximing Lu, Shizhe Diao, Peter Belcak, Ming…
2Â months, 1Â week ago
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
Episode 1574
🤗 Upvotes: 29 | cs.LG
Authors:
Maksim Velikanov, Ilyas Chahed, Jingwei Zuo, Dhia Eddine Rhaiem, Younes Belkada, …
2Â months, 1Â week ago
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes
Episode 1573
🤗 Upvotes: 26 | cs.CV
Authors:
Yuan-Kang Lee, Kuan-Lin Chen, Chia-Che Chang, Yu-Lun Liu
Title:
…
2Â months, 1Â week ago
Token-Level LLM Collaboration via FusionRoute
Episode 1572
🤗 Upvotes: 26 | cs.AI, cs.CL, cs.LG
Authors:
Nuoya Xiong, Yuhang Zhou, Hanqing Zeng, Zhaorun Chen, Furong Huang,…
2Â months, 1Â week ago
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
Episode 1571
🤗 Upvotes: 67 | cs.LG, cs.AI, cs.CL
Authors:
Muxi Diao, Lele Yang, Wuxuan Gong, Yutong Zhang, Zhonghao Yan, Yufe…
2Â months, 1Â week ago
Evolving Programmatic Skill Networks
Episode 1570
🤗 Upvotes: 56 | cs.AI, cs.NE
Authors:
Haochen Shi, Xingdi Yuan, Bang Liu
Title:
Evolving…
2Â months, 1Â week ago