Podcast Episodes

Back to Search
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Episode 1579

🤗 Upvotes: 38 | cs.CL, cs.AI

Authors:
Qiguang Chen, Yantao Du, Ziniu Li, Jinhao Liu, Songyao Duan, Jiarui Guo, M…

2 months ago

Short Long
View Episode
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Episode 1578

🤗 Upvotes: 30 | cs.CL

Authors:
Jiajie Zhang, Xin Lv, Ling Feng, Lei Hou, Juanzi Li

Title:
…

2 months ago

Short Long
View Episode
EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Episode 1577

🤗 Upvotes: 25 | cs.CL, cs.AI, cs.LG

Authors:
Xiaoshuai Song, Haofei Chang, Guanting Dong, Yutao Zhu, Zhicheng Do…

2 months ago

Short Long
View Episode
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Episode 1576

🤗 Upvotes: 22 | cs.CL

Authors:
Mingxin Li, Yanzhao Zhang, Dingkun Long, Keqin Chen, Sibo Song, Shuai Bai, Zhibo …

2 months ago

Short Long
View Episode
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Episode 1575

🤗 Upvotes: 98 | cs.CL, cs.AI, cs.LG

Authors:
Shih-Yang Liu, Xin Dong, Ximing Lu, Shizhe Diao, Peter Belcak, Ming…

2 months, 1 week ago

Short Long
View Episode
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers

Episode 1574

🤗 Upvotes: 29 | cs.LG

Authors:
Maksim Velikanov, Ilyas Chahed, Jingwei Zuo, Dhia Eddine Rhaiem, Younes Belkada, …

2 months, 1 week ago

Short Long
View Episode
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

Episode 1573

🤗 Upvotes: 26 | cs.CV

Authors:
Yuan-Kang Lee, Kuan-Lin Chen, Chia-Che Chang, Yu-Lun Liu

Title:
…

2 months, 1 week ago

Short Long
View Episode
Token-Level LLM Collaboration via FusionRoute

Episode 1572

🤗 Upvotes: 26 | cs.AI, cs.CL, cs.LG

Authors:
Nuoya Xiong, Yuhang Zhou, Hanqing Zeng, Zhaorun Chen, Furong Huang,…

2 months, 1 week ago

Short Long
View Episode
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Episode 1571

🤗 Upvotes: 67 | cs.LG, cs.AI, cs.CL

Authors:
Muxi Diao, Lele Yang, Wuxuan Gong, Yutong Zhang, Zhonghao Yan, Yufe…

2 months, 1 week ago

Short Long
View Episode
Evolving Programmatic Skill Networks

Episode 1570

🤗 Upvotes: 56 | cs.AI, cs.NE

Authors:
Haochen Shi, Xingdi Yuan, Bang Liu

Title:
Evolving…

2 months, 1 week ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us