Podcast Episodes

Back to Search
Learning Adaptive Parallel Reasoning with Language Models

Episode 705

🤗 Upvotes: 35 | cs.AI, cs.CL

Authors:
Jiayi Pan, Xiuyu Li, Long Lian, Charlie Snell, Yifei Zhou, Adam Yala, Trev…

10 months, 3 weeks ago

Short Long
View Episode
Learning to Reason under Off-Policy Guidance

Episode 704

🤗 Upvotes: 59 | cs.LG, cs.AI, cs.CL

Authors:
Jianhao Yan, Yafu Li, Zican Hu, Zhi Wang, Ganqu Cui, Xiaoye Qu, Yu …

10 months, 3 weeks ago

Short Long
View Episode
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Episode 703

🤗 Upvotes: 50 | cs.CV

Authors:
Guo Chen, Zhiqi Li, Shihao Wang, Jindong Jiang, Yicheng Liu, Lidong Lu, De-An Hua…

10 months, 3 weeks ago

Short Long
View Episode
FlowReasoner: Reinforcing Query-Level Meta-Agents

Episode 702

🤗 Upvotes: 36 | cs.AI

Authors:
Hongcheng Gao, Yue Liu, Yufei He, Longxu Dou, Chao Du, Zhijie Deng, Bryan Hooi, M…

10 months, 3 weeks ago

Short Long
View Episode
ToolRL: Reward is All Tool Learning Needs

Episode 701

🤗 Upvotes: 33 | cs.LG, cs.AI, cs.CL

Authors:
Cheng Qian, Emre Can Acikgoz, Qi He, Hongru Wang, Xiusi Chen, Dilek…

10 months, 3 weeks ago

Short Long
View Episode
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents

Episode 700

🤗 Upvotes: 25 | cs.CR, cs.AI, cs.CL, cs.LG, cs.MA

Authors:
Salman Rahman, Liwei Jiang, James Shiffer, Genglin Li…

10 months, 3 weeks ago

Short Long
View Episode
StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians

Episode 699

🤗 Upvotes: 21 | cs.CV

Authors:
Cailin Zhuang, Yaoqi Hu, Xuanyang Zhang, Wei Cheng, Jiacheng Bao, Shengqi Liu, Yi…

10 months, 3 weeks ago

Short Long
View Episode
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Episode 698

🤗 Upvotes: 64 | cs.AI, cs.CL, cs.CV

Authors:
Yang Yue, Zhiqi Chen, Rui Lu, Andrew Zhao, Zhaokai Wang, Yang Yue, …

10 months, 3 weeks ago

Short Long
View Episode
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space

Episode 697

🤗 Upvotes: 31 | cs.CL, cs.AI

Authors:
Yicheng Chen, Yining Li, Kai Hu, Zerun Ma, Haochen Ye, Kai Chen

…

10 months, 3 weeks ago

Short Long
View Episode
NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes

Episode 696

🤗 Upvotes: 30 | cs.AI

Authors:
Tianyang Xu, Haojie Zheng, Chengze Li, Haoxiang Chen, Yixin Liu, Ruoxi Chen, Lich…

10 months, 3 weeks ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us