Podcast Episodes

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Episode 1045

🤗 Upvotes: 28 | cs.LG, cs.AI, cs.CL

Authors:
Zhenpeng Su, Leiyu Pan, Xue Bai, Dening Liu, Guanting Dong, Jiaming…

7 months, 1 week ago

MolmoAct: Action Reasoning Models that can Reason in Space

Episode 1044

🤗 Upvotes: 22 | cs.RO

Authors:
Jason Lee, Jiafei Duan, Haoquan Fang, Yuquan Deng, Shuo Liu, Boyang Li, Bohan Fan…

7 months, 1 week ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Episode 1043

🤗 Upvotes: 79 | cs.CL

Authors:
GLM-4.5 Team, Aohan Zeng, Xin Lv, Qinkai Zheng, Zhenyu Hou, Bin Chen, Chengxi…

7 months, 1 week ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Episode 1042

🤗 Upvotes: 36 | cs.GR, cs.AI, cs.CV, cs.LG

Authors:
Seungyong Lee, Jeong-gi Kwak

7 months, 1 week ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Episode 1041

🤗 Upvotes: 143 | cs.AI, cs.CL, cs.LG

Authors:
Chengshuai Zhao, Zhen Tan, Pingchuan Ma, Dawei Li, Bohan Jiang, Ya…

7 months, 2 weeks ago

VeriGUI: Verifiable Long-Chain GUI Dataset

Episode 1040

🤗 Upvotes: 117 | cs.HC

Authors:
Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Wendong…

7 months, 2 weeks ago

Efficient Agents: Building Effective Agents While Reducing Cost

Episode 1039

🤗 Upvotes: 53 | cs.AI, cs.CL, cs.MA

Authors:
Ningning Wang, Xavier Hu, Pai Liu, He Zhu, Yue Hou, Heyuan Huang, S…

7 months, 2 weeks ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Episode 1038

🤗 Upvotes: 38 | cs.AI, cs.CL, cs.CV, cs.LG, cs.MA, cs.MM

Authors:
Zeyi Sun, Ziyu Liu, Yuhang Zang, Yuhang Cao, X…

7 months, 2 weeks ago

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

Episode 1037

🤗 Upvotes: 32 | cs.LG, cs.CL, cs.SE

Authors:
Alexander Golubev, Maria Trofimova, Sergei Polezhaev, Ibragim Bader…

7 months, 2 weeks ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Episode 1036

🤗 Upvotes: 29 | cs.LG, cs.AI

Authors:
George Bredis, Stanislav Dereka, Viacheslav Sinii, Ruslan Rakhimov, Daniil…

7 months, 2 weeks ago

