Podcast Episodes

Back to Search
No image available

The End of Manual Decoding: Towards Truly End-to-End Language Models


Episode 1337


🤗 Upvotes: 70 | cs.CL, cs.AI

Authors:
Zhichao Wang, Dongyang Ma, Xinting Huang, Deng Cai, Tian Lan, Jiahao Xu, Haitao Mi, Xiaoying Tang, Yan Wang

Titl…


Published on 1 week ago

No image available

Kimi Linear: An Expressive, Efficient Attention Architecture


Episode 1336


🤗 Upvotes: 40 | cs.CL, cs.LG

Authors:
Kimi Team, Yu Zhang, Zongyu Lin, Xingcheng Yao, Jiaxi Hu, Fanqing Meng, Chengyin Liu, Xin Men, Songlin Yang, Zhiyuan Li, Wen…


Published on 1 week ago

No image available

Surfer 2: The Next Generation of Cross-Platform Computer Use Agents


Episode 1335


🤗 Upvotes: 28 | cs.AI

Authors:
Mathieu Andreux, Märt Bakler, Yanael Barbier, Hamza Benchekroun, Emilien Biré, Antoine Bonnet, Riaz Bordie, Nathan Bout, Matthias B…


Published on 1 week ago

No image available

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark


Episode 1334


🤗 Upvotes: 27 | cs.CV, cs.AI, cs.CL

Authors:
Ziyu Guo, Xinyan Chen, Renrui Zhang, Ruichuan An, Yu Qi, Dongzhi Jiang, Xiangtai Li, Manyuan Zhang, Hongsheng Li, Phe…


Published on 1 week ago

No image available

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation


Episode 1333


🤗 Upvotes: 25 | cs.CV

Authors:
Jing Lin, Ruisi Wang, Junzhe Lu, Ziqi Huang, Guorui Song, Ailing Zeng, Xian Liu, Chen Wei, Wanqi Yin, Qingping Sun, Zhongang Cai, L…


Published on 1 week ago

No image available

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations


Episode 1332


🤗 Upvotes: 147 | cs.CV

Authors:
Yujia Zhang, Xiaoyang Wu, Yixing Lao, Chengyao Wang, Zhuotao Tian, Naiyan Wang, Hengshuang Zhao

Title:
Con…


Published on 1 week, 3 days ago

No image available

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning


Episode 1331


🤗 Upvotes: 79 | cs.LG, cs.AI, cs.CL

Authors:
Ling Team, Bin Han, Caizhi Tang, Chen Liang, Donghao Zhang, Fan Yuan, Feng Zhu, Jie Gao, Jingyu Hu, Longfei Li, Meng …


Published on 2 weeks, 1 day ago

No image available

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping


Episode 1330


🤗 Upvotes: 68 | cs.LG, cs.AI, cs.CL

Authors:
Zhiheng Xi, Xin Guo, Yang Nan, Enyu Zhou, Junrui Shen, Wenxiang Chen, Jiaqi Liu, Jixuan Huang, Zhihao Zhang, Honglin …


Published on 2 weeks, 1 day ago

No image available

LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts


Episode 1329


🤗 Upvotes: 44 | cs.CL

Authors:
Siyuan Wang, Gaokai Zhang, Li Lyna Zhang, Ning Shang, Fan Yang, Dongyao Chen, Mao Yang

Title:
LoongRL:Reinf…


Published on 2 weeks, 1 day ago

No image available

Language Models are Injective and Hence Invertible


Episode 1328


🤗 Upvotes: 42 | cs.LG, cs.AI

Authors:
Giorgos Nikolaou, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Yannis Panagakis, Emanuele Rodolà

Titl…


Published on 2 weeks, 1 day ago





If you like Podbriefly.com, please consider donating to support the ongoing development.

Donate