Podcast Episodes
Back to SearchDiagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts
Episode 885
🤗 Upvotes: 32 | cs.LG, cs.CL
Authors:
Danil Sivtsov, Ivan Rodkin, Gleb Kuzmin, Yuri Kuratov, Ivan Oseledets
9Â months, 2Â weeks ago
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
Episode 884
🤗 Upvotes: 32 | cs.RO, cs.AI, cs.CV
Authors:
Enshen Zhou, Jingkun An, Cheng Chi, Yi Han, Shanyu Rong, Chi Zhang,…
9Â months, 2Â weeks ago
Video World Models with Long-term Spatial Memory
Episode 883
🤗 Upvotes: 30 | cs.CV
Authors:
Tong Wu, Shuai Yang, Ryan Po, Yinghao Xu, Ziwei Liu, Dahua Lin, Gordon Wetzstein
9Â months, 2Â weeks ago
Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights
Episode 882
🤗 Upvotes: 27 | cs.AI
Authors:
Mathieu Andreux, Breno Baldas Skuk, Hamza Benchekroun, Emilien Biré, Antoine Bonn…
9Â months, 2Â weeks ago
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
Episode 881
🤗 Upvotes: 24 | cs.CL
Authors:
Yanzhao Zhang, Mingxin Li, Dingkun Long, Xin Zhang, Huan Lin, Baosong Yang, Pengj…
9Â months, 2Â weeks ago
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
Episode 880
🤗 Upvotes: 23 | cs.CV
Authors:
Xiangdong Zhang, Jiaqi Liao, Shaofeng Zhang, Fanqing Meng, Xiangpeng Wan, Junchi …
9Â months, 2Â weeks ago
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
Episode 879
🤗 Upvotes: 22 | cs.CL, cs.LG
Authors:
Nikhil Kandpal, Brian Lester, Colin Raffel, Sebastian Majstorovic, Stella …
9Â months, 2Â weeks ago
VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos
Episode 878
🤗 Upvotes: 21 | cs.CV
Authors:
Hanoona Rasheed, Abdelrahman Shaker, Anqi Tang, Muhammad Maaz, Ming-Hsuan Yang, S…
9Â months, 2Â weeks ago
MiMo-VL Technical Report
Episode 877
🤗 Upvotes: 58 | cs.CL
Authors:
Xiaomi LLM-Core Team, :, Zihao Yue, Zhenru Lin, Yifan Song, Weikun Wang, Shuhuai …
9Â months, 2Â weeks ago
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Episode 876
🤗 Upvotes: 41 | cs.LG, cs.AI, cs.CL, cs.CV
Authors:
Shuang Chen, Yue Guo, Zhaochen Su, Yafu Li, Yulun Wu, Jiache…
9Â months, 2Â weeks ago