Podcast Episodes
Back to SearchMemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models
Episode 1886
🤗 Upvotes: 62 | cs.CV
Authors:
Xiyu Ren, Zhaowei Wang, Yiming Du, Zhongwei Xie, Chi Liu, Xinlin Yang, Haoyue Fen…
1Â month, 2Â weeks ago
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer
Episode 1885
🤗 Upvotes: 53 | cs.CV
Authors:
Haoyi Zhu, Haozhe Liu, Yuyang Zhao, Tian Ye, Junsong Chen, Jincheng Yu, Tong He, …
1Â month, 2Â weeks ago
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
Episode 1884
🤗 Upvotes: 47 | cs.CV, cs.CL, cs.IR
Authors:
Minghao Guo, Qingyue Jiao, Zeru Shi, Yihao Quan, Boxuan Zhang, Danr…
1Â month, 2Â weeks ago
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning
Episode 1883
🤗 Upvotes: 44 | cs.NE, cs.AI
Authors:
Taebong Kim, Youngsik Hong, Minsik Kim, Sunyoung Choi, Jaewon Jang, Jungho…
1Â month, 2Â weeks ago
Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems
Episode 1882
🤗 Upvotes: 41 | cs.AI
Authors:
Shihao Qi, Jie Ma, Rui Xing, Wei Guo, Xiao Huang, Zhitao Gao, Jianhao Deng, Jun L…
1Â month, 2Â weeks ago
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?
Episode 1881
🤗 Upvotes: 37 | cs.CL
Authors:
Hanxiang Chao, Yihan Bai, Rui Sheng, Tianle Li, Yushi Sun
Title:
…
1Â month, 2Â weeks ago
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation
Episode 1880
🤗 Upvotes: 36 | cs.CL
Authors:
Shuangrui Ding, Xuanlang Dai, Long Xing, Shengyuan Ding, Ziyu Liu, Yang JingYi, P…
1Â month, 2Â weeks ago
MinT: Managed Infrastructure for Training and Serving Millions of LLMs
Episode 1879
🤗 Upvotes: 144 | cs.LG, cs.AI, cs.DC
Authors:
Mind Lab, :, Song Cao, Vic Cao, Andrew Chen, Kaijie Chen, Cleon Ch…
1Â month, 2Â weeks ago
MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image
Episode 1878
🤗 Upvotes: 122 | cs.LG, cs.CL, cs.CV
Authors:
Alan Arazi, Eilam Shapira, Shoham Grunblat, Mor Ventura, Elad Hoff…
1Â month, 2Â weeks ago
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation
Episode 1877
🤗 Upvotes: 79 | cs.CV, cs.AI
Authors:
Yuchao Gu, Guian Fang, Yuxin Jiang, Weijia Mao, Song Han, Han Cai, Mike Zh…
1Â month, 2Â weeks ago