Podcast Episodes
Back to SearchMVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
Episode 1382
🤗 Upvotes: 24 | cs.CV
Authors:
Huiyi Chen, Jiawei Peng, Dehai Min, Changchang Sun, Kaijie Chen, Yan Yan, Xu Yang…
7Â months, 2Â weeks ago
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
Episode 1381
🤗 Upvotes: 22 | cs.CV
Authors:
Jiaze Li, Hao Yin, Wenhui Tan, Jingyang Chen, Boshen Xu, Yuxun Qu, Yijing Chen, J…
7Â months, 2Â weeks ago
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data
Episode 1380
🤗 Upvotes: 87 | cs.CL, cs.AI, cs.CV
Authors:
Yunxin Li, Xinyu Chen, Shenyuan Jiang, Haoyuan Shi, Zhenyu Liu, Xua…
7Â months, 2Â weeks ago
P1: Mastering Physics Olympiads with Reinforcement Learning
Episode 1379
🤗 Upvotes: 107 | cs.LG, cs.AI, cs.CL
Authors:
Jiacheng Chen, Qianjia Cheng, Fangchen Yu, Haiyuan Wan, Yuchen Zha…
7Â months, 2Â weeks ago
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
Episode 1378
🤗 Upvotes: 104 | cs.CL
Authors:
MiroMind Team, Song Bai, Lidong Bing, Carson Chen, Guanzheng Chen, Yuntao Chen, …
7Â months, 2Â weeks ago
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
Episode 1377
🤗 Upvotes: 75 | cs.CL
Authors:
Shalini Maiti, Amar Budhiraja, Bhavul Gauri, Gaurav Chaurasia, Anton Protopopov, …
7Â months, 2Â weeks ago
Part-X-MLLM: Part-aware 3D Multimodal Large Language Model
Episode 1376
🤗 Upvotes: 63 | cs.CV
Authors:
Chunshi Wang, Junliang Ye, Yunhan Yang, Yang Li, Zizhuo Lin, Jun Zhu, Zhuo Chen, …
7Â months, 2Â weeks ago
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation
Episode 1375
🤗 Upvotes: 47 | cs.CV
Authors:
Ye Tian, Ling Yang, Jiongfan Yang, Anran Wang, Yu Tian, Jiani Zheng, Haochen Wang…
7Â months, 2Â weeks ago
GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning
Episode 1374
🤗 Upvotes: 46 | cs.IR, cs.AI, cs.LG
Authors:
Duolin Sun, Meixiu Long, Dan Yang, Yihan Jiao, Zhehao Tan, Jie Feng…
7Â months, 2Â weeks ago
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
Episode 1373
🤗 Upvotes: 40 | cs.CV
Authors:
Harold Haodong Chen, Disen Lan, Wen-Jie Shu, Qingyang Liu, Zihan Wang, Sirui Chen…
7Â months, 2Â weeks ago