Podcast Episodes

Back to Search

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

Episode 1382

🤗 Upvotes: 24 | cs.CV

Authors:
Huiyi Chen, Jiawei Peng, Dehai Min, Changchang Sun, Kaijie Chen, Yan Yan, Xu Yang…

7 months, 2 weeks ago

Short Long

View Episode

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Episode 1381

🤗 Upvotes: 22 | cs.CV

Authors:
Jiaze Li, Hao Yin, Wenhui Tan, Jingyang Chen, Boshen Xu, Yuxun Qu, Yijing Chen, J…

7 months, 2 weeks ago

Short Long

View Episode

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Episode 1380

🤗 Upvotes: 87 | cs.CL, cs.AI, cs.CV

Authors:
Yunxin Li, Xinyu Chen, Shenyuan Jiang, Haoyuan Shi, Zhenyu Liu, Xua…

7 months, 2 weeks ago

Short Long

View Episode

P1: Mastering Physics Olympiads with Reinforcement Learning

Episode 1379

🤗 Upvotes: 107 | cs.LG, cs.AI, cs.CL

Authors:
Jiacheng Chen, Qianjia Cheng, Fangchen Yu, Haiyuan Wan, Yuchen Zha…

7 months, 2 weeks ago

Short Long

View Episode

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Episode 1378

🤗 Upvotes: 104 | cs.CL

Authors:
MiroMind Team, Song Bai, Lidong Bing, Carson Chen, Guanzheng Chen, Yuntao Chen, …

7 months, 2 weeks ago

Short Long

View Episode

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Episode 1377

🤗 Upvotes: 75 | cs.CL

Authors:
Shalini Maiti, Amar Budhiraja, Bhavul Gauri, Gaurav Chaurasia, Anton Protopopov, …

7 months, 2 weeks ago

Short Long

View Episode

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

Episode 1376

🤗 Upvotes: 63 | cs.CV

Authors:
Chunshi Wang, Junliang Ye, Yunhan Yang, Yang Li, Zizhuo Lin, Jun Zhu, Zhuo Chen, …

7 months, 2 weeks ago

Short Long

View Episode

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Episode 1375

🤗 Upvotes: 47 | cs.CV

Authors:
Ye Tian, Ling Yang, Jiongfan Yang, Anran Wang, Yu Tian, Jiani Zheng, Haochen Wang…

7 months, 2 weeks ago

Short Long

View Episode

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Episode 1374

🤗 Upvotes: 46 | cs.IR, cs.AI, cs.LG

Authors:
Duolin Sun, Meixiu Long, Dan Yang, Yihan Jiao, Zhehao Tan, Jie Feng…

7 months, 2 weeks ago

Short Long

View Episode

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Episode 1373

🤗 Upvotes: 40 | cs.CV

Authors:
Harold Haodong Chen, Disen Lan, Wen-Jie Shu, Qingyang Liu, Zihan Wang, Sirui Chen…

7 months, 2 weeks ago

Short Long

View Episode

Podcast Episodes

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

P1: Mastering Physics Olympiads with Reinforcement Learning

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Love PodBriefly?