Podcast Episodes

Back to Search

AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models

Episode 1385

🤗 Upvotes: 58 | cs.CL, cs.AI, cs.LG

Authors:
Mohammad Zbib, Hasan Abed Al Kader Hammoud, Sina Mukalled, Nadine R…

5 months, 2 weeks ago

Short Long

View Episode

A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space

Episode 1384

🤗 Upvotes: 41 | cs.CV, cs.AI

Authors:
Huijie Liu, Shuhao Cui, Haoxiang Cao, Shuai Ma, Kai Wu, Guoliang Kang

…

5 months, 2 weeks ago

Short Long

View Episode

Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark

Episode 1383

🤗 Upvotes: 32 | cs.CV

Authors:
Xinxin Liu, Zhaopan Xu, Kai Wang, Yong Jae Lee, Yuzhang Shang

Title:
…

5 months, 2 weeks ago

Short Long

View Episode

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

Episode 1382

🤗 Upvotes: 24 | cs.CV

Authors:
Huiyi Chen, Jiawei Peng, Dehai Min, Changchang Sun, Kaijie Chen, Yan Yan, Xu Yang…

5 months, 2 weeks ago

Short Long

View Episode

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Episode 1381

🤗 Upvotes: 22 | cs.CV

Authors:
Jiaze Li, Hao Yin, Wenhui Tan, Jingyang Chen, Boshen Xu, Yuxun Qu, Yijing Chen, J…

5 months, 2 weeks ago

Short Long

View Episode

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Episode 1380

🤗 Upvotes: 87 | cs.CL, cs.AI, cs.CV

Authors:
Yunxin Li, Xinyu Chen, Shenyuan Jiang, Haoyuan Shi, Zhenyu Liu, Xua…

5 months, 2 weeks ago

Short Long

View Episode

P1: Mastering Physics Olympiads with Reinforcement Learning

Episode 1379

🤗 Upvotes: 107 | cs.LG, cs.AI, cs.CL

Authors:
Jiacheng Chen, Qianjia Cheng, Fangchen Yu, Haiyuan Wan, Yuchen Zha…

5 months, 2 weeks ago

Short Long

View Episode

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Episode 1378

🤗 Upvotes: 104 | cs.CL

Authors:
MiroMind Team, Song Bai, Lidong Bing, Carson Chen, Guanzheng Chen, Yuntao Chen, …

5 months, 2 weeks ago

Short Long

View Episode

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Episode 1377

🤗 Upvotes: 75 | cs.CL

Authors:
Shalini Maiti, Amar Budhiraja, Bhavul Gauri, Gaurav Chaurasia, Anton Protopopov, …

5 months, 2 weeks ago

Short Long

View Episode

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

Episode 1376

🤗 Upvotes: 63 | cs.CV

Authors:
Chunshi Wang, Junliang Ye, Yunhan Yang, Yang Li, Zizhuo Lin, Jun Zhu, Zhuo Chen, …

5 months, 2 weeks ago

Short Long

View Episode

Podcast Episodes

AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models

A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space

Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

P1: Mastering Physics Olympiads with Reinforcement Learning

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

Love PodBriefly?