Podcast Episodes

Back to Search
VisPlay: Self-Evolving Vision-Language Models from Images

Episode 1389

🤗 Upvotes: 31 | cs.CV, cs.AI, cs.CL, cs.LG

Authors:
Yicheng He, Chengsong Huang, Zongxia Li, Jiaxin Huang, Yongh…

3 months, 3 weeks ago

Short Long
View Episode
Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset

Episode 1388

🤗 Upvotes: 23 | cs.CV

Authors:
Geon Choi, Hangyul Yoon, Hyunju Shin, Hyunki Park, Sang Hoon Seo, Eunho Yang, Edw…

3 months, 3 weeks ago

Short Long
View Episode
VIDEOP2R: Video Understanding from Perception to Reasoning

Episode 1387

🤗 Upvotes: 70 | cs.CV, cs.AI, cs.LG

Authors:
Yifan Jiang, Yueying Wang, Rui Zhao, Toufiq Parag, Zhimin Chen, Zhe…

3 months, 3 weeks ago

Short Long
View Episode
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Episode 1386

🤗 Upvotes: 66 | cs.CL, cs.AI, cs.LG, cs.PF

Authors:
Tianyu Fu, Yichen You, Zekai Chen, Guohao Dai, Huazhong Yang…

3 months, 3 weeks ago

Short Long
View Episode
AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models

Episode 1385

🤗 Upvotes: 58 | cs.CL, cs.AI, cs.LG

Authors:
Mohammad Zbib, Hasan Abed Al Kader Hammoud, Sina Mukalled, Nadine R…

3 months, 3 weeks ago

Short Long
View Episode
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space

Episode 1384

🤗 Upvotes: 41 | cs.CV, cs.AI

Authors:
Huijie Liu, Shuhao Cui, Haoxiang Cao, Shuai Ma, Kai Wu, Guoliang Kang

…

3 months, 3 weeks ago

Short Long
View Episode
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark

Episode 1383

🤗 Upvotes: 32 | cs.CV

Authors:
Xinxin Liu, Zhaopan Xu, Kai Wang, Yong Jae Lee, Yuzhang Shang

Title:
…

3 months, 3 weeks ago

Short Long
View Episode
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

Episode 1382

🤗 Upvotes: 24 | cs.CV

Authors:
Huiyi Chen, Jiawei Peng, Dehai Min, Changchang Sun, Kaijie Chen, Yan Yan, Xu Yang…

3 months, 3 weeks ago

Short Long
View Episode
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Episode 1381

🤗 Upvotes: 22 | cs.CV

Authors:
Jiaze Li, Hao Yin, Wenhui Tan, Jingyang Chen, Boshen Xu, Yuxun Qu, Yijing Chen, J…

3 months, 3 weeks ago

Short Long
View Episode
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Episode 1380

🤗 Upvotes: 87 | cs.CL, cs.AI, cs.CV

Authors:
Yunxin Li, Xinyu Chen, Shenyuan Jiang, Haoyuan Shi, Zhenyu Liu, Xua…

4 months ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us