Podcast Episodes
VisPlay: Self-Evolving Vision-Language Models from Images
Episode 1389
🤗 Upvotes: 31 | cs.CV, cs.AI, cs.CL, cs.LG
Authors:
Yicheng He, Chengsong Huang, Zongxia Li, Jiaxin Huang, Yongh…
3 months, 3 weeks ago
Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset
Episode 1388
🤗 Upvotes: 23 | cs.CV
Authors:
Geon Choi, Hangyul Yoon, Hyunju Shin, Hyunki Park, Sang Hoon Seo, Eunho Yang, Edw…
3 months, 3 weeks ago
VIDEOP2R: Video Understanding from Perception to Reasoning
Episode 1387
🤗 Upvotes: 70 | cs.CV, cs.AI, cs.LG
Authors:
Yifan Jiang, Yueying Wang, Rui Zhao, Toufiq Parag, Zhimin Chen, Zhe…
3 months, 3 weeks ago
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
Episode 1386
🤗 Upvotes: 66 | cs.CL, cs.AI, cs.LG, cs.PF
Authors:
Tianyu Fu, Yichen You, Zekai Chen, Guohao Dai, Huazhong Yang…
3 months, 3 weeks ago
AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models
Episode 1385
🤗 Upvotes: 58 | cs.CL, cs.AI, cs.LG
Authors:
Mohammad Zbib, Hasan Abed Al Kader Hammoud, Sina Mukalled, Nadine R…
3 months, 3 weeks ago
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space
Episode 1384
🤗 Upvotes: 41 | cs.CV, cs.AI
Authors:
Huijie Liu, Shuhao Cui, Haoxiang Cao, Shuai Ma, Kai Wu, Guoliang Kang
3 months, 3 weeks ago
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark
Episode 1383
🤗 Upvotes: 32 | cs.CV
Authors:
Xinxin Liu, Zhaopan Xu, Kai Wang, Yong Jae Lee, Yuzhang Shang
3 months, 3 weeks ago
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
Episode 1382
🤗 Upvotes: 24 | cs.CV
Authors:
Huiyi Chen, Jiawei Peng, Dehai Min, Changchang Sun, Kaijie Chen, Yan Yan, Xu Yang…
3 months, 3 weeks ago
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
Episode 1381
🤗 Upvotes: 22 | cs.CV
Authors:
Jiaze Li, Hao Yin, Wenhui Tan, Jingyang Chen, Boshen Xu, Yuxun Qu, Yijing Chen, J…
3 months, 3 weeks ago
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data
Episode 1380
🤗 Upvotes: 87 | cs.CL, cs.AI, cs.CV
Authors:
Yunxin Li, Xinyu Chen, Shenyuan Jiang, Haoyuan Shi, Zhenyu Liu, Xua…
4 months ago