Podcast Episodes

Back to Search
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Episode 1813

🤗 Upvotes: 102 | cs.CV

Authors:
Weijie Wang, Xiaoxuan He, Youping Gu, Yifan Yang, Zeyu Zhang, Yefei He, Yanbo Di…

1 week ago

Short Long
View Episode
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Episode 1812

🤗 Upvotes: 100 | cs.AI

Authors:
Zhengxu Yu, Yu Fu, Zhiyuan He, Yuxuan Huang, Lee Ka Yiu, Meng Fang, Weilin Luo, …

1 week ago

Short Long
View Episode
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Episode 1811

🤗 Upvotes: 57 | cs.CV

Authors:
Yiming Zhang, Jiacheng Chen, Jiaqi Tan, Yongsen Mao, Wenhu Chen, Angel X. Chang

…

1 week ago

Short Long
View Episode
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Episode 1810

🤗 Upvotes: 47 | cs.CV

Authors:
Zhiheng Liu, Weiming Ren, Xiaoke Huang, Shoufa Chen, Tianhong Li, Mengzhao Chen, …

1 week ago

Short Long
View Episode
Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

Episode 1809

🤗 Upvotes: 42 | cs.RO

Authors:
Qi Li, Bo Yin, Weiqi Huang, Ruhao Liu, Bojun Zou, Runpeng Yu, Jingwen Ye, Weihao …

1 week ago

Short Long
View Episode
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Episode 1808

🤗 Upvotes: 27 | cs.CV, cs.SE

Authors:
Fanqing Meng, Lingxiao Du, Zijian Wu, Guanzheng Chen, Xiangyan Liu, Jiaqi …

1 week ago

Short Long
View Episode
SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Episode 1807

🤗 Upvotes: 22 | cs.CV, cs.AI

Authors:
Brandon Collins, Logan Bolton, Hung Huy Nguyen, Mohammad Reza Taesiri, Tru…

1 week ago

Short Long
View Episode
Video Analysis and Generation via a Semantic Progress Function

Episode 1806

🤗 Upvotes: 42 | cs.CV

Authors:
Gal Metzer, Sagi Polaczek, Ali Mahdavi-Amiri, Raja Giryes, Daniel Cohen-Or

…

1 week, 1 day ago

Short Long
View Episode
DiffNR: Diffusion-Enhanced Neural Representation Optimization for Sparse-View 3D Tomographic Reconstruction

Episode 1805

🤗 Upvotes: 27 | eess.IV, cs.CV

Authors:
Shiyan Su, Ruyi Zha, Danli Shi, Hongdong Li, Xuelian Cheng

T…

1 week, 1 day ago

Short Long
View Episode
LLM Safety From Within: Detecting Harmful Content with Internal Representations

Episode 1804

🤗 Upvotes: 21 | cs.AI

Authors:
Difan Jiao, Yilun Liu, Ye Yuan, Zhenwei Tang, Linfeng Du, Haolun Wu, Ashton Ander…

1 week, 1 day ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us