Podcast Episodes
Back to SearchWorld-R1: Reinforcing 3D Constraints for Text-to-Video Generation
Episode 1813
🤗 Upvotes: 102 | cs.CV
Authors:
Weijie Wang, Xiaoxuan He, Youping Gu, Yifan Yang, Zeyu Zhang, Yefei He, Yanbo Di…
1Â week ago
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company
Episode 1812
🤗 Upvotes: 100 | cs.AI
Authors:
Zhengxu Yu, Yu Fu, Zhiyuan He, Yuxuan Huang, Lee Ka Yiu, Meng Fang, Weilin Luo, …
1Â week ago
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning
Episode 1811
🤗 Upvotes: 57 | cs.CV
Authors:
Yiming Zhang, Jiacheng Chen, Jiaqi Tan, Yongsen Mao, Wenhu Chen, Angel X. Chang
1Â week ago
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
Episode 1810
🤗 Upvotes: 47 | cs.CV
Authors:
Zhiheng Liu, Weiming Ren, Xiaoke Huang, Shoufa Chen, Tianhong Li, Mengzhao Chen, …
1Â week ago
Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms
Episode 1809
🤗 Upvotes: 42 | cs.RO
Authors:
Qi Li, Bo Yin, Weiqi Huang, Ruhao Liu, Bojun Zou, Runpeng Yu, Jingwen Ye, Weihao …
1Â week ago
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents
Episode 1808
🤗 Upvotes: 27 | cs.CV, cs.SE
Authors:
Fanqing Meng, Lingxiao Du, Zijian Wu, Guanzheng Chen, Xiangyan Liu, Jiaqi …
1Â week ago
SketchVLM: Vision language models can annotate images to explain thoughts and guide users
Episode 1807
🤗 Upvotes: 22 | cs.CV, cs.AI
Authors:
Brandon Collins, Logan Bolton, Hung Huy Nguyen, Mohammad Reza Taesiri, Tru…
1Â week ago
Video Analysis and Generation via a Semantic Progress Function
Episode 1806
🤗 Upvotes: 42 | cs.CV
Authors:
Gal Metzer, Sagi Polaczek, Ali Mahdavi-Amiri, Raja Giryes, Daniel Cohen-Or
1Â week, 1Â day ago
DiffNR: Diffusion-Enhanced Neural Representation Optimization for Sparse-View 3D Tomographic Reconstruction
Episode 1805
🤗 Upvotes: 27 | eess.IV, cs.CV
Authors:
Shiyan Su, Ruyi Zha, Danli Shi, Hongdong Li, Xuelian Cheng
T…
1Â week, 1Â day ago
LLM Safety From Within: Detecting Harmful Content with Internal Representations
Episode 1804
🤗 Upvotes: 21 | cs.AI
Authors:
Difan Jiao, Yilun Liu, Ye Yuan, Zhenwei Tang, Linfeng Du, Haolun Wu, Ashton Ander…
1Â week, 1Â day ago