Podcast Episodes
Back to SearchGenerative Refocusing: Flexible Defocus Control from a Single Image
Episode 1499
🤗 Upvotes: 26 | cs.CV
Authors:
Chun-Wei Tuan Mu, Jia-Bin Huang, Yu-Lun Liu
Title:
Genera…
2Â months, 3Â weeks ago
DeContext as Defense: Safe Image Editing in Diffusion Transformers
Episode 1498
🤗 Upvotes: 22 | cs.CV
Authors:
Linghui Shen, Mingyue Cui, Xingyi Yang
Title:
DeContext a…
2Â months, 3Â weeks ago
Step-GUI Technical Report
Episode 1497
🤗 Upvotes: 87 | cs.CV
Authors:
Haolong Yan, Jia Wang, Xin Huang, Yeqing Shen, Ziyang Meng, Zhimin Fan, Kaijun Ta…
3Â months ago
DEER: Draft with Diffusion, Verify with Autoregressive Models
Episode 1496
🤗 Upvotes: 39 | cs.LG, cs.AI
Authors:
Zicong Cheng, Guo-Wei Yang, Jia Li, Zhijie Deng, Meng-Hao Guo, Shi-Min Hu
3Â months ago
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing
Episode 1495
🤗 Upvotes: 36 | cs.CL
Authors:
Lanxiang Hu, Siqi Kou, Yichao Fu, Samyam Rajbhandari, Tajana Rosing, Yuxiong He, …
3Â months ago
HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices
Episode 1494
🤗 Upvotes: 31 | cs.CV, cs.CL
Authors:
HyperAI Team, Yuchen Liu, Kaiyang Han, Zhiqiang Xia, Yuhang Dong, Chen Son…
3Â months ago
Puzzle Curriculum GRPO for Vision-Centric Reasoning
Episode 1493
🤗 Upvotes: 30 | cs.CV
Authors:
Ahmadreza Jeddi, Hakki Can Karaimer, Hue Nguyen, Zhongling Wang, Ke Zhao, Javad R…
3Â months ago
MMGR: Multi-Modal Generative Reasoning
Episode 1492
🤗 Upvotes: 82 | cs.CL, cs.CV
Authors:
Zefan Cai, Haoyi Qiu, Tianyi Ma, Haozhe Zhao, Gengze Zhou, Kung-Hsiang Hua…
3Â months ago
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?
Episode 1491
🤗 Upvotes: 53 | cs.CV
Authors:
Jiaqi Wang, Weijia Wu, Yi Zhan, Rui Zhao, Ming Hu, James Cheng, Wei Liu, Philip T…
3Â months ago
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
Episode 1490
🤗 Upvotes: 49 | cs.CV, cs.GR
Authors:
Wenqiang Sun, Haiyu Zhang, Haoyuan Wang, Junta Wu, Zehan Wang, Zhenwei Wan…
3Â months ago