Podcast Episodes
Back to SearchGaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction
Episode 1542
🤗 Upvotes: 22 | cs.CV
Authors:
Yi-Chuan Huang, Hao-Jen Chien, Chin-Yang Lin, Ying-Huan Chen, Yu-Lun Liu
4Â months ago
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Episode 1541
🤗 Upvotes: 72 | cs.CL, cs.LG
Authors:
Ang Lv, Jin Ma, Yiyuan Ma, Siyuan Qiao
Title:
Coup…
4Â months, 1Â week ago
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
Episode 1540
🤗 Upvotes: 51 | cs.CV
Authors:
Ethan Chern, Zhulin Hu, Bohao Tang, Jiadi Su, Steffi Chern, Zhijie Deng, Pengfei …
4Â months, 1Â week ago
Yume-1.5: A Text-Controlled Interactive World Generation Model
Episode 1539
🤗 Upvotes: 50 | cs.CV
Authors:
Xiaofeng Mao, Zhen Li, Chuanhao Li, Xiaojie Xu, Kaining Ying, Tong He, Jiangmiao …
4Â months, 1Â week ago
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents
Episode 1538
🤗 Upvotes: 33 | cs.CL, cs.AI, cs.CV, cs.LG, cs.MA
Authors:
Shaofei Cai, Yulei Qin, Haojia Lin, Zihan Xu, Gang Li…
4Â months, 1Â week ago
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation
Episode 1537
🤗 Upvotes: 32 | cs.CV
Authors:
Shaocong Xu, Songlin Wei, Qizhe Wei, Zheng Geng, Hong Li, Licheng Shen, Qianpu Su…
4Â months, 1Â week ago
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion
Episode 1536
🤗 Upvotes: 30 | cs.CV
Authors:
Hau-Shiang Shiu, Chin-Yang Lin, Zhixiang Wang, Chi-Wei Hsiao, Po-Fan Yu, Yu-Chih …
4Â months, 1Â week ago
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Episode 1535
🤗 Upvotes: 28 | cs.CV, cs.CL
Authors:
Jiacheng Ye, Shansan Gong, Jiahui Gao, Junming Fan, Shuang Wu, Wei Bi, Hao…
4Â months, 1Â week ago
SpotEdit: Selective Region Editing in Diffusion Transformers
Episode 1534
🤗 Upvotes: 27 | cs.CV, cs.AI
Authors:
Zhibin Qin, Zhenxiong Tan, Zeqing Wang, Songhua Liu, Xinchao Wang
4Â months, 1Â week ago
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models
Episode 1533
🤗 Upvotes: 21 | cs.CV
Authors:
Bozhou Li, Sihan Yang, Yushuo Guan, Ruichuan An, Xinlong Chen, Yang Shi, Pengfei …
4Â months, 1Â week ago