Podcast Episodes
Back to SearchMoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings
Episode 925
🤗 Upvotes: 30 | cs.CV, cs.AI, cs.CL
Authors:
Haonan Chen, Hong Liu, Yuping Luo, Liang Wang, Nan Yang, Furu Wei, …
8Â months, 2Â weeks ago
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation
Episode 924
🤗 Upvotes: 29 | cs.CV, cs.AI, cs.LG
Authors:
Xingyang Li, Muyang Li, Tianle Cai, Haocheng Xi, Shuo Yang, Yujun L…
8Â months, 2Â weeks ago
Ovis-U1 Technical Report
Episode 923
🤗 Upvotes: 51 | cs.CV, cs.AI
Authors:
Guo-Hua Wang, Shanshan Zhao, Xinjie Zhang, Liangfu Cao, Pengxin Zhan, Lunh…
8Â months, 2Â weeks ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Episode 922
🤗 Upvotes: 27 | cs.AI, cs.CL, cs.LG
Authors:
Bo Liu, Leon Guertler, Simon Yu, Zichen Liu, Penghui Qi, Daniel Bal…
8Â months, 2Â weeks ago
VMoBA: Mixture-of-Block Attention for Video Diffusion Models
Episode 921
🤗 Upvotes: 26 | cs.CV
Authors:
Jianzong Wu, Liang Hou, Haotian Yang, Xin Tao, Ye Tian, Pengfei Wan, Di Zhang, Yu…
8Â months, 2Â weeks ago
Calligrapher: Freestyle Text Image Customization
Episode 920
🤗 Upvotes: 24 | cs.CV
Authors:
Yue Ma, Qingyan Bai, Hao Ouyang, Ka Leong Cheng, Qiuyu Wang, Hongyu Liu, Zichen L…
8Â months, 2Â weeks ago
BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing
Episode 919
🤗 Upvotes: 46 | cs.GR, cs.CV
Authors:
Jiacheng Chen, Ramin Mehran, Xuhui Jia, Saining Xie, Sanghyun Woo
8Â months, 3Â weeks ago
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
Episode 918
🤗 Upvotes: 30 | cs.CV, cs.AI, cs.HC, cs.MM
Authors:
Boyuan Sun, Jiaxing Zhao, Xihan Wei, Qibin Hou
T…
8Â months, 3Â weeks ago
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation
Episode 917
🤗 Upvotes: 25 | cs.CV
Authors:
Bowen Chen, Mengyi Zhao, Haomiao Sun, Li Chen, Xu Wang, Kang Du, Xinglong Wu
8Â months, 3Â weeks ago
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback
Episode 916
🤗 Upvotes: 39 | cs.CL
Authors:
Dongwei Jiang, Alvin Zhang, Andrew Wang, Nicholas Andrews, Daniel Khashabi
9Â months ago