Podcast Episodes
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
Episode 639
🤗 Upvotes: 24 | cs.CV, cs.AI
Authors:
Yuxuan Luo, Zhengkun Rong, Lizhen Wang, Longhao Zhang, Tianshu Hu, Yongmin…
11 months, 2 weeks ago
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Episode 638
🤗 Upvotes: 22 | cs.CV
Authors:
Hanyang Wang, Fangfu Liu, Jiawei Chi, Yueqi Duan
Title:
V…
11 months, 2 weeks ago
START: Self-taught Reasoner with Tools
Episode 637
🤗 Upvotes: 49 | cs.CL
Authors:
Chengpeng Li, Mingfeng Xue, Zhenru Zhang, Jiaxi Yang, Beichen Zhang, Xiang Wang, …
1 year ago
Token-Efficient Long Video Understanding for Multimodal LLMs
Episode 636
🤗 Upvotes: 41 | cs.CV
Authors:
Jindong Jiang, Xiuyu Li, Zhijian Liu, Muyang Li, Guo Chen, Zhiqi Li, De-An Huang,…
1 year ago
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Episode 635
🤗 Upvotes: 33 | cs.CL
Authors:
Sambal Shikhar, Mohammed Irfan Kurpath, Sahal Shaji Mullappilly, Jean Lahoud, Fah…
1 year ago
EgoLife: Towards Egocentric Life Assistant
Episode 634
🤗 Upvotes: 21 | cs.CV
Authors:
Jingkang Yang, Shuai Liu, Hongming Guo, Yuhao Dong, Xiamengwei Zhang, Sicheng Zha…
1 year ago
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers
Episode 633
🤗 Upvotes: 42 | cs.CL, cs.AI
Authors:
Yiran Zhao, Chaoqun Liu, Yue Deng, Jiahao Ying, Mahani Aljunied, Zhaodongh…
1 year ago
HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs
Episode 632
🤗 Upvotes: 27 | cs.CL, cs.HC
Authors:
Tin Nguyen, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen
1 year ago
Process-based Self-Rewarding Language Models
Episode 631
🤗 Upvotes: 27 | cs.CL, cs.AI
Authors:
Shimao Zhang, Xiao Liu, Xin Zhang, Junxiao Liu, Zheheng Luo, Shujian Huang…
1 year ago
Visual-RFT: Visual Reinforcement Fine-Tuning
Episode 630
🤗 Upvotes: 44 | cs.CV
Authors:
Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin…
1 year ago