Podcast Episodes
Back to SearchUniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
Episode 1345
🤗 Upvotes: 27 | cs.CV
Authors:
Ropeway Liu, Hangjie Yuan, Bo Dong, Jiazheng Xing, Jinwang Wang, Rui Zhao, Yan Xi…
4Â months, 2Â weeks ago
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
Episode 1344
🤗 Upvotes: 24 | cs.CV
Authors:
Yongyuan Liang, Wei Chow, Feng Li, Ziqiao Ma, Xiyao Wang, Jiageng Mao, Jiuhai Che…
4Â months, 2Â weeks ago
PHUMA: Physically-Grounded Humanoid Locomotion Dataset
Episode 1343
🤗 Upvotes: 23 | cs.RO
Authors:
Kyungmin Lee, Sibeen Kim, Minho Park, Hyunseung Kim, Dongyoon Hwang, Hojoon Lee, …
4Â months, 2Â weeks ago
UniREditBench: A Unified Reasoning-based Image Editing Benchmark
Episode 1342
🤗 Upvotes: 22 | cs.CV
Authors:
Feng Han, Yibin Wang, Chenglin Li, Zheming Liang, Dianyi Wang, Yang Jiao, Zhipeng…
4Â months, 2Â weeks ago
World Simulation with Video Foundation Models for Physical AI
Episode 1341
🤗 Upvotes: 22 | cs.CV, cs.AI, cs.LG, cs.RO
Authors:
NVIDIA, :, Arslan Ali, Junjie Bai, Maciej Bala, Yogesh Balaj…
4Â months, 2Â weeks ago
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Episode 1340
🤗 Upvotes: 56 | cs.CV
Authors:
Jiawei Gu, Yunzhuo Hao, Huichen Will Wang, Linjie Li, Michael Qizhe Shieh, Yejin …
4Â months, 2Â weeks ago
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
Episode 1339
🤗 Upvotes: 49 | cs.LG, cs.AI
Authors:
Mengzhao Chen, Meng Wu, Hui Jin, Zhihang Yuan, Jing Liu, Chaoyi Zhang, Yun…
4Â months, 2Â weeks ago
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning
Episode 1338
🤗 Upvotes: 21 | cs.CV, cs.AI
Authors:
Yuhong Liu, Beichen Zhang, Yuhang Zang, Yuhang Cao, Long Xing, Xiaoyi Dong…
4Â months, 2Â weeks ago
The End of Manual Decoding: Towards Truly End-to-End Language Models
Episode 1337
🤗 Upvotes: 70 | cs.CL, cs.AI
Authors:
Zhichao Wang, Dongyang Ma, Xinting Huang, Deng Cai, Tian Lan, Jiahao Xu, H…
4Â months, 2Â weeks ago
Kimi Linear: An Expressive, Efficient Attention Architecture
Episode 1336
🤗 Upvotes: 40 | cs.CL, cs.LG
Authors:
Kimi Team, Yu Zhang, Zongyu Lin, Xingcheng Yao, Jiaxi Hu, Fanqing Meng, Ch…
4Â months, 2Â weeks ago