Podcast Episodes
Back to SearchOmni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models
Episode 1672
🤗 Upvotes: 110 | cs.CV
Authors:
Meiqi Wu, Zhixin Cai, Fufangchen Zhao, Xiaokun Feng, Rujing Dang, Bingze Song, R…
3Â months, 1Â week ago
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model
Episode 1671
🤗 Upvotes: 90 | cs.CV
Authors:
SII-GAIR, Sand. ai, :, Ethan Chern, Hansi Teng, Hanwen Sun, Hao Wang, Hong Pan, H…
3Â months, 1Â week ago
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
Episode 1670
🤗 Upvotes: 63 | cs.AI, cs.CL
Authors:
Jianing Wang, Jianfei Zhang, Qi Guo, Linsen Guo, Rumei Li, Chao Zhang, Cho…
3Â months, 1Â week ago
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
Episode 1669
🤗 Upvotes: 60 | cs.CV, cs.AI
Authors:
Nimrod Shabtay, Moshe Kimhi, Artem Spector, Sivan Haray, Ehud Rivlin, Chai…
3Â months, 1Â week ago
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Episode 1668
🤗 Upvotes: 55 | cs.IR, cs.AI, cs.CL
Authors:
Zhuofeng Li, Dongfu Jiang, Xueguang Ma, Haoxiang Zhang, Ping Nie, Y…
3Â months, 1Â week ago
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding
Episode 1667
🤗 Upvotes: 45 | cs.CV
Authors:
Ruoliu Yang, Chu Wu, Caifeng Shan, Ran He, Chaoyou Fu
Title:
…
3Â months, 1Â week ago
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning
Episode 1666
🤗 Upvotes: 39 | cs.CV
Authors:
Byungwoo Jeon, Dongyoung Kim, Huiwon Jang, Insoo Kim, Jinwoo Shin
Tit…
3Â months, 1Â week ago
F4Splat: Feed-Forward Predictive Densification for Feed-Forward 3D Gaussian Splatting
Episode 1665
🤗 Upvotes: 31 | cs.CV
Authors:
Injae Kim, Chaehyeon Kim, Minseong Bae, Minseok Joo, Hyunwoo J. Kim
T…
3Â months, 1Â week ago
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT
Episode 1664
🤗 Upvotes: 28 | cs.LG, cs.AI
Authors:
Woosung Koh, Jeyoung Jeon, Youngjin Song, Yujin Cheon, Soowon Oh, Jaehyeon…
3Â months, 1Â week ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning
Episode 1663
🤗 Upvotes: 96 | cs.CV, cs.AI, cs.CL
Authors:
Shenzhi Wang, Shixuan Liu, Jing Zhou, Chang Gao, Xiong-Hui Chen, Bi…
3Â months, 1Â week ago