Podcast Episodes
Back to SearchSpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning
Episode 1666
🤗 Upvotes: 39 | cs.CV
Authors:
Byungwoo Jeon, Dongyoung Kim, Huiwon Jang, Insoo Kim, Jinwoo Shin
Tit…
1Â month, 1Â week ago
F4Splat: Feed-Forward Predictive Densification for Feed-Forward 3D Gaussian Splatting
Episode 1665
🤗 Upvotes: 31 | cs.CV
Authors:
Injae Kim, Chaehyeon Kim, Minseong Bae, Minseok Joo, Hyunwoo J. Kim
T…
1Â month, 1Â week ago
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT
Episode 1664
🤗 Upvotes: 28 | cs.LG, cs.AI
Authors:
Woosung Koh, Jeyoung Jeon, Youngjin Song, Yujin Cheon, Soowon Oh, Jaehyeon…
1Â month, 1Â week ago
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning
Episode 1663
🤗 Upvotes: 96 | cs.CV, cs.AI, cs.CL
Authors:
Shenzhi Wang, Shixuan Liu, Jing Zhou, Chang Gao, Xiong-Hui Chen, Bi…
1Â month, 1Â week ago
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
Episode 1662
🤗 Upvotes: 87 | cs.CV
Authors:
Songchun Zhang, Zeyue Xue, Siming Fu, Jie Huang, Xianghao Kong, Y Ma, Haoyang Hua…
1Â month, 1Â week ago
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation
Episode 1661
🤗 Upvotes: 42 | cs.CV
Authors:
Yan Shu, Bin Ren, Zhitong Xiong, Xiao Xiang Zhu, Begüm Demir, Nicu Sebe, Paolo Ro…
1Â month, 1Â week ago
ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models
Episode 1660
🤗 Upvotes: 30 | cs.CV
Authors:
Thomas De Min, Subhankar Roy, Stéphane Lathuilière, Elisa Ricci, Massimiliano Man…
1Â month, 1Â week ago
FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow
Episode 1659
🤗 Upvotes: 26 | cs.CV
Authors:
Zhifei Yang, Guangyao Zhai, Keyang Lu, YuYang Yin, Chao Zhang, Zhen Xiao, Jieyi L…
1Â month, 1Â week ago
The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus
Episode 1658
🤗 Upvotes: 25 | cs.LG, cs.AI
Authors:
Amartya Roy, Rasul Tutunov, Xiaotong Ji, Matthieu Zimmer, Haitham Bou-Amma…
1Â month, 1Â week ago
LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
Episode 1657
🤗 Upvotes: 21 | cs.CV, cs.AI
Authors:
Jiazheng Xing, Fei Du, Hangjie Yuan, Pengwei Liu, Hongbin Xu, Hai Ci, Ruig…
1Â month, 1Â week ago