Podcast Episodes
Back to SearchInternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts
Episode 1145
🤗 Upvotes: 23 | cs.CV, cs.RO
Authors:
Weipeng Zhong, Peizhou Cao, Yichen Jin, Li Luo, Wenzhe Cai, Jingli Lin, Ha…
6Â months ago
IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Episode 1144
🤗 Upvotes: 22 | cs.CL
Authors:
Xingwei Tan, Mahathi Parvatham, Chiara Gambi, Gabriele Pergola
Title:…
6Â months ago
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
Episode 1143
🤗 Upvotes: 21 | cs.AI
Authors:
Akshit Sinha, Arvindh Arun, Shashwat Goel, Steffen Staab, Jonas Geiping
6Â months ago
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
Episode 1142
🤗 Upvotes: 114 | cs.RO
Authors:
Yihao Wang, Pengxiang Ding, Lingxiao Li, Can Cui, Zirui Ge, Xinyang Tong, Wenxua…
6Â months, 1Â week ago
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
Episode 1141
🤗 Upvotes: 88 | cs.CV, cs.MM
Authors:
Liyang Chen, Tianxiang Ma, Jiawei Liu, Bingchuan Li, Zhuowei Chen, Lijie L…
6Â months, 1Â week ago
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Episode 1140
🤗 Upvotes: 57 | cs.RO, cs.AI, cs.CL, cs.LG
Authors:
Haozhan Li, Yuxin Zuo, Jiale Yu, Yuhao Zhang, Zhaohui Yang, …
6Â months, 1Â week ago
EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs
Episode 1139
🤗 Upvotes: 52 | cs.CL, cs.AI, cs.SD
Authors:
Yuhao Zhang, Yuhao Du, Zhanchen Dai, Xiangnan Ma, Kaiqi Kou, Benyou…
6Â months, 1Â week ago
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents
Episode 1138
🤗 Upvotes: 34 | cs.LG, cs.CL
Authors:
Jiawei Wang, Jiacai Liu, Yuqian Fu, Yingru Li, Xintao Wang, Yuan Lin, Yu Y…
6Â months, 1Â week ago
Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis
Episode 1137
🤗 Upvotes: 34 | cs.CV
Authors:
Yikang Ding, Jiwen Liu, Wenyuan Zhang, Zekun Wang, Wentao Hu, Liyuan Cui, Mingmin…
6Â months, 1Â week ago
FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark
Episode 1136
🤗 Upvotes: 28 | cs.CV, cs.CL
Authors:
Rongyao Fang, Aldrich Yu, Chengqi Duan, Linjiang Huang, Shuai Bai, Yuxuan …
6Â months, 1Â week ago