Podcast Episodes

Back to Search
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts

Episode 1145

🤗 Upvotes: 23 | cs.CV, cs.RO

Authors:
Weipeng Zhong, Peizhou Cao, Yichen Jin, Li Luo, Wenzhe Cai, Jingli Lin, Ha…

6 months ago

Short Long
View Episode
IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Episode 1144

🤗 Upvotes: 22 | cs.CL

Authors:
Xingwei Tan, Mahathi Parvatham, Chiara Gambi, Gabriele Pergola

Title:…

6 months ago

Short Long
View Episode
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Episode 1143

🤗 Upvotes: 21 | cs.AI

Authors:
Akshit Sinha, Arvindh Arun, Shashwat Goel, Steffen Staab, Jonas Geiping

…

6 months ago

Short Long
View Episode
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Episode 1142

🤗 Upvotes: 114 | cs.RO

Authors:
Yihao Wang, Pengxiang Ding, Lingxiao Li, Can Cui, Zirui Ge, Xinyang Tong, Wenxua…

6 months, 1 week ago

Short Long
View Episode
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Episode 1141

🤗 Upvotes: 88 | cs.CV, cs.MM

Authors:
Liyang Chen, Tianxiang Ma, Jiawei Liu, Bingchuan Li, Zhuowei Chen, Lijie L…

6 months, 1 week ago

Short Long
View Episode
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Episode 1140

🤗 Upvotes: 57 | cs.RO, cs.AI, cs.CL, cs.LG

Authors:
Haozhan Li, Yuxin Zuo, Jiale Yu, Yuhao Zhang, Zhaohui Yang, …

6 months, 1 week ago

Short Long
View Episode
EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Episode 1139

🤗 Upvotes: 52 | cs.CL, cs.AI, cs.SD

Authors:
Yuhao Zhang, Yuhao Du, Zhanchen Dai, Xiangnan Ma, Kaiqi Kou, Benyou…

6 months, 1 week ago

Short Long
View Episode
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Episode 1138

🤗 Upvotes: 34 | cs.LG, cs.CL

Authors:
Jiawei Wang, Jiacai Liu, Yuqian Fu, Yingru Li, Xintao Wang, Yuan Lin, Yu Y…

6 months, 1 week ago

Short Long
View Episode
Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Episode 1137

🤗 Upvotes: 34 | cs.CV

Authors:
Yikang Ding, Jiwen Liu, Wenyuan Zhang, Zekun Wang, Wentao Hu, Liyuan Cui, Mingmin…

6 months, 1 week ago

Short Long
View Episode
FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Episode 1136

🤗 Upvotes: 28 | cs.CV, cs.CL

Authors:
Rongyao Fang, Aldrich Yu, Chengqi Duan, Linjiang Huang, Shuai Bai, Yuxuan …

6 months, 1 week ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us