Podcast Episodes
Back to SearchMMSkills: Towards Multimodal Skills for General Visual Agents
Episode 1895
🤗 Upvotes: 104 | cs.AI
Authors:
Kangning Zhang, Shuai Shao, Qingyao Li, Jianghao Lin, Lingyue Fu, Shijian Wang, …
1Â month, 1Â week ago
DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo
Episode 1894
🤗 Upvotes: 47 | cs.RO
Authors:
Hanwen Wang, Weizhi Zhao, Xiangyu Wang, Siyuan Huang, He Lin, Boyuan Zheng, Rongt…
1Â month, 1Â week ago
Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding
Episode 1893
🤗 Upvotes: 34 | cs.AI
Authors:
Taewon Yun, Jisu Shin, Jeonghwan Choi, Seunghwan Bang, Hwanjun Song
T…
1Â month, 1Â week ago
InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation
Episode 1892
🤗 Upvotes: 31 | cs.CV
Authors:
Yang Yue, Fangyun Wei, Tianyu He, Jinjing Zhao, Zanlin Ni, Zeyu Liu, Jiayi Guo, L…
1Â month, 1Â week ago
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
Episode 1891
🤗 Upvotes: 29 | cs.CV
Authors:
Xiaoxuan He, Siming Fu, Zeyue Xue, Weijie Wang, Ruizhe He, Yuming Li, Dacheng Yin…
1Â month, 1Â week ago
Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR
Episode 1890
🤗 Upvotes: 28 | cs.AI, cs.CL
Authors:
Chanuk Lee, Sangwoo Park, Minki Kang, Sung Ju Hwang
Title:
…
1Â month, 1Â week ago
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
Episode 1889
🤗 Upvotes: 129 | cs.AI, cs.CL
Authors:
Yafu Li, Runzhe Zhan, Haoran Zhang, Shunkai Zhang, Yizhuo Li, Zhilin Wang…
1Â month, 2Â weeks ago
Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation
Episode 1888
🤗 Upvotes: 76 | cs.CV
Authors:
Min Zhao, Hongzhou Zhu, Kaiwen Zheng, Zihan Zhou, Bokai Yan, Xinyuan Li, Xiao Yan…
1Â month, 2Â weeks ago
Self-Distilled Agentic Reinforcement Learning
Episode 1887
🤗 Upvotes: 67 | cs.LG, cs.AI, cs.CL
Authors:
Zhengxi Lu, Zhiyuan Yao, Zhuowen Han, Zi-Han Wang, Jinyang Wu, Qi G…
1Â month, 2Â weeks ago
MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models
Episode 1886
🤗 Upvotes: 62 | cs.CV
Authors:
Xiyu Ren, Zhaowei Wang, Yiming Du, Zhongwei Xie, Chi Liu, Xinlin Yang, Haoyue Fen…
1Â month, 2Â weeks ago