Podcast Episodes
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs
Episode 865
🤗 Upvotes: 43 | cs.CV, cs.AI
Authors:
Ai Jian, Weijie Qiu, Xiaokun Wang, Peiyu Wang, Yunzhuo Hao, Jiangbo Pei, Y…
9 months, 2 weeks ago
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Episode 864
🤗 Upvotes: 29 | cs.CL, cs.AI, cs.CV
Authors:
Qianhui Wu, Kanzhi Cheng, Rui Yang, Chaoyun Zhang, Jianwei Yang, Hu…
9 months, 2 weeks ago
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
Episode 863
🤗 Upvotes: 29 | cs.CV, cs.RO
Authors:
Gen Luo, Ganlin Yang, Ziyang Gong, Guanzhou Chen, Haonan Duan, Erfei Cui, …
9 months, 2 weeks ago
OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation
Episode 862
🤗 Upvotes: 28 | cs.AI
Authors:
Shengjia Zhang, Junjie Wu, Jiawei Chen, Changwang Zhang, Xingyu Lou, Wangchunshu …
9 months, 2 weeks ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Episode 861
🤗 Upvotes: 99 | cs.CL, cs.AI, cs.LG
Authors:
Shenzhi Wang, Le Yu, Chang Gao, Chujie Zheng, Shixuan Liu, Rui Lu, …
9 months, 2 weeks ago
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards
Episode 860
🤗 Upvotes: 52 | cs.LG, cs.AI, cs.CL
Authors:
Zafir Stojanovski, Oliver Stanley, Joe Sharratt, Richard Jones, Abd…
9 months, 2 weeks ago
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Episode 859
🤗 Upvotes: 48 | cs.LG, cs.RO
Authors:
Mustafa Shukor, Dana Aubakirova, Francesco Capuano, Pepijn Kooijmans, Stev…
9 months, 2 weeks ago
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Episode 858
🤗 Upvotes: 33 | cs.LG, cs.AI
Authors:
Siyuan Li, Juanxi Tian, Zedong Wang, Xin Jin, Zicheng Liu, Wentao Zhang, D…
9 months, 2 weeks ago
ARIA: Training Language Agents with Intention-Driven Reward Aggregation
Episode 857
🤗 Upvotes: 26 | cs.CL
Authors:
Ruihan Yang, Yikai Zhang, Aili Chen, Xintao Wang, Siyu Yuan, Jiangjie Chen, Deqin…
9 months, 2 weeks ago
Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models
Episode 856
🤗 Upvotes: 24 | cs.CV
Authors:
Kinam Kim, Junha Hyung, Jaegul Choo
9 months, 2 weeks ago