Podcast Episodes

CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs

Episode 865

🤗 Upvotes: 43 | cs.CV, cs.AI

Authors:
Ai Jian, Weijie Qiu, Xiaokun Wang, Peiyu Wang, Yunzhuo Hao, Jiangbo Pei, Y…

9 months, 2 weeks ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Episode 864

🤗 Upvotes: 29 | cs.CL, cs.AI, cs.CV

Authors:
Qianhui Wu, Kanzhi Cheng, Rui Yang, Chaoyun Zhang, Jianwei Yang, Hu…

9 months, 2 weeks ago

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Episode 863

🤗 Upvotes: 29 | cs.CV, cs.RO

Authors:
Gen Luo, Ganlin Yang, Ziyang Gong, Guanzhou Chen, Haonan Duan, Erfei Cui, …

9 months, 2 weeks ago

OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation

Episode 862

🤗 Upvotes: 28 | cs.AI

Authors:
Shengjia Zhang, Junjie Wu, Jiawei Chen, Changwang Zhang, Xingyu Lou, Wangchunshu …

9 months, 2 weeks ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Episode 861

🤗 Upvotes: 99 | cs.CL, cs.AI, cs.LG

Authors:
Shenzhi Wang, Le Yu, Chang Gao, Chujie Zheng, Shixuan Liu, Rui Lu, …

9 months, 2 weeks ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Episode 860

🤗 Upvotes: 52 | cs.LG, cs.AI, cs.CL

Authors:
Zafir Stojanovski, Oliver Stanley, Joe Sharratt, Richard Jones, Abd…

9 months, 2 weeks ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Episode 859

🤗 Upvotes: 48 | cs.LG, cs.RO

Authors:
Mustafa Shukor, Dana Aubakirova, Francesco Capuano, Pepijn Kooijmans, Stev…

9 months, 2 weeks ago

Taming LLMs by Scaling Learning Rates with Gradient Grouping

Episode 858

🤗 Upvotes: 33 | cs.LG, cs.AI

Authors:
Siyuan Li, Juanxi Tian, Zedong Wang, Xin Jin, Zicheng Liu, Wentao Zhang, D…

9 months, 2 weeks ago

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Episode 857

🤗 Upvotes: 26 | cs.CL

Authors:
Ruihan Yang, Yikai Zhang, Aili Chen, Xintao Wang, Siyu Yuan, Jiangjie Chen, Deqin…

9 months, 2 weeks ago

Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models

Episode 856

🤗 Upvotes: 24 | cs.CV

Authors:
Kinam Kim, Junha Hyung, Jaegul Choo

9 months, 2 weeks ago

