Podcast Episodes

Back to Search

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Episode 1582

🤗 Upvotes: 131 | cs.CV, cs.AI, cs.CL

Authors:
Yuxiang Ji, Yong Wang, Ziyu Ma, Yiming Hu, Hailang Huang, Xuecai H…

5 months, 3 weeks ago

Short Long

View Episode

MMFormalizer: Multimodal Autoformalization in the Wild

Episode 1581

🤗 Upvotes: 94 | cs.CL

Authors:
Jing Xiong, Qi Han, Yunta Hsieh, Hui Shen, Huajian Xin, Chaofan Tao, Chenyang Zha…

5 months, 3 weeks ago

Short Long

View Episode

CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature

Episode 1580

🤗 Upvotes: 45 | cs.GR, cs.AI, cs.LG

Authors:
Eldad Matmon, Amit Bracha, Noam Rotstein, Ron Kimmel

Ti…

5 months, 3 weeks ago

Short Long

View Episode

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Episode 1579

🤗 Upvotes: 38 | cs.CL, cs.AI

Authors:
Qiguang Chen, Yantao Du, Ziniu Li, Jinhao Liu, Songyao Duan, Jiarui Guo, M…

5 months, 3 weeks ago

Short Long

View Episode

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Episode 1578

🤗 Upvotes: 30 | cs.CL

Authors:
Jiajie Zhang, Xin Lv, Ling Feng, Lei Hou, Juanzi Li

Title:
…

5 months, 3 weeks ago

Short Long

View Episode

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Episode 1577

🤗 Upvotes: 25 | cs.CL, cs.AI, cs.LG

Authors:
Xiaoshuai Song, Haofei Chang, Guanting Dong, Yutao Zhu, Zhicheng Do…

5 months, 3 weeks ago

Short Long

View Episode

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Episode 1576

🤗 Upvotes: 22 | cs.CL

Authors:
Mingxin Li, Yanzhao Zhang, Dingkun Long, Keqin Chen, Sibo Song, Shuai Bai, Zhibo …

5 months, 3 weeks ago

Short Long

View Episode

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Episode 1575

🤗 Upvotes: 98 | cs.CL, cs.AI, cs.LG

Authors:
Shih-Yang Liu, Xin Dong, Ximing Lu, Shizhe Diao, Peter Belcak, Ming…

5 months, 3 weeks ago

Short Long

View Episode

Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers

Episode 1574

🤗 Upvotes: 29 | cs.LG

Authors:
Maksim Velikanov, Ilyas Chahed, Jingwei Zuo, Dhia Eddine Rhaiem, Younes Belkada, …

5 months, 3 weeks ago

Short Long

View Episode

RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

Episode 1573

🤗 Upvotes: 26 | cs.CV

Authors:
Yuan-Kang Lee, Kuan-Lin Chen, Chia-Che Chang, Yu-Lun Liu

Title:
…

5 months, 3 weeks ago

Short Long

View Episode

Podcast Episodes

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

MMFormalizer: Multimodal Autoformalization in the Wild

CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers

RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

Love PodBriefly?