Podcast Episodes
Back to SearchThinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization
Episode 1582
🤗 Upvotes: 131 | cs.CV, cs.AI, cs.CL
Authors:
Yuxiang Ji, Yong Wang, Ziyu Ma, Yiming Hu, Hailang Huang, Xuecai H…
3Â months, 3Â weeks ago
MMFormalizer: Multimodal Autoformalization in the Wild
Episode 1581
🤗 Upvotes: 94 | cs.CL
Authors:
Jing Xiong, Qi Han, Yunta Hsieh, Hui Shen, Huajian Xin, Chaofan Tao, Chenyang Zha…
3Â months, 3Â weeks ago
CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature
Episode 1580
🤗 Upvotes: 45 | cs.GR, cs.AI, cs.LG
Authors:
Eldad Matmon, Amit Bracha, Noam Rotstein, Ron Kimmel
Ti…
3Â months, 3Â weeks ago
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
Episode 1579
🤗 Upvotes: 38 | cs.CL, cs.AI
Authors:
Qiguang Chen, Yantao Du, Ziniu Li, Jinhao Liu, Songyao Duan, Jiarui Guo, M…
3Â months, 3Â weeks ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
Episode 1578
🤗 Upvotes: 30 | cs.CL
Authors:
Jiajie Zhang, Xin Lv, Ling Feng, Lei Hou, Juanzi Li
Title:
…
3Â months, 3Â weeks ago
EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis
Episode 1577
🤗 Upvotes: 25 | cs.CL, cs.AI, cs.LG
Authors:
Xiaoshuai Song, Haofei Chang, Guanting Dong, Yutao Zhu, Zhicheng Do…
3Â months, 3Â weeks ago
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
Episode 1576
🤗 Upvotes: 22 | cs.CL
Authors:
Mingxin Li, Yanzhao Zhang, Dingkun Long, Keqin Chen, Sibo Song, Shuai Bai, Zhibo …
3Â months, 3Â weeks ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Episode 1575
🤗 Upvotes: 98 | cs.CL, cs.AI, cs.LG
Authors:
Shih-Yang Liu, Xin Dong, Ximing Lu, Shizhe Diao, Peter Belcak, Ming…
3Â months, 4Â weeks ago
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
Episode 1574
🤗 Upvotes: 29 | cs.LG
Authors:
Maksim Velikanov, Ilyas Chahed, Jingwei Zuo, Dhia Eddine Rhaiem, Younes Belkada, …
3Â months, 4Â weeks ago
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes
Episode 1573
🤗 Upvotes: 26 | cs.CV
Authors:
Yuan-Kang Lee, Kuan-Lin Chen, Chia-Che Chang, Yu-Lun Liu
Title:
…
3Â months, 4Â weeks ago