Podcast Episodes
Back to SearchSTEP3-VL-10B Technical Report
Episode 1609
🤗 Upvotes: 130 | cs.CV
Authors:
Ailin Huang, Chengyuan Yao, Chunrui Han, Fanqi Wan, Hangyu Guo, Haoran Lv, Hongy…
2Â months ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
Episode 1608
🤗 Upvotes: 111 | cs.LG, cs.CL
Authors:
Zhiyuan Hu, Yucheng Wang, Yufei He, Jiaying Wu, Yilun Zhao, See-Kiong Ng,…
2Â months ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
Episode 1607
🤗 Upvotes: 64 | cs.AI, cs.CL
Authors:
Zhiyuan Hu, Yunhai Hu, Juncheng Liu, Shuyue Stella Li, Yucheng Wang, Zhen …
2Â months ago
Controlled Self-Evolution for Algorithmic Code Optimization
Episode 1606
🤗 Upvotes: 97 | cs.CL, cs.AI, cs.NE
Authors:
Tu Hu, Ronghao Chen, Shuo Zhang, Jianghao Yin, Mou Xiao Feng, Jingp…
2Â months ago
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation
Episode 1605
🤗 Upvotes: 92 | cs.CL
Authors:
Yibo Wang, Lei Wang, Yue Deng, Keming Wu, Yao Xiao, Huanjin Yao, Liwei Kang, Hai …
2Â months ago
MAXS: Meta-Adaptive Exploration with LLM Agents
Episode 1604
🤗 Upvotes: 82 | cs.AI
Authors:
Jian Zhang, Zhiyuan Wang, Zhangqi Wang, Yu He, Haoran Luo, li yuan, Lingling Zhan…
2Â months ago
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning
Episode 1603
🤗 Upvotes: 47 | cs.LG, cs.CL
Authors:
Shaotian Yan, Kaiyuan Liu, Chen Shen, Bing Wang, Sinan Fan, Jun Zhang, Yue…
2Â months ago
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
Episode 1602
🤗 Upvotes: 41 | cs.CV, cs.AI, cs.LG, cs.RO
Authors:
Chi-Pin Huang, Yunze Man, Zhiding Yu, Min-Hung Chen, Jan Kau…
2Â months ago
SkinFlow: Efficient Information Transmission for Open Dermatological Diagnosis via Dynamic Visual Encoding and Staged RL
Episode 1601
🤗 Upvotes: 36 | cs.CV, cs.AI
Authors:
Lijun Liu, Linwei Chen, Zhishou Zhang, Meng Tian, Hengfu Cui, Ruiyang Li, …
2Â months ago
OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG
Episode 1600
🤗 Upvotes: 26 | cs.CL, cs.AI, cs.IR
Authors:
Fengran Mo, Zhan Su, Yuchen Hui, Jinghan Zhang, Jia Ao Sun, Zheyuan…
2Â months ago