Podcast Episodes
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Episode 801
🤗 Upvotes: 49 | cs.CL, cs.AI
Authors:
Minki Kang, Jongwon Jeong, Seanie Lee, Jaewoong Cho, Sung Ju Hwang
9 months, 3 weeks ago
QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization
Episode 800
🤗 Upvotes: 39 | cs.CL
Authors:
Weizhou Shen, Chenliang Li, Fanqi Wan, Shengyi Liao, Shaopeng Lai, Bo Zhang, Ying…
9 months, 3 weeks ago
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
Episode 799
🤗 Upvotes: 38 | cs.AI
Authors:
Hui Shen, Taiqiang Wu, Qi Han, Yunta Hsieh, Jizhou Wang, Yuyue Zhang, Yuxin Cheng…
9 months, 3 weeks ago
Scaling Image and Video Generation via Test-Time Evolutionary Search
Episode 798
🤗 Upvotes: 33 | cs.CV, cs.AI, cs.LG
Authors:
Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Ga…
9 months, 3 weeks ago
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback
Episode 797
🤗 Upvotes: 25 | cs.CL, cs.AI, cs.CE
Authors:
Wanhao Liu, Zonglin Yang, Jue Wang, Lidong Bing, Di Zhang, Dongzhan…
9 months, 3 weeks ago
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification
Episode 796
🤗 Upvotes: 86 | cs.AI, cs.CL, cs.CV
Authors:
NovelSeek Team, Bo Zhang, Shiyang Feng, Xiangchao Yan, Jiakang Yuan…
9 months, 3 weeks ago
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models
Episode 795
🤗 Upvotes: 49 | cs.CL, cs.AI
Authors:
Tingchen Fu, Jiawei Gu, Yafu Li, Xiaoye Qu, Yu Cheng
9 months, 3 weeks ago
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
Episode 794
🤗 Upvotes: 43 | cs.CL, cs.AI, cs.LG
Authors:
Guanting Dong, Yifei Chen, Xiaoxi Li, Jiajie Jin, Hongjin Qian, Yut…
9 months, 3 weeks ago
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Episode 793
🤗 Upvotes: 37 | cs.CV, cs.AI, cs.CL
Authors:
Alex Su, Haozhe Wang, Weimin Ren, Fangzhen Lin, Wenhu Chen
9 months, 3 weeks ago
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models
Episode 792
🤗 Upvotes: 36 | cs.CV
Authors:
Yongliang Wu, Zonghui Li, Xinting Hu, Xinyu Ye, Xianfang Zeng, Gang Yu, Wenbo Zhu…
9 months, 3 weeks ago