Podcast Episodes
Back to SearchWhy Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?
Episode 1682
🤗 Upvotes: 27 | cs.CL, cs.LG
Authors:
Jeonghye Kim, Xufang Luo, Minbeom Kim, Sangmook Lee, Dohyung Kim, Jiwon Je…
3Â months, 1Â week ago
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding
Episode 1681
🤗 Upvotes: 112 | cs.CV
Authors:
Hejun Dong, Junbo Niu, Bin Wang, Weijun Zeng, Wentao Zhang, Conghui He
3Â months, 1Â week ago
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG
Episode 1680
🤗 Upvotes: 69 | cs.CV
Authors:
Zhen Li, Zian Meng, Shuwei Shi, Wenshuo Peng, Yuwei Wu, Bo Zheng, Chuanhao Li, Ka…
3Â months, 1Â week ago
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents
Episode 1679
🤗 Upvotes: 43 | cs.AI, cs.CL
Authors:
Ling Yue, Kushal Raj Bhandari, Ching-Yun Ko, Dhaval Patel, Shuxin Lin, Nia…
3Â months, 1Â week ago
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning
Episode 1678
🤗 Upvotes: 43 | cs.CV, cs.CL
Authors:
Haoyu Huang, Jinfa Huang, Zhongwei Wan, Xiawu Zheng, Rongrong Ji, Jiebo Lu…
3Â months, 1Â week ago
PEARL: Personalized Streaming Video Understanding Model
Episode 1677
🤗 Upvotes: 36 | cs.CV, cs.AI, cs.IR
Authors:
Yuanhong Zheng, Ruichuan An, Xiaopeng Lin, Yuxing Liu, Sihan Yang, …
3Â months, 1Â week ago
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models
Episode 1676
🤗 Upvotes: 36 | cs.CV
Authors:
Jaewon Min, Jaeeun Lee, Yeji Choi, Paul Hyunbin Cho, Jin Hyeon Kim, Tae-Young Lee…
3Â months, 1Â week ago
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM
Episode 1675
🤗 Upvotes: 33 | cs.CV, cs.GR, cs.RO
Authors:
Chuanrui Zhang, Minghan Qin, Yuang Wang, Baifeng Xie, Hang Li, Ziwe…
3Â months, 1Â week ago
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation
Episode 1674
🤗 Upvotes: 30 | cs.CV
Authors:
Jie Liu, Zilyu Ye, Linxiao Yuan, Shenhan Zhu, Yu Gao, Jie Wu, Kunchang Li, Xiongh…
3Â months, 1Â week ago
RealMaster: Lifting Rendered Scenes into Photorealistic Video
Episode 1673
🤗 Upvotes: 23 | cs.CV
Authors:
Dana Cohen-Bar, Ido Sobol, Raphael Bensadoun, Shelly Sheynin, Oran Gafni, Or Pata…
3Â months, 1Â week ago