Podcast Episodes

Back to Search

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Episode 1682

🤗 Upvotes: 27 | cs.CL, cs.LG

Authors:
Jeonghye Kim, Xufang Luo, Minbeom Kim, Sangmook Lee, Dohyung Kim, Jiwon Je…

3 months, 1 week ago

Short Long

View Episode

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Episode 1681

🤗 Upvotes: 112 | cs.CV

Authors:
Hejun Dong, Junbo Niu, Bin Wang, Weijun Zeng, Wentao Zhang, Conghui He

…

3 months, 1 week ago

Short Long

View Episode

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Episode 1680

🤗 Upvotes: 69 | cs.CV

Authors:
Zhen Li, Zian Meng, Shuwei Shi, Wenshuo Peng, Yuwei Wu, Bo Zheng, Chuanhao Li, Ka…

3 months, 1 week ago

Short Long

View Episode

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Episode 1679

🤗 Upvotes: 43 | cs.AI, cs.CL

Authors:
Ling Yue, Kushal Raj Bhandari, Ching-Yun Ko, Dhaval Patel, Shuxin Lin, Nia…

3 months, 1 week ago

Short Long

View Episode

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Episode 1678

🤗 Upvotes: 43 | cs.CV, cs.CL

Authors:
Haoyu Huang, Jinfa Huang, Zhongwei Wan, Xiawu Zheng, Rongrong Ji, Jiebo Lu…

3 months, 1 week ago

Short Long

View Episode

PEARL: Personalized Streaming Video Understanding Model

Episode 1677

🤗 Upvotes: 36 | cs.CV, cs.AI, cs.IR

Authors:
Yuanhong Zheng, Ruichuan An, Xiaopeng Lin, Yuxing Liu, Sihan Yang, …

3 months, 1 week ago

Short Long

View Episode

DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

Episode 1676

🤗 Upvotes: 36 | cs.CV

Authors:
Jaewon Min, Jaeeun Lee, Yeji Choi, Paul Hyunbin Cho, Jin Hyeon Kim, Tae-Young Lee…

3 months, 1 week ago

Short Long

View Episode

SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM

Episode 1675

🤗 Upvotes: 33 | cs.CV, cs.GR, cs.RO

Authors:
Chuanrui Zhang, Minghan Qin, Yuang Wang, Baifeng Xie, Hang Li, Ziwe…

3 months, 1 week ago

Short Long

View Episode

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Episode 1674

🤗 Upvotes: 30 | cs.CV

Authors:
Jie Liu, Zilyu Ye, Linxiao Yuan, Shenhan Zhu, Yu Gao, Jie Wu, Kunchang Li, Xiongh…

3 months, 1 week ago

Short Long

View Episode

RealMaster: Lifting Rendered Scenes into Photorealistic Video

Episode 1673

🤗 Upvotes: 23 | cs.CV

Authors:
Dana Cohen-Bar, Ido Sobol, Raphael Bensadoun, Shelly Sheynin, Oran Gafni, Or Pata…

3 months, 1 week ago

Short Long

View Episode

Podcast Episodes

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

PEARL: Personalized Streaming Video Understanding Model

DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

RealMaster: Lifting Rendered Scenes into Photorealistic Video

Love PodBriefly?