Podcast Episodes

Back to Search
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration

Episode 1686

🤗 Upvotes: 40 | cs.CV

Authors:
Danil Tokhchukov, Aysel Mirzoeva, Andrey Kuznetsov, Konstantin Sobolev

…

1 month, 1 week ago

Short Long
View Episode
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Episode 1685

🤗 Upvotes: 39 | cs.CV

Authors:
Yufeng Yang, Xianfang Zeng, Zhangqi Jiang, Fukun Yin, Jianzhuang Liu, Wei Cheng, …

1 month, 1 week ago

Short Long
View Episode
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

Episode 1684

🤗 Upvotes: 26 | cs.CV

Authors:
Zhekai Chen, Yuqing Wang, Manyuan Zhang, Xihui Liu

Title:
…

1 month, 1 week ago

Short Long
View Episode
Voxtral TTS

Episode 1683

🤗 Upvotes: 24 | cs.AI

Authors:
Alexander H. Liu, Alexis Tacnet, Andy Ehrenberg, Andy Lo, Chen-Yo Sun, Guillaume …

1 month, 1 week ago

Short Long
View Episode
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Episode 1682

🤗 Upvotes: 27 | cs.CL, cs.LG

Authors:
Jeonghye Kim, Xufang Luo, Minbeom Kim, Sangmook Lee, Dohyung Kim, Jiwon Je…

1 month, 1 week ago

Short Long
View Episode
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Episode 1681

🤗 Upvotes: 112 | cs.CV

Authors:
Hejun Dong, Junbo Niu, Bin Wang, Weijun Zeng, Wentao Zhang, Conghui He

…

1 month, 1 week ago

Short Long
View Episode
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Episode 1680

🤗 Upvotes: 69 | cs.CV

Authors:
Zhen Li, Zian Meng, Shuwei Shi, Wenshuo Peng, Yuwei Wu, Bo Zheng, Chuanhao Li, Ka…

1 month, 1 week ago

Short Long
View Episode
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Episode 1679

🤗 Upvotes: 43 | cs.AI, cs.CL

Authors:
Ling Yue, Kushal Raj Bhandari, Ching-Yun Ko, Dhaval Patel, Shuxin Lin, Nia…

1 month, 1 week ago

Short Long
View Episode
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Episode 1678

🤗 Upvotes: 43 | cs.CV, cs.CL

Authors:
Haoyu Huang, Jinfa Huang, Zhongwei Wan, Xiawu Zheng, Rongrong Ji, Jiebo Lu…

1 month, 1 week ago

Short Long
View Episode
PEARL: Personalized Streaming Video Understanding Model

Episode 1677

🤗 Upvotes: 36 | cs.CV, cs.AI, cs.IR

Authors:
Yuanhong Zheng, Ruichuan An, Xiaopeng Lin, Yuxing Liu, Sihan Yang, …

1 month, 1 week ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us