Podcast Episodes

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times


Episode 1527


🤗 Upvotes: 51 | cs.CV, cs.AI, cs.LG

Authors:
Jintao Zhang, Kaiwen Zheng, Kai Jiang, Haoxu Wang, Ion Stoica, Joseph E. Gonzalez, Jianfei Chen, Jun Zhu


Published 9 hours ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models


Episode 1526


🤗 Upvotes: 42 | cs.CV

Authors:
Shengchao Zhou, Yuxin Chen, Yuying Ge, Wei Huang, Jiehong Lin, Ying Shan, Xiaojuan Qi


Published 9 hours ago

DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation


Episode 1525


🤗 Upvotes: 26 | cs.CV

Authors:
Jiawei Liu, Junqiao Li, Jiangfan Deng, Gen Li, Siyu Zhou, Zetao Fang, Shanshan Lao, Zengde Deng, Jianing Zhu, Tingting Ma, Jiayi Li…


Published 9 hours ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation


Episode 1524


🤗 Upvotes: 23 | cs.CV

Authors:
Zhe Cao, Tao Wang, Jiaming Wang, Yanghai Wang, Yuanxing Zhang, Jialu Chen, Miao Deng, Jiahao Wang, Yubin Guo, Chenxi Liao, Yize Zha…


Published 9 hours ago

SemanticGen: Video Generation in Semantic Space


Episode 1523


🤗 Upvotes: 78 | cs.CV

Authors:
Jianhong Bai, Xiaoshi Wu, Xintao Wang, Xiao Fu, Yuanxing Zhang, Qinghe Wang, Xiaoyu Shi, Menghan Xia, Zuozhu Liu, Haoji Hu, Pengfei…


Published 1 day, 9 hours ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies


Episode 1522


🤗 Upvotes: 49 | cs.LG, cs.AI, cs.CL

Authors:
Yuqiao Tan, Minzheng Wang, Shizhu He, Huanxuan Liao, Chengfeng Zhao, Qiunan Lu, Tian Liang, Jun Zhao, Kang Liu


Published 1 day, 9 hours ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos


Episode 1521


🤗 Upvotes: 38 | cs.AI, cs.CV, cs.LG, cs.MA

Authors:
Runtao Liu, Ziyi Liu, Jiaqi Tang, Yue Ma, Renjie Pi, Jipeng Zhang, Qifeng Chen


Published 1 day, 9 hours ago

SpatialTree: How Spatial Abilities Branch Out in MLLMs


Episode 1520


🤗 Upvotes: 35 | cs.CV

Authors:
Yuxi Xiao, Longfei Li, Shen Yan, Xinhang Liu, Sida Peng, Yunchao Wei, Xiaowei Zhou, Bingyi Kang


Published 1 day, 9 hours ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI


Episode 1519


🤗 Upvotes: 159 | cs.LG, cs.CL

Authors:
Hao Liang, Xiaochen Ma, Zhou Liu, Zhen Hao Wong, Zhengyang Zhao, Zimo Meng, Runming He, Chengyu Shen, Qifeng Cai, Zhaoyang …


Published 2 days, 9 hours ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding


Episode 1518


🤗 Upvotes: 53 | cs.CV

Authors:
Weichen Fan, Haiwen Diao, Quan Wang, Dahua Lin, Ziwei Liu


Published 2 days, 9 hours ago




