Podcast Episodes

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times


Episode 1527


🤗 Upvotes: 51 | cs.CV, cs.AI, cs.LG

Authors:
Jintao Zhang, Kaiwen Zheng, Kai Jiang, Haoxu Wang, Ion Stoica, Joseph E. Gonzalez, Jianfei Chen, Jun Zhu


Published 13 hours ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models


Episode 1526


🤗 Upvotes: 42 | cs.CV

Authors:
Shengchao Zhou, Yuxin Chen, Yuying Ge, Wei Huang, Jiehong Lin, Ying Shan, Xiaojuan Qi


Published 13 hours ago

DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation


Episode 1525


🤗 Upvotes: 26 | cs.CV

Authors:
Jiawei Liu, Junqiao Li, Jiangfan Deng, Gen Li, Siyu Zhou, Zetao Fang, Shanshan Lao, Zengde Deng, Jianing Zhu, Tingting Ma, Jiayi Li…


Published 13 hours ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation


Episode 1524


🤗 Upvotes: 23 | cs.CV

Authors:
Zhe Cao, Tao Wang, Jiaming Wang, Yanghai Wang, Yuanxing Zhang, Jialu Chen, Miao Deng, Jiahao Wang, Yubin Guo, Chenxi Liao, Yize Zha…


Published 13 hours ago

SemanticGen: Video Generation in Semantic Space


Episode 1523


🤗 Upvotes: 78 | cs.CV

Authors:
Jianhong Bai, Xiaoshi Wu, Xintao Wang, Xiao Fu, Yuanxing Zhang, Qinghe Wang, Xiaoyu Shi, Menghan Xia, Zuozhu Liu, Haoji Hu, Pengfei…


Published 1 day, 13 hours ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies


Episode 1522


🤗 Upvotes: 49 | cs.LG, cs.AI, cs.CL

Authors:
Yuqiao Tan, Minzheng Wang, Shizhu He, Huanxuan Liao, Chengfeng Zhao, Qiunan Lu, Tian Liang, Jun Zhao, Kang Liu


Published 1 day, 13 hours ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos


Episode 1521


🤗 Upvotes: 38 | cs.AI, cs.CV, cs.LG, cs.MA

Authors:
Runtao Liu, Ziyi Liu, Jiaqi Tang, Yue Ma, Renjie Pi, Jipeng Zhang, Qifeng Chen


Published 1 day, 13 hours ago

SpatialTree: How Spatial Abilities Branch Out in MLLMs


Episode 1520


🤗 Upvotes: 35 | cs.CV

Authors:
Yuxi Xiao, Longfei Li, Shen Yan, Xinhang Liu, Sida Peng, Yunchao Wei, Xiaowei Zhou, Bingyi Kang


Published 1 day, 13 hours ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI


Episode 1519


🤗 Upvotes: 159 | cs.LG, cs.CL

Authors:
Hao Liang, Xiaochen Ma, Zhou Liu, Zhen Hao Wong, Zhengyang Zhao, Zimo Meng, Runming He, Chengyu Shen, Qifeng Cai, Zhaoyang …


Published 2 days, 13 hours ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding


Episode 1518


🤗 Upvotes: 53 | cs.CV

Authors:
Weichen Fan, Haiwen Diao, Quan Wang, Dahua Lin, Ziwei Liu


Published 2 days, 13 hours ago




