Podcast Episodes

Back to Search
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Episode 1532

🤗 Upvotes: 74 | cs.CV, cs.AI

Authors:
Hoiyeong Jin, Hyojin Jang, Jeongho Kim, Junha Hyung, Kinam Kim, Dongjin Ki…

4 months, 1 week ago

Short Long
View Episode
Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Episode 1531

🤗 Upvotes: 70 | cs.CL

Authors:
Yuqing Li, Jiangnan Li, Zheng Lin, Ziyan Zhou, Junjie Wu, Weiping Wang, Jie Zhou,…

4 months, 1 week ago

Short Long
View Episode
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Episode 1530

🤗 Upvotes: 21 | cs.CV

Authors:
Hanzhang Zhou, Xu Zhang, Panrong Tong, Jianan Zhang, Liangyu Chen, Quyu Kong, Che…

4 months, 1 week ago

Short Long
View Episode
Latent Implicit Visual Reasoning

Episode 1529

🤗 Upvotes: 34 | cs.CV

Authors:
Kelvin Li, Chuyi Shang, Leonid Karlinsky, Rogerio Feris, Trevor Darrell, Roei Her…

4 months, 1 week ago

Short Long
View Episode
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Episode 1528

🤗 Upvotes: 26 | cs.LG, cs.AI

Authors:
Seijin Kobayashi, Yanick Schimpf, Maximilian Schlegel, Angelika Steger, Ma…

4 months, 1 week ago

Short Long
View Episode
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Episode 1527

🤗 Upvotes: 51 | cs.CV, cs.AI, cs.LG

Authors:
Jintao Zhang, Kaiwen Zheng, Kai Jiang, Haoxu Wang, Ion Stoica, Jose…

4 months, 1 week ago

Short Long
View Episode
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Episode 1526

🤗 Upvotes: 42 | cs.CV

Authors:
Shengchao Zhou, Yuxin Chen, Yuying Ge, Wei Huang, Jiehong Lin, Ying Shan, Xiaojua…

4 months, 1 week ago

Short Long
View Episode
DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation

Episode 1525

🤗 Upvotes: 26 | cs.CV

Authors:
Jiawei Liu, Junqiao Li, Jiangfan Deng, Gen Li, Siyu Zhou, Zetao Fang, Shanshan La…

4 months, 1 week ago

Short Long
View Episode
T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Episode 1524

🤗 Upvotes: 23 | cs.CV

Authors:
Zhe Cao, Tao Wang, Jiaming Wang, Yanghai Wang, Yuanxing Zhang, Jialu Chen, Miao D…

4 months, 1 week ago

Short Long
View Episode
SemanticGen: Video Generation in Semantic Space

Episode 1523

🤗 Upvotes: 78 | cs.CV

Authors:
Jianhong Bai, Xiaoshi Wu, Xintao Wang, Xiao Fu, Yuanxing Zhang, Qinghe Wang, Xiao…

4 months, 1 week ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us