Podcast Episodes
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion
Episode 1532
🤗 Upvotes: 74 | cs.CV, cs.AI
Authors:
Hoiyeong Jin, Hyojin Jang, Jeongho Kim, Junha Hyung, Kinam Kim, Dongjin Ki…
4 months, 1 week ago
Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding
Episode 1531
🤗 Upvotes: 70 | cs.CL
Authors:
Yuqing Li, Jiangnan Li, Zheng Lin, Ziyan Zhou, Junjie Wu, Weiping Wang, Jie Zhou,…
4 months, 1 week ago
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents
Episode 1530
🤗 Upvotes: 21 | cs.CV
Authors:
Hanzhang Zhou, Xu Zhang, Panrong Tong, Jianan Zhang, Liangyu Chen, Quyu Kong, Che…
4 months, 1 week ago
Latent Implicit Visual Reasoning
Episode 1529
🤗 Upvotes: 34 | cs.CV
Authors:
Kelvin Li, Chuyi Shang, Leonid Karlinsky, Rogerio Feris, Trevor Darrell, Roei Her…
4 months, 1 week ago
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Episode 1528
🤗 Upvotes: 26 | cs.LG, cs.AI
Authors:
Seijin Kobayashi, Yanick Schimpf, Maximilian Schlegel, Angelika Steger, Ma…
4 months, 1 week ago
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Episode 1527
🤗 Upvotes: 51 | cs.CV, cs.AI, cs.LG
Authors:
Jintao Zhang, Kaiwen Zheng, Kai Jiang, Haoxu Wang, Ion Stoica, Jose…
4 months, 1 week ago
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Episode 1526
🤗 Upvotes: 42 | cs.CV
Authors:
Shengchao Zhou, Yuxin Chen, Yuying Ge, Wei Huang, Jiehong Lin, Ying Shan, Xiaojua…
4 months, 1 week ago
DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation
Episode 1525
🤗 Upvotes: 26 | cs.CV
Authors:
Jiawei Liu, Junqiao Li, Jiangfan Deng, Gen Li, Siyu Zhou, Zetao Fang, Shanshan La…
4 months, 1 week ago
T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
Episode 1524
🤗 Upvotes: 23 | cs.CV
Authors:
Zhe Cao, Tao Wang, Jiaming Wang, Yanghai Wang, Yuanxing Zhang, Jialu Chen, Miao D…
4 months, 1 week ago
SemanticGen: Video Generation in Semantic Space
Episode 1523
🤗 Upvotes: 78 | cs.CV
Authors:
Jianhong Bai, Xiaoshi Wu, Xintao Wang, Xiao Fu, Yuanxing Zhang, Qinghe Wang, Xiao…
4 months, 1 week ago