Podcast Episodes
Back to SearchLongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
Episode 1429
🤗 Upvotes: 140 | cs.CV
Authors:
Zuhao Yang, Sudong Wang, Kaichen Zhang, Keming Wu, Sicong Leng, Yifan Zhang, Che…
3Â months, 2Â weeks ago
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights
Episode 1428
🤗 Upvotes: 83 | cs.CV, cs.AI
Authors:
Juanxi Tian, Siyuan Li, Conghui He, Lijun Wu, Cheng Tan
Title:…
3Â months, 2Â weeks ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Episode 1427
🤗 Upvotes: 56 | cs.LG, cs.AI, cs.CL
Authors:
Chujie Zheng, Kai Dang, Bowen Yu, Mingze Li, Huiqiang Jiang, Junron…
3Â months, 2Â weeks ago
How Far Are We from Genuinely Useful Deep Research Agents?
Episode 1426
🤗 Upvotes: 44 | cs.CL
Authors:
Dingling Zhang, He Zhu, Jincheng Ren, Kangqi Song, Xinran Zhou, Boyu Feng, Shudon…
3Â months, 2Â weeks ago
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
Episode 1425
🤗 Upvotes: 41 | cs.CV
Authors:
Minh-Quan Le, Yuanzhi Zhu, Vicky Kalogeiton, Dimitris Samaras
Title:
…
3Â months, 2Â weeks ago
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout
Episode 1424
🤗 Upvotes: 38 | cs.CV
Authors:
Hidir Yesiltepe, Tuna Han Salih Meral, Adil Kaan Akan, Kaan Oktay, Pinar Yanardag…
3Â months, 2Â weeks ago
The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment
Episode 1423
🤗 Upvotes: 36 | cs.CV
Authors:
Ziheng Ouyang, Yiren Song, Yaoli Liu, Shihao Zhu, Qibin Hou, Ming-Ming Cheng, Mik…
3Â months, 2Â weeks ago
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Episode 1422
🤗 Upvotes: 33 | cs.CV
Authors:
Zhiheng Liu, Weiming Ren, Haozhe Liu, Zijian Zhou, Shoufa Chen, Haonan Qiu, Xiaok…
3Â months, 2Â weeks ago
LFM2 Technical Report
Episode 1421
🤗 Upvotes: 31 | cs.LG, cs.AI
Authors:
Alexander Amini, Anna Banaszak, Harold Benoit, Arthur Böök, Tarek Dakhran,…
3Â months, 2Â weeks ago
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Episode 1420
🤗 Upvotes: 78 | cs.CV
Authors:
Z-Image Team, Huanqia Cai, Sihan Cao, Ruoyi Du, Peng Gao, Steven Hoi, Shijie Huan…
3Â months, 2Â weeks ago