Podcast Episodes
Back to SearchDA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models
Episode 1676
🤗 Upvotes: 36 | cs.CV
Authors:
Jaewon Min, Jaeeun Lee, Yeji Choi, Paul Hyunbin Cho, Jin Hyeon Kim, Tae-Young Lee…
1Â month, 1Â week ago
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM
Episode 1675
🤗 Upvotes: 33 | cs.CV, cs.GR, cs.RO
Authors:
Chuanrui Zhang, Minghan Qin, Yuang Wang, Baifeng Xie, Hang Li, Ziwe…
1Â month, 1Â week ago
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation
Episode 1674
🤗 Upvotes: 30 | cs.CV
Authors:
Jie Liu, Zilyu Ye, Linxiao Yuan, Shenhan Zhu, Yu Gao, Jie Wu, Kunchang Li, Xiongh…
1Â month, 1Â week ago
RealMaster: Lifting Rendered Scenes into Photorealistic Video
Episode 1673
🤗 Upvotes: 23 | cs.CV
Authors:
Dana Cohen-Bar, Ido Sobol, Raphael Bensadoun, Shelly Sheynin, Oran Gafni, Or Pata…
1Â month, 1Â week ago
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models
Episode 1672
🤗 Upvotes: 110 | cs.CV
Authors:
Meiqi Wu, Zhixin Cai, Fufangchen Zhao, Xiaokun Feng, Rujing Dang, Bingze Song, R…
1Â month, 1Â week ago
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model
Episode 1671
🤗 Upvotes: 90 | cs.CV
Authors:
SII-GAIR, Sand. ai, :, Ethan Chern, Hansi Teng, Hanwen Sun, Hao Wang, Hong Pan, H…
1Â month, 1Â week ago
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
Episode 1670
🤗 Upvotes: 63 | cs.AI, cs.CL
Authors:
Jianing Wang, Jianfei Zhang, Qi Guo, Linsen Guo, Rumei Li, Chao Zhang, Cho…
1Â month, 1Â week ago
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
Episode 1669
🤗 Upvotes: 60 | cs.CV, cs.AI
Authors:
Nimrod Shabtay, Moshe Kimhi, Artem Spector, Sivan Haray, Ehud Rivlin, Chai…
1Â month, 1Â week ago
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Episode 1668
🤗 Upvotes: 55 | cs.IR, cs.AI, cs.CL
Authors:
Zhuofeng Li, Dongfu Jiang, Xueguang Ma, Haoxiang Zhang, Ping Nie, Y…
1Â month, 1Â week ago
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding
Episode 1667
🤗 Upvotes: 45 | cs.CV
Authors:
Ruoliu Yang, Chu Wu, Caifeng Shan, Ran He, Chaoyou Fu
Title:
…
1Â month, 1Â week ago