Podcast Episodes

Back to Search
Scaling RL to Long Videos

Episode 965

🤗 Upvotes: 95 | cs.CV, cs.AI, cs.CL

Authors:
Yukang Chen, Wei Huang, Baifeng Shi, Qinghao Hu, Hanrong Ye, Ligeng…

8 months, 1 week ago

Short Long
View Episode
T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Episode 964

🤗 Upvotes: 83 | cs.CV

Authors:
Vera Soboleva, Aibek Alanov, Andrey Kuznetsov, Konstantin Sobolev

Tit…

8 months, 1 week ago

Short Long
View Episode
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Episode 963

🤗 Upvotes: 37 | cs.CV, cs.AI, cs.CL

Authors:
Haochen Wang, Xiangtai Li, Zilong Huang, Anran Wang, Jiacong Wang, …

8 months, 1 week ago

Short Long
View Episode
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Episode 962

🤗 Upvotes: 29 | cs.CV

Authors:
JingLi Lin, Chenming Zhu, Runsen Xu, Xiaohan Mao, Xihui Liu, Tai Wang, Jiangmiao …

8 months, 1 week ago

Short Long
View Episode
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Episode 961

🤗 Upvotes: 24 | cs.CV, cs.AI

Authors:
Jeongseok Hyun, Sukjun Hwang, Su Ho Han, Taeoh Kim, Inwoong Lee, Dongyoon …

8 months, 1 week ago

Short Long
View Episode
Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Episode 960

🤗 Upvotes: 23 | cs.CV, cs.AI

Authors:
Haoyu Wu, Diankun Wu, Tianyu He, Junliang Guo, Yang Ye, Yueqi Duan, Jiang …

8 months, 1 week ago

Short Long
View Episode
PyVision: Agentic Vision with Dynamic Tooling

Episode 959

🤗 Upvotes: 22 | cs.CL, cs.AI, cs.CV

Authors:
Shitian Zhao, Haoquan Zhang, Shaoheng Lin, Ming Li, Qilong Wu, Kaip…

8 months, 1 week ago

Short Long
View Episode
4KAgent: Agentic Any Image to 4K Super-Resolution

Episode 958

🤗 Upvotes: 56 | cs.CV, eess.IV

Authors:
Yushen Zuo, Qi Zheng, Mingyang Wu, Xinrui Jiang, Renjie Li, Jian Wang, Y…

8 months, 1 week ago

Short Long
View Episode
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Episode 957

🤗 Upvotes: 41 | cs.CV

Authors:
Ke Fan, Shunlin Lu, Minyue Dai, Runyi Yu, Lixing Xiao, Zhiyang Dou, Junting Dong,…

8 months, 1 week ago

Short Long
View Episode
Perception-Aware Policy Optimization for Multimodal Reasoning

Episode 956

🤗 Upvotes: 34 | cs.CL

Authors:
Zhenhailong Wang, Xuehang Guo, Sofia Stoica, Haiyang Xu, Hongru Wang, Hyeonjeong …

8 months, 1 week ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us