Podcast Episodes

Back to Search
No image available

Step-GUI Technical Report


Episode 1497


🤗 Upvotes: 87 | cs.CV

Authors:
Haolong Yan, Jia Wang, Xin Huang, Yeqing Shen, Ziyang Meng, Zhimin Fan, Kaijun Tan, Jin Gao, Lieyu Shi, Mi Yang, Shiliang Yang, Zhi…


Published on 1 week ago

No image available

DEER: Draft with Diffusion, Verify with Autoregressive Models


Episode 1496


🤗 Upvotes: 39 | cs.LG, cs.AI

Authors:
Zicong Cheng, Guo-Wei Yang, Jia Li, Zhijie Deng, Meng-Hao Guo, Shi-Min Hu

Title:
DEER: Draft with Di…


Published on 1 week ago

No image available

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing


Episode 1495


🤗 Upvotes: 36 | cs.CL

Authors:
Lanxiang Hu, Siqi Kou, Yichao Fu, Samyam Rajbhandari, Tajana Rosing, Yuxiong He, Zhijie Deng, Hao Zhang

Title:
…


Published on 1 week ago

No image available

HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices


Episode 1494


🤗 Upvotes: 31 | cs.CV, cs.CL

Authors:
HyperAI Team, Yuchen Liu, Kaiyang Han, Zhiqiang Xia, Yuhang Dong, Chen Song, Kangyu Tang, Jiaming Xu, Xiushi Feng, WenXuan Y…


Published on 1 week ago

No image available

Puzzle Curriculum GRPO for Vision-Centric Reasoning


Episode 1493


🤗 Upvotes: 30 | cs.CV

Authors:
Ahmadreza Jeddi, Hakki Can Karaimer, Hue Nguyen, Zhongling Wang, Ke Zhao, Javad Rajabi, Ran Zhang, Raghav Goyal, Babak Taati, Radek…


Published on 1 week ago

No image available

MMGR: Multi-Modal Generative Reasoning


Episode 1492


🤗 Upvotes: 82 | cs.CL, cs.CV

Authors:
Zefan Cai, Haoyi Qiu, Tianyi Ma, Haozhe Zhao, Gengze Zhou, Kung-Hsiang Huang, Parisa Kordjamshidi, Minjia Zhang, Wen Xiao, J…


Published on 1 week, 1 day ago

No image available

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?


Episode 1491


🤗 Upvotes: 53 | cs.CV

Authors:
Jiaqi Wang, Weijia Wu, Yi Zhan, Rui Zhao, Ming Hu, James Cheng, Wei Liu, Philip Torr, Kevin Qinghong Lin

Title:
…


Published on 1 week, 1 day ago

No image available

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling


Episode 1490


🤗 Upvotes: 49 | cs.CV, cs.GR

Authors:
Wenqiang Sun, Haiyu Zhang, Haoyuan Wang, Junta Wu, Zehan Wang, Zhenwei Wang, Yunhong Wang, Jun Zhang, Tengfei Wang, Chunchao…


Published on 1 week, 1 day ago

No image available

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling


Episode 1489


🤗 Upvotes: 39 | cs.CV, cs.AI

Authors:
Yuran Wang, Bohan Zeng, Chengzhuo Tong, Wenxuan Liu, Yang Shi, Xiaochen Ma, Hao Liang, Yuanxing Zhang, Wentao Zhang

…


Published on 1 week, 1 day ago

No image available

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics


Episode 1488


🤗 Upvotes: 31 | cs.RO, cs.CV

Authors:
Enshen Zhou, Cheng Chi, Yibo Li, Jingkun An, Jiayuan Zhang, Shanyu Rong, Yi Han, Yuheng Ji, Mengzhen Liu, Pengwei Wang, Zhon…


Published on 1 week, 1 day ago





If you like Podbriefly.com, please consider donating to support the ongoing development.

Donate