Podcast Episodes

Back to Search
No image available

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm


Episode 1357


🤗 Upvotes: 127 | cs.CV, cs.CL

Authors:
Jingqi Tong, Yurong Mou, Hangcheng Li, Mingzhe Li, Yongzhuo Yang, Ming Zhang, Qiguang Chen, Tianyi Liang, Xiaomeng Hu, Yini…


Published on 14 hours ago

No image available

V-Thinker: Interactive Thinking with Images


Episode 1356


🤗 Upvotes: 66 | cs.CV

Authors:
Runqi Qiao, Qiuna Tan, Minghan Yang, Guanting Dong, Peiqing Yang, Shiqiang Lang, Enhui Wan, Xiaowan Wang, Yida Xu, Lan Yang, Chong …


Published on 14 hours ago

No image available

Scaling Agent Learning via Experience Synthesis


Episode 1355


🤗 Upvotes: 51 | cs.AI

Authors:
Zhaorun Chen, Zhuokai Zhao, Kai Zhang, Bo Liu, Qi Qi, Yifan Wu, Tarun Kalluri, Sara Cao, Yuanhao Xiong, Haibo Tong, Huaxiu Yao, Hen…


Published on 14 hours ago

No image available

Diffusion Language Models are Super Data Learners


Episode 1354


🤗 Upvotes: 67 | cs.LG

Authors:
Jinjie Ni, Qian Liu, Longxu Dou, Chao Du, Zili Wang, Hang Yan, Tianyu Pang, Michael Qizhe Shieh

Title:
Diff…


Published on 1 day, 14 hours ago

No image available

LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation


Episode 1353


🤗 Upvotes: 39 | cs.CL

Authors:
Gyeom Hwangbo, Hyungjoo Chae, Minseok Kang, Hyeonjong Ju, Soohyun Oh, Jinyoung Yeo

Title:
LEGO-Eval: Toward…


Published on 1 day, 14 hours ago

No image available

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions


Episode 1352


🤗 Upvotes: 39 | cs.CV

Authors:
Guozhen Zhang, Zixiang Zhou, Teng Hu, Ziqiao Peng, Youliang Zhang, Yi Chen, Yuan Zhou, Qinglin Lu, Limin Wang

Title:
…


Published on 1 day, 14 hours ago

No image available

Don't Blind Your VLA: Aligning Visual Representations for OOD Generalization


Episode 1351


🤗 Upvotes: 71 | cs.LG, cs.AI, cs.RO

Authors:
Nikita Kachaev, Mikhail Kolosov, Daniil Zelezetsky, Alexey K. Kovalev, Aleksandr I. Panov

Title:
…


Published on 2 days, 14 hours ago

No image available

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation


Episode 1350


🤗 Upvotes: 65 | cs.CV, cs.CL

Authors:
Kevin Qinghong Lin, Yuhao Zheng, Hangyu Ran, Dantong Zhu, Dongxing Mao, Linjie Li, Philip Torr, Alex Jinpeng Wang

…


Published on 2 days, 14 hours ago

No image available

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought


Episode 1349


🤗 Upvotes: 42 | cs.CV

Authors:
Yiyang Zhou, Haoqin Tu, Zijun Wang, Zeyu Wang, Niklas Muennighoff, Fan Nie, Yejin Choi, James Zou, Chaorui Deng, Shen Yan, Haoqi Fa…


Published on 2 days, 14 hours ago

No image available

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation


Episode 1348


🤗 Upvotes: 61 | cs.CL, cs.AI

Authors:
Ling-Team, Ang Li, Ben Liu, Binbin Hu, Bing Li, Bingwei Zeng, Borui Ye, Caizhi Tang, Changxin Tian, Chao Huang, Chao Zhang, …


Published on 3 days, 13 hours ago





If you like Podbriefly.com, please consider donating to support the ongoing development.

Donate