Podcast Episodes
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation
Episode 1375
🤗 Upvotes: 47 | cs.CV
Authors:
Ye Tian, Ling Yang, Jiongfan Yang, Anran Wang, Yu Tian, Jiani Zheng, Haochen Wang…
5 months, 2 weeks ago
GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning
Episode 1374
🤗 Upvotes: 46 | cs.IR, cs.AI, cs.LG
Authors:
Duolin Sun, Meixiu Long, Dan Yang, Yihan Jiao, Zhehao Tan, Jie Feng…
5 months, 2 weeks ago
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
Episode 1373
🤗 Upvotes: 40 | cs.CV
Authors:
Harold Haodong Chen, Disen Lan, Wen-Jie Shu, Qingyang Liu, Zihan Wang, Sirui Chen…
5 months, 2 weeks ago
PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image
Episode 1372
🤗 Upvotes: 31 | cs.CV, cs.RO
Authors:
Ziang Cao, Fangzhou Hong, Zhaoxi Chen, Liang Pan, Ziwei Liu
5 months, 2 weeks ago
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
Episode 1371
🤗 Upvotes: 30 | cs.AI
Authors:
Jingxuan Wei, Caijun Jia, Xi Bai, Xinglong Xu, Siyuan Li, Linzhuang Sun, Bihui Yu…
5 months, 3 weeks ago
DoPE: Denoising Rotary Position Embedding
Episode 1370
🤗 Upvotes: 64 | cs.CL
Authors:
Jing Xiong, Liyang Fan, Hui Shen, Zunhai Su, Min Yang, Lingpeng Kong, Ngai Wong
5 months, 3 weeks ago
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Episode 1369
🤗 Upvotes: 41 | cs.CV
Authors:
Wei Chow, Jiachun Pan, Yongyuan Liang, Mingze Zhou, Xue Song, Liyu Jia, Saining Z…
5 months, 3 weeks ago
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation
Episode 1368
🤗 Upvotes: 28 | cs.CV
Authors:
Zhen Yang, Wenyi Hong, Mingde Xu, Xinyue Fan, Weihan Wang, Jiele Cheng, Xiaotao G…
5 months, 3 weeks ago
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery
Episode 1367
🤗 Upvotes: 24 | cs.AI, cs.CE, cs.LG
Authors:
Yuqi Yin, Yibo Fu, Siyuan Wang, Peng Sun, Hongyu Wang, Xiaohui Wang…
5 months, 3 weeks ago
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers
Episode 1366
🤗 Upvotes: 23 | cs.CV, cs.AI
Authors:
Dor Shmilovich, Tony Wu, Aviad Dahan, Yuval Domb
5 months, 3 weeks ago