Podcast Episodes
Back to SearchWEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Episode 1369
🤗 Upvotes: 41 | cs.CV
Authors:
Wei Chow, Jiachun Pan, Yongyuan Liang, Mingze Zhou, Xue Song, Liyu Jia, Saining Z…
4Â months ago
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation
Episode 1368
🤗 Upvotes: 28 | cs.CV
Authors:
Zhen Yang, Wenyi Hong, Mingde Xu, Xinyue Fan, Weihan Wang, Jiele Cheng, Xiaotao G…
4Â months ago
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery
Episode 1367
🤗 Upvotes: 24 | cs.AI, cs.CE, cs.LG
Authors:
Yuqi Yin, Yibo Fu, Siyuan Wang, Peng Sun, Hongyu Wang, Xiaohui Wang…
4Â months ago
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers
Episode 1366
🤗 Upvotes: 23 | cs.CV, cs.AI
Authors:
Dor Shmilovich, Tony Wu, Aviad Dahan, Yuval Domb
Title:
…
4Â months ago
Virtual Width Networks
Episode 1365
🤗 Upvotes: 23 | cs.LG, cs.AI
Authors:
Seed, Baisheng Li, Banggu Wu, Bole Ma, Bowen Xiao, Chaoyi Zhang, Cheng Li,…
4Â months ago
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
Episode 1364
🤗 Upvotes: 55 | cs.CV
Authors:
Aleksandr Razin, Danil Kazantsev, Ilya Makarov
Title:
One…
4Â months ago
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Episode 1363
🤗 Upvotes: 32 | cs.CV, cs.AI, cs.CL, cs.LG
Authors:
PAN Team, Jiannan Xiang, Yi Gu, Zihan Liu, Zeyu Feng, Qiyue …
4Â months ago
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist
Episode 1362
🤗 Upvotes: 26 | cs.CV
Authors:
Zhengyang Liang, Daoan Zhang, Huichi Zhou, Rui Huang, Bobo Li, Yuechen Zhang, She…
4Â months ago
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains
Episode 1361
🤗 Upvotes: 34 | cs.CL, cs.AI
Authors:
Zihao Yi, Qingxuan Jiang, Ruotian Ma, Xingyu Chen, Qu Yang, Mengru Wang, F…
4Â months, 1Â week ago
DeepEyesV2: Toward Agentic Multimodal Model
Episode 1360
🤗 Upvotes: 31 | cs.CV, cs.AI
Authors:
Jack Hong, Chenxiao Zhao, ChengLin Zhu, Weiheng Lu, Guohai Xu, Xing Yu
4Â months, 1Â week ago