Podcast Episodes

Back to Search

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

Episode 1372

🤗 Upvotes: 31 | cs.CV, cs.RO

Authors:
Ziang Cao, Fangzhou Hong, Zhaoxi Chen, Liang Pan, Ziwei Liu

Ti…

7 months, 2 weeks ago

Short Long

View Episode

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

Episode 1371

🤗 Upvotes: 30 | cs.AI

Authors:
Jingxuan Wei, Caijun Jia, Xi Bai, Xinglong Xu, Siyuan Li, Linzhuang Sun, Bihui Yu…

7 months, 2 weeks ago

Short Long

View Episode

DoPE: Denoising Rotary Position Embedding

Episode 1370

🤗 Upvotes: 64 | cs.CL

Authors:
Jing Xiong, Liyang Fan, Hui Shen, Zunhai Su, Min Yang, Lingpeng Kong, Ngai Wong

…

7 months, 2 weeks ago

Short Long

View Episode

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Episode 1369

🤗 Upvotes: 41 | cs.CV

Authors:
Wei Chow, Jiachun Pan, Yongyuan Liang, Mingze Zhou, Xue Song, Liyu Jia, Saining Z…

7 months, 2 weeks ago

Short Long

View Episode

UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

Episode 1368

🤗 Upvotes: 28 | cs.CV

Authors:
Zhen Yang, Wenyi Hong, Mingde Xu, Xinyue Fan, Weihan Wang, Jiele Cheng, Xiaotao G…

7 months, 2 weeks ago

Short Long

View Episode

AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

Episode 1367

🤗 Upvotes: 24 | cs.AI, cs.CE, cs.LG

Authors:
Yuqi Yin, Yibo Fu, Siyuan Wang, Peng Sun, Hongyu Wang, Xiaohui Wang…

7 months, 2 weeks ago

Short Long

View Episode

LiteAttention: A Temporal Sparse Attention for Diffusion Transformers

Episode 1366

🤗 Upvotes: 23 | cs.CV, cs.AI

Authors:
Dor Shmilovich, Tony Wu, Aviad Dahan, Yuval Domb

Title:
…

7 months, 2 weeks ago

Short Long

View Episode

Virtual Width Networks

Episode 1365

🤗 Upvotes: 23 | cs.LG, cs.AI

Authors:
Seed, Baisheng Li, Banggu Wu, Bole Ma, Bowen Xiao, Chaoyi Zhang, Cheng Li,…

7 months, 2 weeks ago

Short Long

View Episode

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Episode 1364

🤗 Upvotes: 55 | cs.CV

Authors:
Aleksandr Razin, Danil Kazantsev, Ilya Makarov

Title:
One…

7 months, 2 weeks ago

Short Long

View Episode

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Episode 1363

🤗 Upvotes: 32 | cs.CV, cs.AI, cs.CL, cs.LG

Authors:
PAN Team, Jiannan Xiang, Yi Gu, Zihan Liu, Zeyu Feng, Qiyue …

7 months, 2 weeks ago

Short Long

View Episode

Podcast Episodes

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

DoPE: Denoising Rotary Position Embedding

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

LiteAttention: A Temporal Sparse Attention for Diffusion Transformers

Virtual Width Networks

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Love PodBriefly?