Podcast Episodes

Back to Search
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

Episode 1275

🤗 Upvotes: 25 | cs.CV

Authors:
Haomin Wang, Jinhui Yin, Qi Wei, Wenguang Zeng, Lixin Gu, Shenglong Ye, Zhangwei …

5 months ago

Short Long
View Episode
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Episode 1274

🤗 Upvotes: 25 | cs.CL, cs.AI

Authors:
Tao Yu, Zhengbo Zhang, Zhiheng Lyu, Junhao Gong, Hongzhu Yi, Xinming Wang,…

5 months ago

Short Long
View Episode
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Episode 1273

🤗 Upvotes: 104 | cs.AI, cs.CV, cs.RO

Authors:
Suwhan Choi, Jaeyoon Jung, Haebin Seong, Minchan Kim, Minyeong Kim…

5 months ago

Short Long
View Episode
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Episode 1272

🤗 Upvotes: 86 | cs.CV

Authors:
Kang Liao, Size Wu, Zhonghua Wu, Linyi Jin, Chao Wang, Yikai Wang, Fei Wang, Wei …

5 months ago

Short Long
View Episode
TAG:Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling

Episode 1271

🤗 Upvotes: 38 | cs.CV

Authors:
Hyunmin Cho, Donghoon Ahn, Susung Hong, Jee Eun Kim, Seungryong Kim, Kyong Hwan J…

5 months ago

Short Long
View Episode
AutoPR: Let's Automate Your Academic Promotion!

Episode 1270

🤗 Upvotes: 38 | cs.CL

Authors:
Qiguang Chen, Zheng Yan, Mingda Yang, Libo Qin, Yixin Yuan, Hanjing Li, Jinhao Li…

5 months ago

Short Long
View Episode
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

Episode 1269

🤗 Upvotes: 37 | cs.LG, cs.AI, cs.CL

Authors:
Yumin Choi, Dongki Kim, Jinheon Baek, Sung Ju Hwang

Tit…

5 months ago

Short Long
View Episode
BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities

Episode 1268

🤗 Upvotes: 30 | cs.CV, cs.RO

Authors:
Yu Qi, Haibo Zhao, Ziyu Guo, Siyuan Ma, Ziyan Chen, Yaokun Han, Renrui Zha…

5 months ago

Short Long
View Episode
StreamingVLM: Real-Time Understanding for Infinite Video Streams

Episode 1267

🤗 Upvotes: 26 | cs.CV, cs.AI, cs.CL

Authors:
Ruyi Xu, Guangxuan Xiao, Yukang Chen, Liuning He, Kelly Peng, Yao L…

5 months ago

Short Long
View Episode
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Episode 1266

🤗 Upvotes: 22 | cs.CL, cs.AI

Authors:
Zhepeng Cen, Haolin Chen, Shiyu Wang, Zuxin Liu, Zhiwei Liu, Ding Zhao, Si…

5 months ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us