Podcast Episodes
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Episode 1275
🤗 Upvotes: 25 | cs.CV
Authors:
Haomin Wang, Jinhui Yin, Qi Wei, Wenguang Zeng, Lixin Gu, Shenglong Ye, Zhangwei …
5 months ago
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
Episode 1274
🤗 Upvotes: 25 | cs.CL, cs.AI
Authors:
Tao Yu, Zhengbo Zhang, Zhiheng Lyu, Junhao Gong, Hongzhu Yi, Xinming Wang,…
5 months ago
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
Episode 1273
🤗 Upvotes: 104 | cs.AI, cs.CV, cs.RO
Authors:
Suwhan Choi, Jaeyoon Jung, Haebin Seong, Minchan Kim, Minyeong Kim…
5 months ago
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
Episode 1272
🤗 Upvotes: 86 | cs.CV
Authors:
Kang Liao, Size Wu, Zhonghua Wu, Linyi Jin, Chao Wang, Yikai Wang, Fei Wang, Wei …
5 months ago
TAG: Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling
Episode 1271
🤗 Upvotes: 38 | cs.CV
Authors:
Hyunmin Cho, Donghoon Ahn, Susung Hong, Jee Eun Kim, Seungryong Kim, Kyong Hwan J…
5 months ago
AutoPR: Let's Automate Your Academic Promotion!
Episode 1270
🤗 Upvotes: 38 | cs.CL
Authors:
Qiguang Chen, Zheng Yan, Mingda Yang, Libo Qin, Yixin Yuan, Hanjing Li, Jinhao Li…
5 months ago
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
Episode 1269
🤗 Upvotes: 37 | cs.LG, cs.AI, cs.CL
Authors:
Yumin Choi, Dongki Kim, Jinheon Baek, Sung Ju Hwang
5 months ago
BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities
Episode 1268
🤗 Upvotes: 30 | cs.CV, cs.RO
Authors:
Yu Qi, Haibo Zhao, Ziyu Guo, Siyuan Ma, Ziyan Chen, Yaokun Han, Renrui Zha…
5 months ago
StreamingVLM: Real-Time Understanding for Infinite Video Streams
Episode 1267
🤗 Upvotes: 26 | cs.CV, cs.AI, cs.CL
Authors:
Ruyi Xu, Guangxuan Xiao, Yukang Chen, Liuning He, Kelly Peng, Yao L…
5 months ago
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
Episode 1266
🤗 Upvotes: 22 | cs.CL, cs.AI
Authors:
Zhepeng Cen, Haolin Chen, Shiyu Wang, Zuxin Liu, Zhiwei Liu, Ding Zhao, Si…
5 months ago