Podcast Episodes
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
Episode 1172
🤗 Upvotes: 32 | cs.LG, cs.CV
Authors:
Tianyu Yu, Zefan Wang, Chongyi Wang, Fuwei Huang, Wenshuo Ma, Zhihui He, T…
7 months, 1 week ago
LIMI: Less is More for Agency
Episode 1171
🤗 Upvotes: 69 | cs.AI
Authors:
Yang Xiao, Mohan Jiang, Jie Sun, Keyu Li, Jifan Lin, Yumin Zhuang, Ji Zeng, Shiji…
7 months, 2 weeks ago
Qwen3-Omni Technical Report
Episode 1170
🤗 Upvotes: 56 | cs.CL, cs.AI, cs.CV, eess.AS
Authors:
Jin Xu, Zhifang Guo, Hangrui Hu, Yunfei Chu, Xiong Wang, J…
7 months, 2 weeks ago
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models
Episode 1169
🤗 Upvotes: 49 | cs.CV
Authors:
Jinshu Chen, Xinghui Li, Xu Bai, Tianxiang Ma, Pengze Zhang, Zhuowei Chen, Gen Li…
7 months, 2 weeks ago
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System
Episode 1168
🤗 Upvotes: 27 | cs.IR, cs.AI, cs.CL
Authors:
Sunhao Dai, Jiakai Tang, Jiahua Wu, Kun Wang, Yuxuan Zhu, Bingjun C…
7 months, 2 weeks ago
TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs
Episode 1167
🤗 Upvotes: 26 | cs.CV
Authors:
Yunheng Li, Jing Cheng, Shaoyong Jia, Hangyi Kuang, Shaohui Jiao, Qibin Hou, Ming…
7 months, 2 weeks ago
RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
Episode 1166
🤗 Upvotes: 89 | cs.CL, cs.AI, cs.SE
Authors:
Jane Luo, Xin Zhang, Steven Liu, Jie Wu, Yiming Huang, Yangyu Huang…
7 months, 2 weeks ago
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Episode 1165
🤗 Upvotes: 37 | cs.CV, cs.CL, cs.LG
Authors:
Yanghao Li, Rui Qian, Bowen Pan, Haotian Zhang, Haoshuo Huang, Bowe…
7 months, 2 weeks ago
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification
Episode 1164
🤗 Upvotes: 28 | cs.LG, cs.AI, cs.CV, stat.ML
Authors:
Zinan Lin, Enshu Liu, Xuefei Ning, Junyi Zhu, Wenyu Wang, …
7 months, 2 weeks ago
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Episode 1163
🤗 Upvotes: 81 | cs.CV
Authors:
Zhaoyang Liu, JingJing Xie, Zichen Ding, Zehao Li, Bowen Yang, Zhenyu Wu, Xuehui …
7 months, 2 weeks ago