Podcast Episodes
$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning
Episode 985
🤗 Upvotes: 36 | cs.CV
Authors:
Yifan Wang, Jianjun Zhou, Haoyi Zhu, Wenzheng Chang, Yang Zhou, Zizun Li, Junyi C…
8 months ago
The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner
Episode 984
🤗 Upvotes: 33 | cs.CL
Authors:
Zhouqi Hua, Wenwei Zhang, Chengqi Lyu, Yuzhe Gu, Songyang Gao, Kuikun Liu, Kai Ch…
8 months ago
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning
Episode 983
🤗 Upvotes: 30 | cs.CV
Authors:
Yiming Ren, Zhiqiang Lin, Yu Li, Gao Meng, Weiyun Wang, Junjie Wang, Zicheng Lin,…
8 months ago
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
Episode 982
🤗 Upvotes: 29 | cs.CV
Authors:
Yudong Jin, Sida Peng, Xuan Wang, Tao Xie, Zhen Xu, Yifan Yang, Yujun Shen, Hujun…
8 months ago
RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization
Episode 981
🤗 Upvotes: 23 | cs.LG, cs.CL, cs.NA, math.DG, math.NA, 68T07, 65F55, 53Z50
Authors:
Vladimir Bogachev, Vladimir …
8 months ago
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Episode 980
🤗 Upvotes: 50 | cs.CL, cs.AI
Authors:
Yangning Li, Weizhi Zhang, Yuyao Yang, Wei-Chieh Huang, Yaozu Wu, Junyu Lu…
8 months ago
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models
Episode 979
🤗 Upvotes: 32 | cs.CV
Authors:
Tiezheng Zhang, Yitong Li, Yu-cheng Chou, Jieneng Chen, Alan Yuille, Chen Wei, Ju…
8 months ago
EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes
Episode 978
🤗 Upvotes: 24 | cs.CL, cs.AI
Authors:
LG AI Research: Kyunghoon Bae, Eunbi Choi, Kibong Choi, Stanley Jungkyu…
8 months ago
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
Episode 977
🤗 Upvotes: 44 | cs.LG, cs.AI, cs.CL
Authors:
Mingqi Wu, Zhihao Zhang, Qiaole Dong, Zhiheng Xi, Jun Zhao, Senjie …
8 months ago
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation
Episode 976
🤗 Upvotes: 43 | cs.CV, eess.AS
Authors:
Youliang Zhang, Zhaoyang Li, Duomin Wang, Jiahe Zhang, Deyu Zhou, Zixin …
8 months ago