Podcast Episodes
Back to SearchOptimal Scaling Needs Optimal Norm
Episode 1232
🤗 Upvotes: 22 | cs.LG, cs.AI, stat.ML
Authors:
Oleg Filatov, Jiangtao Wang, Jan Ebert, Stefan Kesselheim
7Â months ago
Apriel-1.5-15b-Thinker
Episode 1231
🤗 Upvotes: 78 | cs.AI
Authors:
Shruthan Radhakrishna, Aman Tiwari, Aanjaneya Shukla, Masoud Hashemi, Rishabh Mah…
7Â months ago
Large Reasoning Models Learn Better Alignment from Flawed Thinking
Episode 1230
🤗 Upvotes: 34 | cs.LG
Authors:
ShengYun Peng, Eric Smith, Ivan Evtimov, Song Jiang, Pin-Yu Chen, Hongyuan Zhan, …
7Â months ago
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Episode 1229
🤗 Upvotes: 30 | cs.CV
Authors:
Zichen Wen, Shaobo Wang, Yufa Zhou, Junyuan Zhang, Qintong Zhang, Yifeng Gao, Zha…
7Â months ago
LongCodeZip: Compress Long Context for Code Language Models
Episode 1228
🤗 Upvotes: 70 | cs.CL, cs.SE
Authors:
Yuling Shi, Yichun Qian, Hongyu Zhang, Beijun Shen, Xiaodong Gu
7Â months ago
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
Episode 1227
🤗 Upvotes: 61 | cs.CV, cs.AI
Authors:
Justin Cui, Jie Wu, Ming Li, Tao Yang, Xiaojie Li, Rui Wang, Andrew Bai, Y…
7Â months ago
ExGRPO: Learning to Reason from Experience
Episode 1226
🤗 Upvotes: 50 | cs.LG, cs.AI, cs.CL
Authors:
Runzhe Zhan, Yafu Li, Zhi Wang, Xiaoye Qu, Dongrui Liu, Jing Shao, …
7Â months ago
StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions
Episode 1225
🤗 Upvotes: 46 | cs.CV
Authors:
Bo-Hsu Ke, You-Zhe Xie, Yu-Lun Liu, Wei-Chen Chiu
Title:
…
7Â months ago
Interactive Training: Feedback-Driven Neural Network Optimization
Episode 1224
🤗 Upvotes: 33 | cs.LG, cs.AI, cs.CL
Authors:
Wentao Zhang, Yang Young Lu, Yuntian Deng
Title:
…
7Â months ago
ModernVBERT: Towards Smaller Visual Document Retrievers
Episode 1223
🤗 Upvotes: 24 | cs.IR
Authors:
Paul Teiletche, Quentin Macé, Max Conti, Antonio Loison, Gautier Viaud, Pierre Co…
7Â months ago