Podcast Episodes
Back to SearchStealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions
Episode 1225
🤗 Upvotes: 46 | cs.CV
Authors:
Bo-Hsu Ke, You-Zhe Xie, Yu-Lun Liu, Wei-Chen Chiu
Title:
…
5Â months, 2Â weeks ago
Interactive Training: Feedback-Driven Neural Network Optimization
Episode 1224
🤗 Upvotes: 33 | cs.LG, cs.AI, cs.CL
Authors:
Wentao Zhang, Yang Young Lu, Yuntian Deng
Title:
…
5Â months, 2Â weeks ago
ModernVBERT: Towards Smaller Visual Document Retrievers
Episode 1223
🤗 Upvotes: 24 | cs.IR
Authors:
Paul Teiletche, Quentin Macé, Max Conti, Antonio Loison, Gautier Viaud, Pierre Co…
5Â months, 2Â weeks ago
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Episode 1222
🤗 Upvotes: 24 | cs.LG, cs.CL
Authors:
Yanxu Chen, Zijun Yao, Yantao Liu, Jin Ye, Jianing Yu, Lei Hou, Juanzi Li
5Â months, 2Â weeks ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Episode 1221
🤗 Upvotes: 100 | cs.AI, cs.CL
Authors:
Fang Wu, Weihao Xuan, Heli Qi, Ximing Lu, Aaron Tu, Li Erran Li, Yejin Ch…
5Â months, 2Â weeks ago
GEM: A Gym for Agentic LLMs
Episode 1220
🤗 Upvotes: 53 | cs.LG, cs.AI, cs.CL
Authors:
Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Simon Yu, Xiangxin …
5Â months, 2Â weeks ago
VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators
Episode 1219
🤗 Upvotes: 52 | cs.RO, cs.CV
Authors:
Hengtao Li, Pengxiang Ding, Runze Suo, Yihao Wang, Zirui Ge, Dongyuan Zang…
5Â months, 2Â weeks ago
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Episode 1218
🤗 Upvotes: 32 | cs.LG, cs.AI, cs.CL
Authors:
Ziniu Li, Congliang Chen, Tianyun Yang, Tian Ding, Ruoyu Sun, Ge Zh…
5Â months, 2Â weeks ago
PIPer: On-Device Environment Setup via Online Reinforcement Learning
Episode 1217
🤗 Upvotes: 26 | cs.SE, cs.AI, cs.LG
Authors:
Alexander Kovrigin, Aleksandra Eliseeva, Konstantin Grotov, Egor Bo…
5Â months, 2Â weeks ago
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Episode 1216
🤗 Upvotes: 25 | cs.LG
Authors:
Lorenz K. Müller, Philippe Bich, Jiawei Zhuang, Ahmet Çelik, Luca Benfenati, Luka…
5Â months, 2Â weeks ago