Podcast Episodes
AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment
Episode 875
🤗 Upvotes: 39 | cs.LG, cs.AI, cs.CL, cs.RO
Authors:
Anastasiia Ivanova, Eva Bakaeva, Zoya Volovikova, Alexey K. …
9 months, 2 weeks ago
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark
Episode 874
🤗 Upvotes: 35 | cs.AR, cs.AI, cs.CL, cs.LG, cs.PL
Authors:
Ahmed Heakl, Sarim Hashmi, Gustavo Bertolo Stahl, Seu…
9 months, 2 weeks ago
A Controllable Examination for Long-Context Language Models
Episode 873
🤗 Upvotes: 30 | cs.CL
Authors:
Yijun Yang, Zeyu Huang, Wenhao Zhu, Zihan Qiu, Fei Yuan, Jeff Z. Pan, Ivan Titov
9 months, 2 weeks ago
MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos
Episode 872
🤗 Upvotes: 25 | cs.CV, cs.CL
Authors:
Kejian Zhu, Zhuoran Jin, Hongbang Yuan, Jiachun Li, Shangqing Tu, Pengfei …
9 months, 2 weeks ago
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
Episode 871
🤗 Upvotes: 23 | cs.CL
Authors:
Kejian Zhu, Shangqing Tu, Zhuoran Jin, Lei Hou, Juanzi Li, Jun Zhao
9 months, 2 weeks ago
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Episode 870
🤗 Upvotes: 23 | cs.CL
Authors:
Yuhao Wu, Yushi Bai, Zhiqiang Hu, Juanzi Li, Roy Ka-Wei Lee
9 months, 2 weeks ago
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Episode 869
🤗 Upvotes: 144 | cs.CL
Authors:
Shelly Bensal, Umar Jamil, Christopher Bryant, Melisa Russak, Kiran Kamble, Dmyt…
9 months, 2 weeks ago
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments
Episode 868
🤗 Upvotes: 51 | cs.AI
Authors:
Zelai Xu, Zhexuan Xu, Xiangmin Yi, Huining Yuan, Xinlei Chen, Yi Wu, Chao Yu, Yu …
9 months, 2 weeks ago
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
Episode 867
🤗 Upvotes: 49 | cs.CV, cs.AI, cs.CL
Authors:
Bin Lin, Zongjian Li, Xinhua Cheng, Yuwei Niu, Yang Ye, Xianyi He, …
9 months, 2 weeks ago
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
Episode 866
🤗 Upvotes: 46 | cs.LG, cs.CL, cs.CV
Authors:
Zijian Wu, Jinjie Ni, Xiangyan Liu, Zichen Liu, Hang Yan, Michael Q…
9 months, 2 weeks ago