Podcast Episodes

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

Episode 875

🤗 Upvotes: 39 | cs.LG, cs.AI, cs.CL, cs.RO

Authors:
Anastasiia Ivanova, Eva Bakaeva, Zoya Volovikova, Alexey K. …

9 months, 2 weeks ago

CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark

Episode 874

🤗 Upvotes: 35 | cs.AR, cs.AI, cs.CL, cs.LG, cs.PL

Authors:
Ahmed Heakl, Sarim Hashmi, Gustavo Bertolo Stahl, Seu…

9 months, 2 weeks ago

A Controllable Examination for Long-Context Language Models

Episode 873

🤗 Upvotes: 30 | cs.CL

Authors:
Yijun Yang, Zeyu Huang, Wenhao Zhu, Zihan Qiu, Fei Yuan, Jeff Z. Pan, Ivan Titov

9 months, 2 weeks ago

MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos

Episode 872

🤗 Upvotes: 25 | cs.CV, cs.CL

Authors:
Kejian Zhu, Zhuoran Jin, Hongbang Yuan, Jiachun Li, Shangqing Tu, Pengfei …

9 months, 2 weeks ago

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Episode 871

🤗 Upvotes: 23 | cs.CL

Authors:
Kejian Zhu, Shangqing Tu, Zhuoran Jin, Lei Hou, Juanzi Li, Jun Zhao

9 months, 2 weeks ago

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Episode 870

🤗 Upvotes: 23 | cs.CL

Authors:
Yuhao Wu, Yushi Bai, Zhiqiang Hu, Juanzi Li, Roy Ka-Wei Lee

9 months, 2 weeks ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Episode 869

🤗 Upvotes: 144 | cs.CL

Authors:
Shelly Bensal, Umar Jamil, Christopher Bryant, Melisa Russak, Kiran Kamble, Dmyt…

9 months, 2 weeks ago

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Episode 868

🤗 Upvotes: 51 | cs.AI

Authors:
Zelai Xu, Zhexuan Xu, Xiangmin Yi, Huining Yuan, Xinlei Chen, Yi Wu, Chao Yu, Yu …

9 months, 2 weeks ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Episode 867

🤗 Upvotes: 49 | cs.CV, cs.AI, cs.CL

Authors:
Bin Lin, Zongjian Li, Xinhua Cheng, Yuwei Niu, Yang Ye, Xianyi He, …

9 months, 2 weeks ago

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Episode 866

🤗 Upvotes: 46 | cs.LG, cs.CL, cs.CV

Authors:
Zijian Wu, Jinjie Ni, Xiangyan Liu, Zichen Liu, Hang Yan, Michael Q…

9 months, 2 weeks ago

