Podcast Episodes

Back to Search
A Controllable Examination for Long-Context Language Models

Episode 873

🤗 Upvotes: 30 | cs.CL

Authors:
Yijun Yang, Zeyu Huang, Wenhao Zhu, Zihan Qiu, Fei Yuan, Jeff Z. Pan, Ivan Titov

…

11 months ago

Short Long
View Episode
MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos

Episode 872

🤗 Upvotes: 25 | cs.CV, cs.CL

Authors:
Kejian Zhu, Zhuoran Jin, Hongbang Yuan, Jiachun Li, Shangqing Tu, Pengfei …

11 months ago

Short Long
View Episode
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Episode 871

🤗 Upvotes: 23 | cs.CL

Authors:
Kejian Zhu, Shangqing Tu, Zhuoran Jin, Lei Hou, Juanzi Li, Jun Zhao

T…

11 months ago

Short Long
View Episode
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Episode 870

🤗 Upvotes: 23 | cs.CL

Authors:
Yuhao Wu, Yushi Bai, Zhiqiang Hu, Juanzi Li, Roy Ka-Wei Lee

Title:
…

11 months ago

Short Long
View Episode
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Episode 869

🤗 Upvotes: 144 | cs.CL

Authors:
Shelly Bensal, Umar Jamil, Christopher Bryant, Melisa Russak, Kiran Kamble, Dmyt…

11 months ago

Short Long
View Episode
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Episode 868

🤗 Upvotes: 51 | cs.AI

Authors:
Zelai Xu, Zhexuan Xu, Xiangmin Yi, Huining Yuan, Xinlei Chen, Yi Wu, Chao Yu, Yu …

11 months ago

Short Long
View Episode
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Episode 867

🤗 Upvotes: 49 | cs.CV, cs.AI, cs.CL

Authors:
Bin Lin, Zongjian Li, Xinhua Cheng, Yuwei Niu, Yang Ye, Xianyi He, …

11 months ago

Short Long
View Episode
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Episode 866

🤗 Upvotes: 46 | cs.LG, cs.CL, cs.CV

Authors:
Zijian Wu, Jinjie Ni, Xiangyan Liu, Zichen Liu, Hang Yan, Michael Q…

11 months ago

Short Long
View Episode
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs

Episode 865

🤗 Upvotes: 43 | cs.CV, cs.AI

Authors:
Ai Jian, Weijie Qiu, Xiaokun Wang, Peiyu Wang, Yunzhuo Hao, Jiangbo Pei, Y…

11 months ago

Short Long
View Episode
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Episode 864

🤗 Upvotes: 29 | cs.CL, cs.AI, cs.CV

Authors:
Qianhui Wu, Kanzhi Cheng, Rui Yang, Chaoyun Zhang, Jianwei Yang, Hu…

11 months ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us