Podcast Episodes
Back to SearchA Controllable Examination for Long-Context Language Models
Episode 873
🤗 Upvotes: 30 | cs.CL
Authors:
Yijun Yang, Zeyu Huang, Wenhao Zhu, Zihan Qiu, Fei Yuan, Jeff Z. Pan, Ivan Titov
11Â months ago
MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos
Episode 872
🤗 Upvotes: 25 | cs.CV, cs.CL
Authors:
Kejian Zhu, Zhuoran Jin, Hongbang Yuan, Jiachun Li, Shangqing Tu, Pengfei …
11Â months ago
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
Episode 871
🤗 Upvotes: 23 | cs.CL
Authors:
Kejian Zhu, Shangqing Tu, Zhuoran Jin, Lei Hou, Juanzi Li, Jun Zhao
T…
11Â months ago
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Episode 870
🤗 Upvotes: 23 | cs.CL
Authors:
Yuhao Wu, Yushi Bai, Zhiqiang Hu, Juanzi Li, Roy Ka-Wei Lee
Title:
…
11Â months ago
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Episode 869
🤗 Upvotes: 144 | cs.CL
Authors:
Shelly Bensal, Umar Jamil, Christopher Bryant, Melisa Russak, Kiran Kamble, Dmyt…
11Â months ago
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments
Episode 868
🤗 Upvotes: 51 | cs.AI
Authors:
Zelai Xu, Zhexuan Xu, Xiangmin Yi, Huining Yuan, Xinlei Chen, Yi Wu, Chao Yu, Yu …
11Â months ago
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
Episode 867
🤗 Upvotes: 49 | cs.CV, cs.AI, cs.CL
Authors:
Bin Lin, Zongjian Li, Xinhua Cheng, Yuwei Niu, Yang Ye, Xianyi He, …
11Â months ago
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
Episode 866
🤗 Upvotes: 46 | cs.LG, cs.CL, cs.CV
Authors:
Zijian Wu, Jinjie Ni, Xiangyan Liu, Zichen Liu, Hang Yan, Michael Q…
11Â months ago
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs
Episode 865
🤗 Upvotes: 43 | cs.CV, cs.AI
Authors:
Ai Jian, Weijie Qiu, Xiaokun Wang, Peiyu Wang, Yunzhuo Hao, Jiangbo Pei, Y…
11Â months ago
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Episode 864
🤗 Upvotes: 29 | cs.CL, cs.AI, cs.CV
Authors:
Qianhui Wu, Kanzhi Cheng, Rui Yang, Chaoyun Zhang, Jianwei Yang, Hu…
11Â months ago