Podcast Episodes

Back to Search

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Episode 1096

🤗 Upvotes: 120 | cs.CV

Authors:
Weiyun Wang, Zhangwei Gao, Lixin Gu, Hengjun Pu, Long Cui, Xingguang Wei, Zhaoya…

10 months ago

Short Long

View Episode

Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation

Episode 1095

🤗 Upvotes: 34 | cs.CV

Authors:
Yaqi Li, Peng Chen, Mingyang Han, Pi Bu, Haoxiang Shi, Runzhou Zhao, Yang Yao, Xu…

10 months ago

Short Long

View Episode

MV-RAG: Retrieval Augmented Multiview Diffusion

Episode 1094

🤗 Upvotes: 31 | cs.CV, cs.AI

Authors:
Yosef Dayani, Omer Benishu, Sagie Benaim

Title:
MV…

10 months ago

Short Long

View Episode

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Episode 1093

🤗 Upvotes: 58 | cs.LG, cs.CL

Authors:
Huichi Zhou, Yihang Chen, Siyuan Guo, Xue Yan, Kin Hei Lee, Zihan Wang, Ka…

10 months, 1 week ago

Short Long

View Episode

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Episode 1092

🤗 Upvotes: 41 | cs.CL

Authors:
Xiao Liang, Zhongzhi Li, Yeyun Gong, Yelong Shen, Ying Nian Wu, Zhijiang Guo, Wei…

10 months, 1 week ago

Short Long

View Episode

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks

Episode 1091

🤗 Upvotes: 34 | cs.RO, cs.CV

Authors:
Kaijun Wang, Liqin Lu, Mingyu Liu, Jianuo Jiang, Zeju Li, Bolin Zhang, Wan…

10 months, 1 week ago

Short Long

View Episode

Intern-S1: A Scientific Multimodal Foundation Model

Episode 1090

🤗 Upvotes: 166 | cs.LG, cs.CL, cs.CV

Authors:
Lei Bai, Zhongrui Cai, Maosong Cao, Weihan Cao, Chiyu Chen, Haojio…

10 months, 1 week ago

Short Long

View Episode

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Episode 1089

🤗 Upvotes: 40 | cs.AI

Authors:
Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zhe…

10 months, 1 week ago

Short Long

View Episode

Deep Think with Confidence

Episode 1088

🤗 Upvotes: 26 | cs.LG

Authors:
Yichao Fu, Xuewei Wang, Yuandong Tian, Jiawei Zhao

Title:
…

10 months, 1 week ago

Short Long

View Episode

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Episode 1087

🤗 Upvotes: 26 | cs.CL, cs.AI

Authors:
Ming Yin, Dinghan Shen, Silei Xu, Jianbing Han, Sixun Dong, Mian Zhang, Ye…

10 months, 1 week ago

Short Long

View Episode

Podcast Episodes

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation

MV-RAG: Retrieval Augmented Multiview Diffusion

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks

Intern-S1: A Scientific Multimodal Foundation Model

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Deep Think with Confidence

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Love PodBriefly?