Podcast Episodes

Phi-Ground Tech Report: Advancing Perception in GUI Grounding

Episode 1025

🤗 Upvotes: 28 | cs.CV, cs.AI, cs.MM

Authors:
Miaosen Zhang, Ziqiang Xu, Jialiang Zhu, Qi Dai, Kai Qiu, Yifan Yan…

11 months ago

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Episode 1024

🤗 Upvotes: 62 | cs.CV

Authors:
Yilei Jiang, Yaozhi Zheng, Yuxuan Wan, Jiaming Han, Qunzhong Wang, Michael R. Lyu…

11 months ago

BANG: Dividing 3D Assets via Generative Exploded Dynamics

Episode 1023

🤗 Upvotes: 46 | cs.GR

Authors:
Longwen Zhang, Qixuan Zhang, Haoran Jiang, Yinuo Bai, Wei Yang, Lan Xu, Jingyi Yu…

11 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Episode 1022

🤗 Upvotes: 30 | cs.CV, cs.AI, cs.CL

Authors:
Ruifeng Yuan, Chenghao Xiao, Sicong Leng, Jianyu Wang, Long Li, Wei…

11 months ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Episode 1021

🤗 Upvotes: 67 | cs.CV

Authors:
HunyuanWorld Team, Zhenwei Wang, Yuhao Liu, Junta Wu, Zixiao Gu, Haoyuan Wang, Xu…

11 months ago

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Episode 1020

🤗 Upvotes: 24 | cs.CV

Authors:
Zigang Geng, Yibing Wang, Yeyao Ma, Chen Li, Yongming Rao, Shuyang Gu, Zhao Zhong…

11 months ago

ChemDFM-R: An Chemical Reasoner LLM Enhanced with Atomized Chemical Knowledge

Episode 1019

🤗 Upvotes: 21 | cs.CE, cs.AI

Authors:
Zihan Zhao, Bo Chen, Ziping Wan, Lu Chen, Xuanze Lin, Shiyang Yu, Situo Zh…

11 months ago

Agentic Reinforced Policy Optimization

Episode 1018

🤗 Upvotes: 84 | cs.LG, cs.AI, cs.CL

Authors:
Guanting Dong, Hangyu Mao, Kai Ma, Licheng Bao, Yifei Chen, Zhongyu…

11 months ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Episode 1017

🤗 Upvotes: 50 | cs.CV

Authors:
Yuying Ge, Yixiao Ge, Chen Li, Teng Wang, Junfu Pu, Yizhuo Li, Lu Qiu, Jin Ma, Li…

11 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Episode 1016

🤗 Upvotes: 40 | cs.AI

Authors:
Huan-ang Gao, Jiayi Geng, Wenyue Hua, Mengkang Hu, Xinzhe Juan, Hongzhang Liu, Sh…

11 months ago
