Podcast Episodes

Back to Search
No image available

DarwinLM: Evolutionary Structured Pruning of Large Language Models


Episode 557


🤗 Upvotes: 9 | cs.LG, cs.CL

Authors:
Shengkun Tang, Oliver Sieberling, Eldar Kurtic, Zhiqiang Shen, Dan Alistarh

Title:
DarwinLM: Evolutio…


Published on 10 months, 1 week ago

No image available

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU


Episode 556


🤗 Upvotes: 62 | cs.CL, cs.LG

Authors:
Heejun Lee, Geon Park, Jaduk Suh, Sung Ju Hwang

Title:
InfiniteHiP: Extending Language Model Context…


Published on 10 months, 1 week ago

No image available

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding


Episode 555


🤗 Upvotes: 35 | cs.CL, cs.AI, cs.CV, cs.LG

Authors:
Mo Yu, Lemao Liu, Junjie Wu, Tsz Ting Chung, Shunchi Zhang, Jiangnan Li, Dit-Yan Yeung, Jie Zhou

T…


Published on 10 months, 1 week ago

No image available

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation


Episode 554


🤗 Upvotes: 28 | cs.LG, cs.AI, cs.CV

Authors:
Hoigi Seo, Wongi Jeong, Jae-sun Seo, Se Young Chun

Title:
Skrr: Skip and Re-use Text Encoder …


Published on 10 months, 1 week ago

No image available

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models


Episode 553


🤗 Upvotes: 22 | cs.CL, cs.AI, cs.LG

Authors:
Yung-Sung Chuang, Benjamin Cohen-Wang, Shannon Zejiang Shen, Zhaofeng Wu, Hu Xu, Xi Victoria Lin, James Glass, Shang-…


Published on 10 months, 1 week ago

No image available

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights


Episode 552


🤗 Upvotes: 21 | cs.LG, cs.CV

Authors:
Jonathan Kahana, Or Nathan, Eliahu Horwitz, Yedid Hoshen

Title:
Can this Model Also Recognize Dogs? …


Published on 10 months, 1 week ago

No image available

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging


Episode 551


🤗 Upvotes: 21 | cs.CL, cs.AI

Authors:
Kunat Pipatanakul, Pittawat Taveekitworachai, Potsawee Manakul, Kasima Tharnpipitchai

Title:
An Open…


Published on 10 months, 1 week ago

No image available

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents


Episode 550


🤗 Upvotes: 20 | cs.AI, cs.CL, cs.CV

Authors:
Rui Yang, Hanyang Chen, Junyu Zhang, Mark Zhao, Cheng Qian, Kangrui Wang, Qineng Wang, Teja Venkat Koripella, Marziye…


Published on 10 months, 1 week ago

No image available

Exploring the Potential of Encoder-free Architectures in 3D LMMs


Episode 549


🤗 Upvotes: 17 | cs.CV, cs.AI, cs.CL

Authors:
Yiwen Tang, Zoey Guo, Zhuhao Wang, Ray Zhang, Qizhi Chen, Junli Liu, Delin Qu, Zhigang Wang, Dong Wang, Xuelong Li, B…


Published on 10 months, 1 week ago

No image available

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles


Episode 548


🤗 Upvotes: 16 | cs.CL, cs.AI

Authors:
Xintao Wang, Heng Wang, Yifei Zhang, Xinfeng Yuan, Rui Xu, Jen-tse Huang, Siyu Yuan, Haoran Guo, Jiangjie Chen, Wei Wang, Ya…


Published on 10 months, 1 week ago





If you like Podbriefly.com, please consider donating to support the ongoing development.

Donate