Research Paper ML Hub

arXiv.org / 2024

[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster

Qizhe Zhang, Aosong Cheng, Ming Lu, Zhiyong Zhuo, Minqi Wang, Jiajun Cao, Shaobo Guo, Qi She, Shanghang Zhang

ML Systems, Popular and Landmark Papers

No abstract is available for this paper yet.

85 citations, 13 influential
