CLIP Tricks You: Training-free Token Pruning for Efficient Pixel Grounding in Large Vision-Language Models

arXiv cs.CV · 2026-06-02 · Authors: Sangin Lee, Yukyung Choi

Related Technologies