Can Retrieval Heads See Images? Multimodal Retrieval Heads in Long-Context Vision-Language Models 文章

ArXiv CS.CV2026-05-27NEWSen作者: Aaron Branson Cigres Li, Zhaowei Wang, Yu Zhao, Yiming Du, Haobo Li, Xiyu Ren, Ginny Wong, Simon See, Lishu Luo, Haodong Duan, Pasquale Minervini, Yangqiu Song

Can Retrieval Heads See Images? Multimodal Retrieval Heads in Long-Context Vision-Language Models · 相关人物

暂无数据