Overview of the EReL@MIR 2025 Multimodal Document Retrieval Challenge (Track 1) 文章

ArXiv CS.CV2026-06-04NEWSen作者: Jingbiao Mei

详细信息

来源站点: ArXiv CS.CV
作者: Jingbiao Mei
文章类型: NEWS
语言: en
发布日期: 2026-06-04

摘要

arXiv:2606.04240v1 Announce Type: new Abstract: Retrieval over visually-rich documents, pages that interleave text with figures, tables, and charts, is essential for multimodal retrieval-augmented generation, yet most retrievers still discard the visual channel. The \emph{Multimodal Document Retrieval Challenge}, Track~1 of the MIR Challenge at the first EReL@MIR workshop, co-located with The Web Conference 2025, asks participants to build a \emph{single} retrieval system that handles two complementary regimes: closed-set document page retrieval within long documents from a text query (MMDocIR), and open-domain retrieval of Wikipedia-style passages from an image or image-plus-text query (M2KR). Systems are ranked by the macro-average of mean Recall@$\{1,3,5\}$ over the two tasks. The challenge drew 455 entrants and 586 submissions across 22 teams. This report describes the challenge design, datasets, and evaluation protocol; reports the final standings;

Overview of the EReL@MIR 2025 Multimodal Document Retrieval Challenge (Track 1) 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (2)