GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation 文章

ArXiv CS.AI2026-05-27NEWSen作者: Nicolas Salvy, Hugues Talbot, Bertrand Thirion

摘要

arXiv:2602.16449v2 Announce Type: replace-cross Abstract: Generative model evaluation commonly relies on high-dimensional embedding spaces to compute distances between samples. We show that dataset representations in these spaces are affected by the hubness phenomenon, which distorts nearest-neighbor relationships and biases distance-based metrics. Building on the classical Iterative Contextual Dissimilarity Measure (ICDM), we introduce Generative ICDM (GICDM), a method to correct neighborhood estimation for both real and generated data. We introduce a multi-scale extension to improve empirical behavior. Extensive experiments on synthetic and real benchmarks demonstrate that GICDM resolves hubness-induced failures, restores reliable metric behavior, and improves alignment with human assessment.