Why Self-Inconsistency Arises in GNN Explanations and How to Exploit It 文章

ArXiv CS.AI2026-06-02NEWSen作者: Wenxin Tai, Yaqian Liu, Ting Zhong, Fan Zhou

摘要

arXiv:2605.07527v2 Announce Type: replace-cross Abstract: Recent work has observed that explanations produced by Self-Interpretable Graph Neural Networks (SI-GNNs) can be self-inconsistent: when the model is reapplied to its own explanatory graph subset, it may produce a different explanation. However, why self-inconsistency arises remains poorly understood. In this work, we first identify re-explanation-induced context perturbation as the direct cause of score variation. We then introduce a latent signal assignment hypothesis to explain why only some edges are sensitive to this perturbation, and analyze how conciseness regularization affects latent signal assignment. Given that self-inconsistent edges do not provide stable evidence for the model's prediction, we propose Self-Denoising (SD), a model-agnostic and training-free post-processing strategy that calibrates explanations with only one additional forward pass.