Methods for interpreting and understanding deep neural networks 论文

2017Digital Signal Processing引用 2677
Explainable Artificial Intelligence (XAI)Neural Networks and ApplicationsAdversarial Robustness in Machine Learning

摘要

This paper provides an entry point to the problem of interpreting a deep neural network model and explaining its predictions. It is based on a tutorial given at ICASSP 2017. As a tutorial paper, the set of methods covered here is not exhaustive, but sufficiently representative to discuss a number of questions in interpretability, technical challenges, and possible applications. The second part of the tutorial focuses on the recently proposed layer-wise relevance propagation (LRP) technique, for which we provide theory, recommendations, and tricks, to make most efficient use of it on real data.