Interpreto: An Explainability Library for Transformers 事件

OPEN_SOURCE2026-06-02影响: MEDIUM

Interpreto: An Explainability Library for Transformers arXiv:2512.09730v3 Announce Type: replace Abstract: Interpreto is an open-source Python library for interpreting HuggingFace language models, from early BERT variants to LLMs. It provides two complementary families of methods: attribution methods and concept-based explanations. The library bridges recent research and practical tooling by exposing explanation workflows through a unified API for both classification and text generation. A key