Learn from A Rationalist: Distilling Intermediate Interpretable Rationales 文章

ArXiv CS.AI2026-05-29NEWSen作者: Jiayi Dai, Randy Goebel

摘要

arXiv:2601.22531v2 Announce Type: replace-cross Abstract: Because of the pervasive use of deep neural networks (DNNs), especially in high-stakes domains, the interpretability of DNNs has received increased attention. The general idea of rationale extraction (RE) is to provide an interpretable-by-design framework for DNNs via a select-predict architecture where two neural networks learn jointly to perform feature selection and prediction, respectively. Given only the remote supervision from the final task prediction, the process of learning to select subsets of features (or rationales) requires searching in the space of all possible feature combinations, which is computationally challenging and even harder when the base neural networks are not sufficiently capable. To improve the predictive performance of RE models that are based on less capable or smaller neural networks (i.e.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据