SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence
Understanding the decisions made by deep neural networks is essential in high-stakes domains such as medical imaging and autonomous driving. Yet, these models often lack transparency, particularly in computer vision. Prototypical-parts-based neural networks have emerged as a promising solution by offering concept-level explanations. However, most are limited to fine-grained classification tasks, with few exceptions such as InfoDisent. InfoDisent extends prototypical models to large-scale datasets like ImageNet, but produces complex explanations. We introduce Sparse Information Disentanglement for Explainability (SIDE), a novel method that improves the interpretability of prototypical parts through a dedicated training and pruning scheme that enforces sparsity. Combined with sigmoid activations in place of softmax, this approach allows SIDE to associate each class with only a small set of relevant prototypes. Extensive experiments show that SIDE matches the accuracy of existing methods while reducing explanation size by over $90\%$, substantially enhancing the understandability of prototype-based explanations.
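To make the mechanism described above concrete, below is a minimal, hypothetical sketch (not the authors' implementation) of a sparse class-prototype scoring head in PyTorch: independent per-class sigmoid scores replace a softmax over classes, an L1 penalty encourages a sparse class-prototype weight matrix, and magnitude pruning retains only a few prototypes per class. All names here (`SparsePrototypeHead`, `keep_per_class`, the L1 coefficient) are illustrative assumptions, not identifiers from the paper.

```python
import torch
import torch.nn as nn

class SparsePrototypeHead(nn.Module):
    """Illustrative sketch of a sparse prototype-to-class scoring head.

    Each class is linked to prototypes through a weight matrix; an L1
    penalty plus magnitude pruning keeps only a few prototypes per class,
    and per-class sigmoid scores replace a softmax over classes.
    """

    def __init__(self, num_prototypes: int, num_classes: int):
        super().__init__()
        # Class-prototype connection weights; sparsity is enforced on this matrix.
        self.weight = nn.Parameter(torch.randn(num_classes, num_prototypes) * 0.01)

    def forward(self, proto_activations: torch.Tensor) -> torch.Tensor:
        # proto_activations: (batch, num_prototypes) prototype similarity scores.
        logits = proto_activations @ self.weight.t()
        # Sigmoid gives independent per-class scores instead of a softmax.
        return torch.sigmoid(logits)

    def l1_penalty(self) -> torch.Tensor:
        # Sparsity regularizer to be added to the training loss.
        return self.weight.abs().sum()

    @torch.no_grad()
    def prune(self, keep_per_class: int = 5) -> None:
        # Keep only the largest-magnitude connections per class; zero the rest,
        # so each class's explanation uses a small set of prototypes.
        k = min(keep_per_class, self.weight.shape[1])
        thresh = self.weight.abs().topk(k, dim=1).values[:, -1:]
        self.weight.mul_((self.weight.abs() >= thresh).float())
```

In training, the penalty would be added to the classification loss, e.g. `loss = bce(head(acts), targets) + 1e-4 * head.l1_penalty()`, and `head.prune()` would be applied afterwards so that each class's explanation shrinks to its few retained prototypes.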
Viktar Dubovik, Łukasz Struski, Jacek Tabor, Dawid Rymarczyk
Subject: Computing Technology; Computer Technology
Viktar Dubovik, Łukasz Struski, Jacek Tabor, Dawid Rymarczyk. SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence [EB/OL]. (2025-07-25) [2025-08-10]. https://arxiv.org/abs/2507.19321.