Feature Analysis Network: An Interpretable Idea in Deep Learning

Xinyu Li, Xiaoguang Gao, Qianglong Wang, Chenfeng Wang, Bo Li, Kaifang Wan

Cognitive Computation, published 2024-01-20. DOI: 10.1007/s12559-023-10238-0 (https://doi.org/10.1007/s12559-023-10238-0)
Abstract
Deep Learning (DL) stands out as a leading model for processing high-dimensional data, where the nonlinear transformations of the hidden layers effectively extract features. However, these unexplainable features make DL a model of low interpretability. Conversely, a Bayesian network (BN) is transparent and highly interpretable, which makes it helpful for interpreting DL. To improve the interpretability of DL from the perspective of feature cognition, we propose the feature analysis network (FAN), a DL structure fused with a BN. FAN retains the feature extraction capability of DL and applies a BN as the output layer to learn the relationships between the features and the outputs. These relationships can be represented probabilistically, and intuitively, by the structure and parameters of the BN. In a further study, a correlation clustering-based feature analysis network (cc-FAN) is proposed to detect the correlations among inputs and to preserve this information, so as to explain the features' physical meaning to a certain extent. To quantitatively evaluate the interpretability of the model, we separately design network simplification and interpretability indicators. Experiments on eight datasets show that FAN has better interpretability than the other models, with essentially unchanged accuracy and similar model complexity. On the radar effect mechanism dataset, in terms of the feature structure-based relevance interpretability indicator, FAN is up to 4.8 times better than the other models, and cc-FAN is up to 21.5 times better. FAN and cc-FAN enhance the interpretability of the DL model structure from the perspective of features; moreover, based on the input correlations, cc-FAN can help us better understand the physical meaning of the features.
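The architecture the abstract describes is a neural feature extractor whose discretized features feed a discrete Bayesian network serving as the output layer. Below is a minimal, illustrative Python sketch of that idea. It is not the authors' implementation: FAN learns the BN structure as well as its parameters, whereas this sketch fixes a naive-Bayes structure (class → each binarized feature) and learns only the conditional probability tables by smoothed counting. All names (FeatureExtractor, NaiveBayesOutputLayer) and hyperparameters are assumptions for illustration.

```python
# Sketch of a FAN-like model: a DL front end extracts features, and a
# discrete BN (here a fixed naive-Bayes structure, an assumption of this
# sketch) models the feature-output relationship with explicit CPTs.
import numpy as np
import torch
import torch.nn as nn

class FeatureExtractor(nn.Module):
    """Standard DL front end: nonlinear hidden layers that extract features."""
    def __init__(self, in_dim: int, feat_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 32), nn.ReLU(),
            nn.Linear(32, feat_dim), nn.Sigmoid(),  # features in [0, 1]
        )

    def forward(self, x):
        return self.net(x)

def binarize(feats: torch.Tensor) -> np.ndarray:
    """Discretize features so a discrete BN can model them."""
    return (feats.detach().numpy() > 0.5).astype(int)

class NaiveBayesOutputLayer:
    """Discrete BN output layer with a fixed class -> feature structure.
    CPTs are learned by frequency counting with Laplace smoothing, so the
    feature-output relationship is an inspectable probability table."""
    def __init__(self, n_classes: int, n_feats: int, alpha: float = 1.0):
        self.n_classes, self.n_feats, self.alpha = n_classes, n_feats, alpha

    def fit(self, feats: np.ndarray, y: np.ndarray):
        counts = np.array([(y == c).sum() for c in range(self.n_classes)])
        self.prior = (counts + self.alpha) / (counts.sum() + self.alpha * self.n_classes)
        # cpt[c, j] = P(feature_j = 1 | class = c)
        self.cpt = np.empty((self.n_classes, self.n_feats))
        for c in range(self.n_classes):
            fc = feats[y == c]
            self.cpt[c] = (fc.sum(axis=0) + self.alpha) / (len(fc) + 2 * self.alpha)
        return self

    def predict_proba(self, feats: np.ndarray) -> np.ndarray:
        # log P(c | f) is proportional to log P(c) + sum_j log P(f_j | c)
        logp = (np.log(self.prior) + feats @ np.log(self.cpt).T
                + (1 - feats) @ np.log(1 - self.cpt).T)
        p = np.exp(logp - logp.max(axis=1, keepdims=True))
        return p / p.sum(axis=1, keepdims=True)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 8)).astype(np.float32)
    y = (X[:, 0] + X[:, 1] > 0).astype(int)           # toy labels

    extractor = FeatureExtractor(in_dim=8, feat_dim=4)
    feats = binarize(extractor(torch.from_numpy(X)))  # untrained extractor: illustration only
    bn = NaiveBayesOutputLayer(n_classes=2, n_feats=4).fit(feats, y)
    print("P(feature_j = 1 | class):\n", bn.cpt)      # inspectable CPTs
    probs = bn.predict_proba(feats)
    print("BN output-layer accuracy:", (probs.argmax(axis=1) == y).mean())
```

The inspectable table bn.cpt is what makes the feature-output relationship transparent: each entry states how likely a learned feature is to fire under each class, which is the kind of probabilistic, intuitive representation the abstract attributes to the BN's structure and parameters.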
About the Journal
Cognitive Computation is an international, peer-reviewed, interdisciplinary journal that publishes cutting-edge articles describing original basic and applied work involving biologically inspired computational accounts of all aspects of natural and artificial cognitive systems. It provides a new platform for the dissemination of research, current practices, and future trends in the emerging discipline of cognitive computation, which bridges the gap between the life sciences, social sciences, engineering, physical and mathematical sciences, and humanities.