Building consistency in explanations: Harmonizing CNN attributions for satellite-based land cover classification

Timo T. Stomberg, Lennart A. Reißner, Martin G. Schultz, Ribana Roscher

Machine Learning with Applications, Vol. 20, Article 100653 (published 2025-05-06). DOI: 10.1016/j.mlwa.2025.100653
https://www.sciencedirect.com/science/article/pii/S2666827025000362
Abstract
Explainable machine learning has gained substantial attention for its role in enhancing transparency and trust in computer vision applications. Attribution methods like Grad-CAM and occlusion sensitivity analysis are frequently used to identify how features contribute to the predictions of neural networks. However, a key challenge is that different attribution methods often produce different outcomes, undermining trust in their results. Furthermore, the unique characteristics of remote sensing imagery pose additional challenges for attribution interpretation: it primarily comprises continuous "stuff" classes rather than discrete objects, exhibits fine-grained spatial variability, contains mixed pixels, is often multispectral, and exhibits spatial heterogeneity. To tackle this challenge, we present a novel methodology that harmonizes attributions, resulting in (1) greater consistency across different attribution methods; (2) more meaningful explanations when validated against known segmentation ground truth; and (3) enhanced transparency and traceability. This is achieved by coherently linking feature representations to attributions derived from analyzing the training data, enabling direct attribution assignment to features in (unseen) images. We evaluate our methodology using two satellite-based land cover classification datasets, three convolutional neural network architectures, and nine attribution methods. Harmonizing attributions increases the Pearson correlation coefficient between different attribution methods by an average of 0.18 across all datasets, models, and methods, and improves the micro F1-score, a measure of accuracy, by 12%. We demonstrate that Grad-CAM attributions are inherently well-aligned with the features, whereas other gradient-based attribution methods exhibit significant noise, which is mitigated through harmonization. Harmonization further enhances the resolution of occlusion-based attribution maps and corrects misleading explanations.
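The abstract quantifies harmonization with two metrics but does not spell out how they are computed. Below is a minimal sketch of one plausible reading, assuming attribution maps are flattened and compared pixel-wise (Pearson r), and that the micro F1-score comes from thresholding a min-max-normalized attribution map against a binary segmentation mask. The function names, the 0.5 threshold, and the normalization scheme are illustrative assumptions, not the paper's actual procedure.

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.metrics import f1_score


def attribution_consistency(map_a: np.ndarray, map_b: np.ndarray) -> float:
    """Pearson correlation between two attribution maps for the same image,
    e.g. a Grad-CAM map versus an occlusion-sensitivity map."""
    r, _ = pearsonr(map_a.ravel(), map_b.ravel())
    return float(r)


def attribution_f1(attr_map: np.ndarray, seg_mask: np.ndarray,
                   thresh: float = 0.5) -> float:
    """Micro F1-score between a thresholded attribution map and a binary
    segmentation ground-truth mask for the predicted class."""
    a = attr_map - attr_map.min()
    a = a / (a.max() + 1e-12)             # min-max normalize to [0, 1]
    pred = (a >= thresh).ravel().astype(int)
    return f1_score(seg_mask.ravel().astype(int), pred, average="micro")


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    grad_cam = rng.random((64, 64))        # stand-in attribution maps
    occlusion = 0.7 * grad_cam + 0.3 * rng.random((64, 64))
    mask = (grad_cam > 0.5).astype(int)    # stand-in segmentation mask
    print("consistency (Pearson r):", attribution_consistency(grad_cam, occlusion))
    print("micro F1 vs. mask:", attribution_f1(grad_cam, mask))
```

Under this reading, a higher Pearson r between methods indicates more consistent explanations, and a higher micro F1 indicates attributions that better coincide with the segmentation ground truth; the reported gains (+0.18 and +12%) would correspond to averages of these quantities over all datasets, models, and methods.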