XAI-2DCOS: Enhancing Interpretability in Spectral Deep Learning Models Through 2D Correlation Spectroscopy

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics Pub Date : 2025-07-11 DOI:10.1002/cem.70045

Jhonatan Contreras, Thomas Bocklitz

{"title":"XAI-2DCOS: Enhancing Interpretability in Spectral Deep Learning Models Through 2D Correlation Spectroscopy","authors":"Jhonatan Contreras, Thomas Bocklitz","doi":"10.1002/cem.70045","DOIUrl":null,"url":null,"abstract":"<p>Deep learning (DL) has significantly advanced Raman spectra analysis, achieving high accuracy and efficiency. However, their complexity and opacity limit their application in areas where understanding and transparency are essential. To address this, we present XAI-2DCOS, an innovative eXplainable Artificial Intelligence (XAI) framework that employs 2D correlation spectroscopy (2DCOS). Traditionally, 2DCOS reveals the sequence of molecular changes under varying conditions. We repurpose it to enhance the interpretability of DL models by linking changes in spectral features to model outputs, identifying critical wavenumbers, and how their variations affect model accuracy. We applied XAI-2DCOS to a DL model trained on a dataset of oil Raman spectra, demonstrating its ability to identify critical spectral features that align with domain knowledge. To improve robustness, we integrated a conditional generative adversarial network (CGAN) for data augmentation. CGAN generates synthetic data, ensuring the presence of spectra across the entire probability range. A normalized relevance score quantifies the contribution for each wavenumber to the model's prediction. A predictive probability map delineates decision boundaries within the PCA space. Synchronous 2DCOS maps are used to guide spectral adjustments that improve prediction confidence for specific class predictions. These adjustments can affect multiple output classes with differential scaling of output activations, suggesting that crossing a threshold can shift the model decision. Our results show that XAI-2DCOS improves the interpretability and reliability of DL models applied to Raman spectra. Furthermore, CGAN data augmentation extends the applicability of XAI-2DCOS to smaller datasets.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 7","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70045","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemometrics","FirstCategoryId":"92","ListUrlMain":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/cem.70045","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL WORK","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning (DL) has significantly advanced Raman spectra analysis, achieving high accuracy and efficiency. However, their complexity and opacity limit their application in areas where understanding and transparency are essential. To address this, we present XAI-2DCOS, an innovative eXplainable Artificial Intelligence (XAI) framework that employs 2D correlation spectroscopy (2DCOS). Traditionally, 2DCOS reveals the sequence of molecular changes under varying conditions. We repurpose it to enhance the interpretability of DL models by linking changes in spectral features to model outputs, identifying critical wavenumbers, and how their variations affect model accuracy. We applied XAI-2DCOS to a DL model trained on a dataset of oil Raman spectra, demonstrating its ability to identify critical spectral features that align with domain knowledge. To improve robustness, we integrated a conditional generative adversarial network (CGAN) for data augmentation. CGAN generates synthetic data, ensuring the presence of spectra across the entire probability range. A normalized relevance score quantifies the contribution for each wavenumber to the model's prediction. A predictive probability map delineates decision boundaries within the PCA space. Synchronous 2DCOS maps are used to guide spectral adjustments that improve prediction confidence for specific class predictions. These adjustments can affect multiple output classes with differential scaling of output activations, suggesting that crossing a threshold can shift the model decision. Our results show that XAI-2DCOS improves the interpretability and reliability of DL models applied to Raman spectra. Furthermore, CGAN data augmentation extends the applicability of XAI-2DCOS to smaller datasets.

Abstract Image

查看原文本刊更多论文

XAI-2DCOS：通过二维相关光谱增强光谱深度学习模型的可解释性

深度学习（DL）在拉曼光谱分析方面具有显著的进步，实现了高精度和高效率。然而，它们的复杂性和不透明性限制了它们在理解和透明至关重要的领域的应用。为了解决这个问题，我们提出了XAI-2DCOS，这是一种创新的可解释人工智能（XAI）框架，采用2D相关光谱（2DCOS）。传统上，2DCOS揭示了不同条件下分子变化的序列。我们将其重新用于增强DL模型的可解释性，方法是将光谱特征的变化与模型输出联系起来，识别关键波数，以及它们的变化如何影响模型精度。我们将XAI-2DCOS应用于在石油拉曼光谱数据集上训练的深度学习模型，证明了其识别与领域知识一致的关键光谱特征的能力。为了提高鲁棒性，我们集成了一个条件生成对抗网络（CGAN）来进行数据增强。CGAN生成合成数据，确保在整个概率范围内存在光谱。规范化的相关性评分量化了每个波数对模型预测的贡献。预测概率图描绘了PCA空间内的决策边界。同步2DCOS地图用于指导光谱调整，以提高特定类别预测的预测信心。这些调整可以影响具有不同输出激活比例的多个输出类，这表明跨越阈值可以改变模型决策。结果表明，XAI-2DCOS提高了拉曼光谱DL模型的可解释性和可靠性。此外，CGAN数据增强将XAI-2DCOS的适用性扩展到更小的数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Chemometrics 化学-分析化学

CiteScore

5.20

自引率

8.30%

发文量

审稿时长

2 months

期刊介绍： The Journal of Chemometrics is devoted to the rapid publication of original scientific papers, reviews and short communications on fundamental and applied aspects of chemometrics. It also provides a forum for the exchange of information on meetings and other news relevant to the growing community of scientists who are interested in chemometrics and its applications. Short, critical review papers are a particularly important feature of the journal, in view of the multidisciplinary readership at which it is aimed.