Interpretable AI for vibration-based structural health monitoring: a comparative study of CNN and transformer architectures on a benchmark shear building

Q2 Engineering

Asian Journal of Civil Engineering Pub Date : 2025-07-21 DOI:10.1007/s42107-025-01446-0

I. V. Sarma, Sarit Chanda, M. Srinivasa Reddy

{"title":"Interpretable AI for vibration-based structural health monitoring: a comparative study of CNN and transformer architectures on a benchmark shear building","authors":"I. V. Sarma, Sarit Chanda, M. Srinivasa Reddy","doi":"10.1007/s42107-025-01446-0","DOIUrl":null,"url":null,"abstract":"<div><p>The proliferation of Artificial Intelligence (AI) in Structural Health Monitoring (SHM) has catalyzed a paradigm shift from traditional, feature-based damage detection to end-to-end, data-driven methodologies. While Deep Learning (DL) models, particularly Convolutional Neural Networks (CNNs), have demonstrated remarkable efficacy, the advent of Transformer architectures presents a new frontier with unparalleled capabilities for sequence modeling. However, a direct comparative analysis of these architectures on a standardized experimental benchmark, coupled with a deep investigation into their decision-making processes, remains a critical research gap. This study addresses this void by conducting a comprehensive investigation using a publicly available experimental dataset from a six-storey laboratory shear building. We develop, train, and evaluate two distinct DL models: a lightweight one-dimensional CNN (Fast CNN) and a state-of-the-art Transformer-based model (Fast Transformer). Both models are tasked with directly classifying the structural state (undamaged vs. damaged) from raw accelerometer time-series data. Performance evaluation based on standard metrics reveals that both models achieve exceptional accuracy, with the Fast CNN reaching 99.44% and the Fast Transformer reaching 98.87% on validation datasets. This work’s core contribution lies in applying Explainable AI (XAI) techniques, including Integrated Gradients and saliency mapping, to deconstruct these models’ “black box” nature. Our analysis reveals a non-intuitive yet consistent finding: both the CNN and the Transformer primarily focus on the vibration signature of the base sensor (Sensor 1) to detect damage located at the fourth storey. This suggests the models have learned to identify damage through their influence on the structure’s global dynamic response as reflected at their boundary conditions. Furthermore, XAI reveals distinct operational strategies: the CNN acts as a highly localized feature detector, whereas the Transformer leverages its self-attention mechanism to weigh a broader spatiotemporal context. This paper provides a rigorous benchmark for modern DL architectures in vibration-based SHM and tells a technical story of how interpretable AI can uncover novel, physically meaningful damage detection strategies, enhancing trust and guiding future development of intelligent monitoring systems.</p></div>","PeriodicalId":8513,"journal":{"name":"Asian Journal of Civil Engineering","volume":"26 11","pages":"4615 - 4628"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s42107-025-01446-0.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Civil Engineering","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s42107-025-01446-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Engineering","Score":null,"Total":0}

引用次数: 0

Abstract

The proliferation of Artificial Intelligence (AI) in Structural Health Monitoring (SHM) has catalyzed a paradigm shift from traditional, feature-based damage detection to end-to-end, data-driven methodologies. While Deep Learning (DL) models, particularly Convolutional Neural Networks (CNNs), have demonstrated remarkable efficacy, the advent of Transformer architectures presents a new frontier with unparalleled capabilities for sequence modeling. However, a direct comparative analysis of these architectures on a standardized experimental benchmark, coupled with a deep investigation into their decision-making processes, remains a critical research gap. This study addresses this void by conducting a comprehensive investigation using a publicly available experimental dataset from a six-storey laboratory shear building. We develop, train, and evaluate two distinct DL models: a lightweight one-dimensional CNN (Fast CNN) and a state-of-the-art Transformer-based model (Fast Transformer). Both models are tasked with directly classifying the structural state (undamaged vs. damaged) from raw accelerometer time-series data. Performance evaluation based on standard metrics reveals that both models achieve exceptional accuracy, with the Fast CNN reaching 99.44% and the Fast Transformer reaching 98.87% on validation datasets. This work’s core contribution lies in applying Explainable AI (XAI) techniques, including Integrated Gradients and saliency mapping, to deconstruct these models’ “black box” nature. Our analysis reveals a non-intuitive yet consistent finding: both the CNN and the Transformer primarily focus on the vibration signature of the base sensor (Sensor 1) to detect damage located at the fourth storey. This suggests the models have learned to identify damage through their influence on the structure’s global dynamic response as reflected at their boundary conditions. Furthermore, XAI reveals distinct operational strategies: the CNN acts as a highly localized feature detector, whereas the Transformer leverages its self-attention mechanism to weigh a broader spatiotemporal context. This paper provides a rigorous benchmark for modern DL architectures in vibration-based SHM and tells a technical story of how interpretable AI can uncover novel, physically meaningful damage detection strategies, enhancing trust and guiding future development of intelligent monitoring systems.

查看原文本刊更多论文

基于振动的结构健康监测的可解释人工智能：基于基准剪力建筑的CNN和变压器结构的比较研究

人工智能（AI）在结构健康监测（SHM）领域的广泛应用促进了从传统的、基于特征的损伤检测到端到端、数据驱动方法的范式转变。虽然深度学习（DL）模型，特别是卷积神经网络（cnn）已经证明了显著的功效，但Transformer架构的出现为序列建模提供了一个无与伦比的新领域。然而，在标准化实验基准上对这些架构进行直接比较分析，再加上对其决策过程的深入调查，仍然是一个关键的研究缺口。本研究通过使用来自六层实验室剪切楼的公开实验数据集进行全面调查，解决了这一空白。我们开发、训练和评估了两种不同的深度学习模型：轻量级一维CNN （Fast CNN）和最先进的基于变压器的模型（Fast Transformer）。这两种模型的任务是从原始加速度计时间序列数据中直接分类结构状态（未损坏与损坏）。基于标准指标的性能评估表明，两种模型都达到了出色的准确率，Fast CNN和Fast Transformer在验证数据集上达到了99.44%和98.87%。这项工作的核心贡献在于应用可解释的人工智能（XAI）技术，包括集成梯度和显著性映射，来解构这些模型的“黑箱”性质。我们的分析揭示了一个非直观但一致的发现：CNN和Transformer都主要关注基础传感器（传感器1）的振动特征，以检测位于四层的损坏。这表明这些模型已经学会了通过它们对结构整体动力响应的影响来识别损伤，这反映在它们的边界条件上。此外，XAI揭示了不同的操作策略：CNN作为高度本地化的特征检测器，而Transformer利用其自关注机制来权衡更广泛的时空背景。本文为基于振动的SHM中的现代深度学习架构提供了严格的基准，并讲述了可解释的AI如何揭示新颖的，物理上有意义的损伤检测策略，增强信任并指导智能监控系统的未来发展的技术故事。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Asian Journal of Civil Engineering Engineering-Civil and Structural Engineering

CiteScore

2.70

自引率

0.00%

发文量

121

期刊介绍： The Asian Journal of Civil Engineering (Building and Housing) welcomes articles and research contributions on topics such as:- Structural analysis and design - Earthquake and structural engineering - New building materials and concrete technology - Sustainable building and energy conservation - Housing and planning - Construction management - Optimal design of structuresPlease note that the journal will not accept papers in the area of hydraulic or geotechnical engineering, traffic/transportation or road making engineering, and on materials relevant to non-structural buildings, e.g. materials for road making and asphalt. Although the journal will publish authoritative papers on theoretical and experimental research works and advanced applications, it may also feature, when appropriate: a) tutorial survey type papers reviewing some fields of civil engineering; b) short communications and research notes; c) book reviews and conference announcements.