{"title":"Multi-branch global Transformer‐assisted network for fault diagnosis","authors":"Xiaorui Shao , Chang-Soo Kim","doi":"10.1016/j.asoc.2025.113572","DOIUrl":null,"url":null,"abstract":"<div><div>Fault Diagnosis (FD) is critical in smart manufacturing, enabling predictive maintenance, reducing operational costs, and enhancing system reliability. To deal with this task more accurately, this paper proposes a generative, effective, and novel framework, a multi-branch global Transformer-assisted network (MBGTNet), for accurate FD. First, a multi-branch global-wide one-dimension convolution operation (MBG-WideConv1D) is proposed to obtain global features in different views. Meanwhile, a Transformer assist scheme (TAS) is designed to leverage the Transformer's global feature extraction capacity. The features extracted by the Transformer are fused with those extracted with MBG-WideConv1D by minimizing their pairwise correlation alignment (CORAL) distances. Benefiting from the well-designed MBG-WideConv1D and TAS, the global features hidden in the raw signals are fully extracted from multiple viewpoints. Each branch of global features is then fed into a one-dimension convolutional neural network (1DCNN) to extract local features in a multi-supervised scheme (MSS) that helps each branch learn thoroughly. Furthermore, the proposed method employs a local feature correlation enhancement scheme (LFCS) to reduce distribution differences and increase robustness among the local features of each branch. As a result, the final features used for FD are a fusion of multi-view global and local features with strong robustness, enabling accurate FD in noisy environments. Comparative experiments on four datasets, including CWRU, MFPT, SU Bearing, and SU Gear, validate the proposed method's effectiveness, achieving over 99.6 % accuracy across four datasets. Moreover, the TAS and LFCS's generalities have been demonstrated on two 1DCNNs and hybrid CNN-LSTMs with four subsets. Also, the effectiveness of each component in the proposed framework has been thoroughly analyzed.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"182 ","pages":"Article 113572"},"PeriodicalIF":7.2000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156849462500883X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Fault Diagnosis (FD) is critical in smart manufacturing, enabling predictive maintenance, reducing operational costs, and enhancing system reliability. To deal with this task more accurately, this paper proposes a generative, effective, and novel framework, a multi-branch global Transformer-assisted network (MBGTNet), for accurate FD. First, a multi-branch global-wide one-dimension convolution operation (MBG-WideConv1D) is proposed to obtain global features in different views. Meanwhile, a Transformer assist scheme (TAS) is designed to leverage the Transformer's global feature extraction capacity. The features extracted by the Transformer are fused with those extracted with MBG-WideConv1D by minimizing their pairwise correlation alignment (CORAL) distances. Benefiting from the well-designed MBG-WideConv1D and TAS, the global features hidden in the raw signals are fully extracted from multiple viewpoints. Each branch of global features is then fed into a one-dimension convolutional neural network (1DCNN) to extract local features in a multi-supervised scheme (MSS) that helps each branch learn thoroughly. Furthermore, the proposed method employs a local feature correlation enhancement scheme (LFCS) to reduce distribution differences and increase robustness among the local features of each branch. As a result, the final features used for FD are a fusion of multi-view global and local features with strong robustness, enabling accurate FD in noisy environments. Comparative experiments on four datasets, including CWRU, MFPT, SU Bearing, and SU Gear, validate the proposed method's effectiveness, achieving over 99.6 % accuracy across four datasets. Moreover, the TAS and LFCS's generalities have been demonstrated on two 1DCNNs and hybrid CNN-LSTMs with four subsets. Also, the effectiveness of each component in the proposed framework has been thoroughly analyzed.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.