{"title":"Railway switch fault diagnosis based on Multi heads Channel Self Attention, Residual Connection and Deep CNN","authors":"Xirui Chen, Hui Liu, Zhu Duan","doi":"10.1093/tse/tdac045","DOIUrl":null,"url":null,"abstract":"\n A novel switch diagnosis method based on self-attention and residual deep Convolutional Neural Networks (CNN) is proposed. Because of the imbalanced dataset, the Kmeans synthetic minority oversampling technique (SMOTE) is applied to balancing the dataset at first. Then, the deep CNN is utilized to extract local features from long power curves, and the residual connection is performed to handle the performance degeneration. In the end, the Multi-heads Channel Self Attention focuses on those important local features. The ablation and comparison experiments are applied to verifying the effectiveness of the proposed methods. With the residual connection and Multi-heads Channel Self Attention, the proposed method has achieved an accuracy of 99.83% impressively. The t-SNE based visualizations for features of the middle layers enhance the trustworthiness.","PeriodicalId":52804,"journal":{"name":"Transportation Safety and Environment","volume":" ","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Safety and Environment","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1093/tse/tdac045","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
A novel switch diagnosis method based on self-attention and residual deep Convolutional Neural Networks (CNN) is proposed. Because of the imbalanced dataset, the Kmeans synthetic minority oversampling technique (SMOTE) is applied to balancing the dataset at first. Then, the deep CNN is utilized to extract local features from long power curves, and the residual connection is performed to handle the performance degeneration. In the end, the Multi-heads Channel Self Attention focuses on those important local features. The ablation and comparison experiments are applied to verifying the effectiveness of the proposed methods. With the residual connection and Multi-heads Channel Self Attention, the proposed method has achieved an accuracy of 99.83% impressively. The t-SNE based visualizations for features of the middle layers enhance the trustworthiness.