{"title":"Fault classification method for on-board equipment of metro train control system based on BERT-CNN","authors":"Qian XU, Lei ZHANG, Dongxiu OU, Yunpeng HE","doi":"10.3724/sp.j.1249.2023.05529","DOIUrl":null,"url":null,"abstract":"XU Qian 1, , ZHANG Lei 1, , OU Dongxiu 1, , and HE Yunpeng 3 1) Shanghai Key Laboratory of Rail Infrastructure Durability and System Safety, Tongji University, Shanghai 201804, P. R. China 2) College of Transportation Engineering, Tongji University, Shanghai 201804, P. R. China 3) China Railway Siyuan Survey and Design Group Co. Ltd. , Wuhan 430063, Hubei Province, P. R. China Abstract: The on-board equipment of metro communication based train control (CBTC) is facing laborious maintenance problems, and its textual maintenance logs are criticized for having excessively fragmented information, ambiguous semantics and confused categorization, resulting in low classification metrics by traditional textual distributed representation with basic machine learning algorithms. A fault classification method based on bidirectional encoder representations from transformers convolutional neural network (BERT-CNN) with the focal loss function is proposed to establish the relationship model between the 'fault processing and conclusion' and the 'fault phenomena'. The pre-trained bidirectional encoder representations from transformers (BERT) model is finetuned to fully capture the bidirectional semantics and focus on the keywords to produce better word vectors of the 'fault phenomena'. In order to counteract the classification performance degradation brought by data category imbalance, word vectors are trained using a convolutional neural network (CNN) model with the focal loss function. According to the experimental results conducted by the dataset from an on-board signaling department, the proposed","PeriodicalId":35396,"journal":{"name":"Shenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Shenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3724/sp.j.1249.2023.05529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
XU Qian 1, , ZHANG Lei 1, , OU Dongxiu 1, , and HE Yunpeng 3 1) Shanghai Key Laboratory of Rail Infrastructure Durability and System Safety, Tongji University, Shanghai 201804, P. R. China 2) College of Transportation Engineering, Tongji University, Shanghai 201804, P. R. China 3) China Railway Siyuan Survey and Design Group Co. Ltd. , Wuhan 430063, Hubei Province, P. R. China Abstract: The on-board equipment of metro communication based train control (CBTC) is facing laborious maintenance problems, and its textual maintenance logs are criticized for having excessively fragmented information, ambiguous semantics and confused categorization, resulting in low classification metrics by traditional textual distributed representation with basic machine learning algorithms. A fault classification method based on bidirectional encoder representations from transformers convolutional neural network (BERT-CNN) with the focal loss function is proposed to establish the relationship model between the 'fault processing and conclusion' and the 'fault phenomena'. The pre-trained bidirectional encoder representations from transformers (BERT) model is finetuned to fully capture the bidirectional semantics and focus on the keywords to produce better word vectors of the 'fault phenomena'. In order to counteract the classification performance degradation brought by data category imbalance, word vectors are trained using a convolutional neural network (CNN) model with the focal loss function. According to the experimental results conducted by the dataset from an on-board signaling department, the proposed