Taocun Yang, Xin Liu, Guohua Li, Ming-rui Dai, Lei Tian, Yan Xie
{"title":"Exploring Multi-Layer Convolutional Neural Networks for Railway Safety Text Classification","authors":"Taocun Yang, Xin Liu, Guohua Li, Ming-rui Dai, Lei Tian, Yan Xie","doi":"10.1109/PIC53636.2021.9687014","DOIUrl":null,"url":null,"abstract":"With the rapid development of China High-Speed Rail, massive text data related to railway safety has been accumulated. When analyzing and understanding this data, classifying railway accident report text is essential and tedious work. Usually, such classification tasks are manually done by experts and workers in the railway safety department. Traditional data mining algorithms have been applied in these tasks to classify the text automatically. However, due to the complexity of the text data, classification algorithms sometimes fail and have insufficient learning ability. Meanwhile, the rise of machine learning enables us to deal with these complex problems effectively. In this paper, we propose an end-to-end multi-layer convolutional neural networks model to classify the railway safety-related text. We update the CNN part of the traditional model by increasing layers and adding a multi-height convolutional kernel. Additionally, we develop a data-preprocessing strategy to obtain the neat input data and reduce the complexity of the task. Experiments show that our proposed method achieves competitive performance and is suitable for railway safety-related text classification problems.","PeriodicalId":297239,"journal":{"name":"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PIC53636.2021.9687014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the rapid development of China High-Speed Rail, massive text data related to railway safety has been accumulated. When analyzing and understanding this data, classifying railway accident report text is essential and tedious work. Usually, such classification tasks are manually done by experts and workers in the railway safety department. Traditional data mining algorithms have been applied in these tasks to classify the text automatically. However, due to the complexity of the text data, classification algorithms sometimes fail and have insufficient learning ability. Meanwhile, the rise of machine learning enables us to deal with these complex problems effectively. In this paper, we propose an end-to-end multi-layer convolutional neural networks model to classify the railway safety-related text. We update the CNN part of the traditional model by increasing layers and adding a multi-height convolutional kernel. Additionally, we develop a data-preprocessing strategy to obtain the neat input data and reduce the complexity of the task. Experiments show that our proposed method achieves competitive performance and is suitable for railway safety-related text classification problems.