Yiqin Sang;Hongjuan Ge;Huang Li;Cong Gao;Wenqi Liu
{"title":"基于增强BERT-LDA的飞机EWIS安全风险主题识别方法","authors":"Yiqin Sang;Hongjuan Ge;Huang Li;Cong Gao;Wenqi Liu","doi":"10.1109/TAES.2025.3533463","DOIUrl":null,"url":null,"abstract":"To identify critical topics of safety risk in the aircraft electrical wiring interconnection system (EWIS), this article proposes an unsupervised text topic recognition method, leveraging the event synopses from the aviation safety reporting system. The approach involves utilizing latent Dirichlet allocation (LDA) for constructing topic feature vectors, coupled with the bidirectional encoder representations from transformers (BERT) to generate semantic feature vectors, thereby obtaining EWIS fused feature vectors. Addressing challenges, such as information loss during transfer, random clustering center determination, and disunity of optimization objectives during the feature extraction and cluster assignment processes of the fused vectors, an enhanced BERT-LDA model based on the deep embedded clustering (DEC) algorithm is proposed. The model incorporates clustering layers after the encoders using DEC to mitigate information loss and explores optimal clustering center determination through Kullback–Leibler divergence parameter adjustment. It also involves iterative training of the encoders and clustering layers through stochastic gradient descent to harmonize the optimization objectives of feature extraction and clustering. Comparative studies are conducted to demonstrate the superiority of the proposed method, evaluating various word vector embedding methods, state-of-the-art methods, BERT, and variant models, before and after DEC improvement. The results indicate that, after applying DEC, the performance of evaluation metrics for all models improved significantly. Compared with other methods, the enhanced BERT-LDA methods exhibit superior compactness within the same topic cluster and greater separation among different topic clusters in EWIS safety risk topic recognition.","PeriodicalId":13157,"journal":{"name":"IEEE Transactions on Aerospace and Electronic Systems","volume":"61 3","pages":"7153-7164"},"PeriodicalIF":5.7000,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Aircraft EWIS Safety Risk Topic Recognition Method Based on Enhanced BERT-LDA\",\"authors\":\"Yiqin Sang;Hongjuan Ge;Huang Li;Cong Gao;Wenqi Liu\",\"doi\":\"10.1109/TAES.2025.3533463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To identify critical topics of safety risk in the aircraft electrical wiring interconnection system (EWIS), this article proposes an unsupervised text topic recognition method, leveraging the event synopses from the aviation safety reporting system. The approach involves utilizing latent Dirichlet allocation (LDA) for constructing topic feature vectors, coupled with the bidirectional encoder representations from transformers (BERT) to generate semantic feature vectors, thereby obtaining EWIS fused feature vectors. Addressing challenges, such as information loss during transfer, random clustering center determination, and disunity of optimization objectives during the feature extraction and cluster assignment processes of the fused vectors, an enhanced BERT-LDA model based on the deep embedded clustering (DEC) algorithm is proposed. The model incorporates clustering layers after the encoders using DEC to mitigate information loss and explores optimal clustering center determination through Kullback–Leibler divergence parameter adjustment. It also involves iterative training of the encoders and clustering layers through stochastic gradient descent to harmonize the optimization objectives of feature extraction and clustering. Comparative studies are conducted to demonstrate the superiority of the proposed method, evaluating various word vector embedding methods, state-of-the-art methods, BERT, and variant models, before and after DEC improvement. The results indicate that, after applying DEC, the performance of evaluation metrics for all models improved significantly. Compared with other methods, the enhanced BERT-LDA methods exhibit superior compactness within the same topic cluster and greater separation among different topic clusters in EWIS safety risk topic recognition.\",\"PeriodicalId\":13157,\"journal\":{\"name\":\"IEEE Transactions on Aerospace and Electronic Systems\",\"volume\":\"61 3\",\"pages\":\"7153-7164\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-01-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Aerospace and Electronic Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10852374/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, AEROSPACE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Aerospace and Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10852374/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
Aircraft EWIS Safety Risk Topic Recognition Method Based on Enhanced BERT-LDA
To identify critical topics of safety risk in the aircraft electrical wiring interconnection system (EWIS), this article proposes an unsupervised text topic recognition method, leveraging the event synopses from the aviation safety reporting system. The approach involves utilizing latent Dirichlet allocation (LDA) for constructing topic feature vectors, coupled with the bidirectional encoder representations from transformers (BERT) to generate semantic feature vectors, thereby obtaining EWIS fused feature vectors. Addressing challenges, such as information loss during transfer, random clustering center determination, and disunity of optimization objectives during the feature extraction and cluster assignment processes of the fused vectors, an enhanced BERT-LDA model based on the deep embedded clustering (DEC) algorithm is proposed. The model incorporates clustering layers after the encoders using DEC to mitigate information loss and explores optimal clustering center determination through Kullback–Leibler divergence parameter adjustment. It also involves iterative training of the encoders and clustering layers through stochastic gradient descent to harmonize the optimization objectives of feature extraction and clustering. Comparative studies are conducted to demonstrate the superiority of the proposed method, evaluating various word vector embedding methods, state-of-the-art methods, BERT, and variant models, before and after DEC improvement. The results indicate that, after applying DEC, the performance of evaluation metrics for all models improved significantly. Compared with other methods, the enhanced BERT-LDA methods exhibit superior compactness within the same topic cluster and greater separation among different topic clusters in EWIS safety risk topic recognition.
期刊介绍:
IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environment. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.