基于增强BERT-LDA的飞机EWIS安全风险主题识别方法

IF 5.7 2区 计算机科学 Q1 ENGINEERING, AEROSPACE
Yiqin Sang;Hongjuan Ge;Huang Li;Cong Gao;Wenqi Liu
{"title":"基于增强BERT-LDA的飞机EWIS安全风险主题识别方法","authors":"Yiqin Sang;Hongjuan Ge;Huang Li;Cong Gao;Wenqi Liu","doi":"10.1109/TAES.2025.3533463","DOIUrl":null,"url":null,"abstract":"To identify critical topics of safety risk in the aircraft electrical wiring interconnection system (EWIS), this article proposes an unsupervised text topic recognition method, leveraging the event synopses from the aviation safety reporting system. The approach involves utilizing latent Dirichlet allocation (LDA) for constructing topic feature vectors, coupled with the bidirectional encoder representations from transformers (BERT) to generate semantic feature vectors, thereby obtaining EWIS fused feature vectors. Addressing challenges, such as information loss during transfer, random clustering center determination, and disunity of optimization objectives during the feature extraction and cluster assignment processes of the fused vectors, an enhanced BERT-LDA model based on the deep embedded clustering (DEC) algorithm is proposed. The model incorporates clustering layers after the encoders using DEC to mitigate information loss and explores optimal clustering center determination through Kullback–Leibler divergence parameter adjustment. It also involves iterative training of the encoders and clustering layers through stochastic gradient descent to harmonize the optimization objectives of feature extraction and clustering. Comparative studies are conducted to demonstrate the superiority of the proposed method, evaluating various word vector embedding methods, state-of-the-art methods, BERT, and variant models, before and after DEC improvement. The results indicate that, after applying DEC, the performance of evaluation metrics for all models improved significantly. Compared with other methods, the enhanced BERT-LDA methods exhibit superior compactness within the same topic cluster and greater separation among different topic clusters in EWIS safety risk topic recognition.","PeriodicalId":13157,"journal":{"name":"IEEE Transactions on Aerospace and Electronic Systems","volume":"61 3","pages":"7153-7164"},"PeriodicalIF":5.7000,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Aircraft EWIS Safety Risk Topic Recognition Method Based on Enhanced BERT-LDA\",\"authors\":\"Yiqin Sang;Hongjuan Ge;Huang Li;Cong Gao;Wenqi Liu\",\"doi\":\"10.1109/TAES.2025.3533463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To identify critical topics of safety risk in the aircraft electrical wiring interconnection system (EWIS), this article proposes an unsupervised text topic recognition method, leveraging the event synopses from the aviation safety reporting system. The approach involves utilizing latent Dirichlet allocation (LDA) for constructing topic feature vectors, coupled with the bidirectional encoder representations from transformers (BERT) to generate semantic feature vectors, thereby obtaining EWIS fused feature vectors. Addressing challenges, such as information loss during transfer, random clustering center determination, and disunity of optimization objectives during the feature extraction and cluster assignment processes of the fused vectors, an enhanced BERT-LDA model based on the deep embedded clustering (DEC) algorithm is proposed. The model incorporates clustering layers after the encoders using DEC to mitigate information loss and explores optimal clustering center determination through Kullback–Leibler divergence parameter adjustment. It also involves iterative training of the encoders and clustering layers through stochastic gradient descent to harmonize the optimization objectives of feature extraction and clustering. Comparative studies are conducted to demonstrate the superiority of the proposed method, evaluating various word vector embedding methods, state-of-the-art methods, BERT, and variant models, before and after DEC improvement. The results indicate that, after applying DEC, the performance of evaluation metrics for all models improved significantly. Compared with other methods, the enhanced BERT-LDA methods exhibit superior compactness within the same topic cluster and greater separation among different topic clusters in EWIS safety risk topic recognition.\",\"PeriodicalId\":13157,\"journal\":{\"name\":\"IEEE Transactions on Aerospace and Electronic Systems\",\"volume\":\"61 3\",\"pages\":\"7153-7164\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-01-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Aerospace and Electronic Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10852374/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, AEROSPACE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Aerospace and Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10852374/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0

摘要

为了识别飞机电气布线互连系统(EWIS)中的安全风险关键主题,本文提出了一种利用航空安全报告系统中的事件概要的无监督文本主题识别方法。该方法利用latent Dirichlet allocation (LDA)构造主题特征向量,结合双向编码器表示(BERT)生成语义特征向量,从而获得EWIS融合特征向量。针对融合向量特征提取和聚类分配过程中存在的信息丢失、聚类中心随机确定、优化目标不统一等问题,提出了一种基于深度嵌入聚类(DEC)算法的增强BERT-LDA模型。该模型在编码器使用DEC后加入聚类层以减轻信息丢失,并通过Kullback-Leibler散度参数调整探索最优聚类中心确定。通过随机梯度下降对编码器和聚类层进行迭代训练,使特征提取和聚类的优化目标协调一致。通过对比研究,对DEC改进前后的各种词向量嵌入方法、最先进的方法、BERT和变体模型进行了评估,以证明所提方法的优越性。结果表明,应用DEC后,所有模型的评价指标的性能都有显著提高。与其他方法相比,改进的BERT-LDA方法在EWIS安全风险主题识别中,在同一主题簇内具有更好的紧凑性,在不同主题簇之间具有更好的分离性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Aircraft EWIS Safety Risk Topic Recognition Method Based on Enhanced BERT-LDA
To identify critical topics of safety risk in the aircraft electrical wiring interconnection system (EWIS), this article proposes an unsupervised text topic recognition method, leveraging the event synopses from the aviation safety reporting system. The approach involves utilizing latent Dirichlet allocation (LDA) for constructing topic feature vectors, coupled with the bidirectional encoder representations from transformers (BERT) to generate semantic feature vectors, thereby obtaining EWIS fused feature vectors. Addressing challenges, such as information loss during transfer, random clustering center determination, and disunity of optimization objectives during the feature extraction and cluster assignment processes of the fused vectors, an enhanced BERT-LDA model based on the deep embedded clustering (DEC) algorithm is proposed. The model incorporates clustering layers after the encoders using DEC to mitigate information loss and explores optimal clustering center determination through Kullback–Leibler divergence parameter adjustment. It also involves iterative training of the encoders and clustering layers through stochastic gradient descent to harmonize the optimization objectives of feature extraction and clustering. Comparative studies are conducted to demonstrate the superiority of the proposed method, evaluating various word vector embedding methods, state-of-the-art methods, BERT, and variant models, before and after DEC improvement. The results indicate that, after applying DEC, the performance of evaluation metrics for all models improved significantly. Compared with other methods, the enhanced BERT-LDA methods exhibit superior compactness within the same topic cluster and greater separation among different topic clusters in EWIS safety risk topic recognition.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.80
自引率
13.60%
发文量
433
审稿时长
8.7 months
期刊介绍: IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environment. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信