Zongluo Zhao, Zhixin Zhao, Qiangqiang Li, Jiaxi Zhuang, Xiaoming Ju
{"title":"Multi-Modal Cross-Attention Learning on Detecting Anomalies of Electrical Equipment","authors":"Zongluo Zhao, Zhixin Zhao, Qiangqiang Li, Jiaxi Zhuang, Xiaoming Ju","doi":"10.1109/prmvia58252.2023.00030","DOIUrl":null,"url":null,"abstract":"Currently, in field of electrical power system, techniques of anomalies detection are constantly innovating. Applications of neural network on processing of patrol images spare analyzers plenty of time, but subject to the relatively low resolution of the object contour under heavy weather, results do not show well on recognition of anomalies in electrical equipment, while multi-modal methods can import more information to the objects detected, thus may improve the success rate of capture. In this paper, we propose a feature-fusion model which uses cross-attention learning method to augment features of the anomalies with text of corresponding description and environment condition. After comparing experiments on self-constructed datasets of images and text, our model has achieved the state of art on multiple metrics. More importantly, it is found that adding additional features to the model can achieve better results through ablation experiments, which shows our model is scalable for a better solution.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/prmvia58252.2023.00030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, in field of electrical power system, techniques of anomalies detection are constantly innovating. Applications of neural network on processing of patrol images spare analyzers plenty of time, but subject to the relatively low resolution of the object contour under heavy weather, results do not show well on recognition of anomalies in electrical equipment, while multi-modal methods can import more information to the objects detected, thus may improve the success rate of capture. In this paper, we propose a feature-fusion model which uses cross-attention learning method to augment features of the anomalies with text of corresponding description and environment condition. After comparing experiments on self-constructed datasets of images and text, our model has achieved the state of art on multiple metrics. More importantly, it is found that adding additional features to the model can achieve better results through ablation experiments, which shows our model is scalable for a better solution.