{"title":"基于VQ-VAE的太赫兹安检图像识别算法","authors":"Xinyu Zha, Yechao Bai","doi":"10.1109/ICSPS58776.2022.00074","DOIUrl":null,"url":null,"abstract":"Terahertz imaging technology has been used in the field of security inspection due to its excellent perspective ability. However, the terahertz security inspection images have the problem of poor image quality and cannot cover all types of dangerous objects. In this paper, a terahertz image recognition algorithm based on Vector Quantized-Variational Autoencoder (VQ-VAE) is proposed. The algorithm realizes image data enhancement through dynamically learned prior, and discrete potential space is used to efficiently identify abnormal objects outside the training set. In order to improve the network recognition performance, the attention CBAM module is introduced to increase the attention ratio of the key target area. The algorithm embeds Center Loss metric learning module into the network frame to increase the inter-class distance and reduce the intra-class distance, thereby improving the clustering and generalization performance. The experimental verification is carried out on the actual collected terahertz security image data set. The experimental results show that the improved and optimized VQ-VAE image recognition network is significantly better than the convolutional neural network in recognition accuracy and has better generalization ability.","PeriodicalId":330562,"journal":{"name":"2022 14th International Conference on Signal Processing Systems (ICSPS)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A VQ-VAE Based Recognition Algorithm for Terahertz Security Inspection Images\",\"authors\":\"Xinyu Zha, Yechao Bai\",\"doi\":\"10.1109/ICSPS58776.2022.00074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Terahertz imaging technology has been used in the field of security inspection due to its excellent perspective ability. However, the terahertz security inspection images have the problem of poor image quality and cannot cover all types of dangerous objects. In this paper, a terahertz image recognition algorithm based on Vector Quantized-Variational Autoencoder (VQ-VAE) is proposed. The algorithm realizes image data enhancement through dynamically learned prior, and discrete potential space is used to efficiently identify abnormal objects outside the training set. In order to improve the network recognition performance, the attention CBAM module is introduced to increase the attention ratio of the key target area. The algorithm embeds Center Loss metric learning module into the network frame to increase the inter-class distance and reduce the intra-class distance, thereby improving the clustering and generalization performance. The experimental verification is carried out on the actual collected terahertz security image data set. The experimental results show that the improved and optimized VQ-VAE image recognition network is significantly better than the convolutional neural network in recognition accuracy and has better generalization ability.\",\"PeriodicalId\":330562,\"journal\":{\"name\":\"2022 14th International Conference on Signal Processing Systems (ICSPS)\",\"volume\":\"88 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Signal Processing Systems (ICSPS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSPS58776.2022.00074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Signal Processing Systems (ICSPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPS58776.2022.00074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A VQ-VAE Based Recognition Algorithm for Terahertz Security Inspection Images
Terahertz imaging technology has been used in the field of security inspection due to its excellent perspective ability. However, the terahertz security inspection images have the problem of poor image quality and cannot cover all types of dangerous objects. In this paper, a terahertz image recognition algorithm based on Vector Quantized-Variational Autoencoder (VQ-VAE) is proposed. The algorithm realizes image data enhancement through dynamically learned prior, and discrete potential space is used to efficiently identify abnormal objects outside the training set. In order to improve the network recognition performance, the attention CBAM module is introduced to increase the attention ratio of the key target area. The algorithm embeds Center Loss metric learning module into the network frame to increase the inter-class distance and reduce the intra-class distance, thereby improving the clustering and generalization performance. The experimental verification is carried out on the actual collected terahertz security image data set. The experimental results show that the improved and optimized VQ-VAE image recognition network is significantly better than the convolutional neural network in recognition accuracy and has better generalization ability.