Zhiyong Wang, Xuefu Xiang, Kan Zeng, Zhenyu Zhang, Yanan Li, Dengpan Song
{"title":"Infrared small target detection based on the combination of single image super-resolution reconstruction and YOLOX","authors":"Zhiyong Wang, Xuefu Xiang, Kan Zeng, Zhenyu Zhang, Yanan Li, Dengpan Song","doi":"10.1145/3590003.3590104","DOIUrl":null,"url":null,"abstract":"For the infrared search and tracking system, it is necessary to increase the ability to detect small infrared targets against complex backgrounds. YOLOX is a high-performance detector, but its detection performance is constrained when it uses data from low-resolution infrared images with small targets. However, occasionally design constraints and budgetary restraints will prevent the optical system and sensor resolution from being increased enough to improve image quality. Real-ESRGAN is used to solve this issue by reconstructing a high-resolution infrared image from its low-resolution counterpart, which will be used as YOLOX-S's input. Also, the YOLOX-S training strategy is modified further to make it appropriate for the detection of infrared small targets, including the Mosaic and MixUp data augmentation and the size of ground-truth. The average precision achieved by the suggested method in this work increases from 63.70% to 77.19%, which shows a considerable improvement in infrared small target detection when compared with the original model by inputting original images.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3590003.3590104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
For the infrared search and tracking system, it is necessary to increase the ability to detect small infrared targets against complex backgrounds. YOLOX is a high-performance detector, but its detection performance is constrained when it uses data from low-resolution infrared images with small targets. However, occasionally design constraints and budgetary restraints will prevent the optical system and sensor resolution from being increased enough to improve image quality. Real-ESRGAN is used to solve this issue by reconstructing a high-resolution infrared image from its low-resolution counterpart, which will be used as YOLOX-S's input. Also, the YOLOX-S training strategy is modified further to make it appropriate for the detection of infrared small targets, including the Mosaic and MixUp data augmentation and the size of ground-truth. The average precision achieved by the suggested method in this work increases from 63.70% to 77.19%, which shows a considerable improvement in infrared small target detection when compared with the original model by inputting original images.