{"title":"SMPISD-MTPNet:利用多任务感知网络进行场景语义先验辅助红外船舶探测","authors":"Chen Hu;Xiaogang Dong;Yian Huang;Lele Wang;Liang Xu;Tian Pu;Zhenming Peng","doi":"10.1109/TGRS.2024.3516879","DOIUrl":null,"url":null,"abstract":"Infrared ship detection (IRSD) is crucial for numerous applications but faces challenges, such as small targets and complex backgrounds, resulting in misdetections and false alarms. In order to address these challenges, we propose the scene semantic prior-assisted infrared ship detection using multitask perception network (SMPISD-MTPNet). This network employs multitask perception: one task is to predict targets, and the other focuses on scene perception to suppress false alarms caused by background interference. To highlight dim and small targets, we use the scene semantic extractor (SSE) to guide the network using features extracted based on expert knowledge and the gradient-based module to enhance the edge and point features. We apply data augmentation to the networks and employ a training trick called soft fine-tuning to improve the network’s generalization and suppress the distortion caused by the augmentation process. Due to the unavailability of datasets with appropriate scene labels for scene perception, we have developed a new dataset called the infrared ship dataset with scene segmentation (IRSDSS). In addition, we have enhanced an existing dataset by adding scene masks and created the enhanced infrared ship detection dataset (EISDD). Our evaluations using both IRSDSS and EISDD demonstrate that SMPISD-MTPNet exceeds contemporary state-of-the-art (SOTA) methods in accuracy. The source code and dataset for this research can be available at: \n<uri>https://github.com/greekinRoma/SMPISD-MTPNet</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-14"},"PeriodicalIF":8.6000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multitask Perception Networks\",\"authors\":\"Chen Hu;Xiaogang Dong;Yian Huang;Lele Wang;Liang Xu;Tian Pu;Zhenming Peng\",\"doi\":\"10.1109/TGRS.2024.3516879\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Infrared ship detection (IRSD) is crucial for numerous applications but faces challenges, such as small targets and complex backgrounds, resulting in misdetections and false alarms. In order to address these challenges, we propose the scene semantic prior-assisted infrared ship detection using multitask perception network (SMPISD-MTPNet). This network employs multitask perception: one task is to predict targets, and the other focuses on scene perception to suppress false alarms caused by background interference. To highlight dim and small targets, we use the scene semantic extractor (SSE) to guide the network using features extracted based on expert knowledge and the gradient-based module to enhance the edge and point features. We apply data augmentation to the networks and employ a training trick called soft fine-tuning to improve the network’s generalization and suppress the distortion caused by the augmentation process. Due to the unavailability of datasets with appropriate scene labels for scene perception, we have developed a new dataset called the infrared ship dataset with scene segmentation (IRSDSS). In addition, we have enhanced an existing dataset by adding scene masks and created the enhanced infrared ship detection dataset (EISDD). Our evaluations using both IRSDSS and EISDD demonstrate that SMPISD-MTPNet exceeds contemporary state-of-the-art (SOTA) methods in accuracy. The source code and dataset for this research can be available at: \\n<uri>https://github.com/greekinRoma/SMPISD-MTPNet</uri>\\n.\",\"PeriodicalId\":13213,\"journal\":{\"name\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"volume\":\"63 \",\"pages\":\"1-14\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2024-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10802996/\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10802996/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multitask Perception Networks
Infrared ship detection (IRSD) is crucial for numerous applications but faces challenges, such as small targets and complex backgrounds, resulting in misdetections and false alarms. In order to address these challenges, we propose the scene semantic prior-assisted infrared ship detection using multitask perception network (SMPISD-MTPNet). This network employs multitask perception: one task is to predict targets, and the other focuses on scene perception to suppress false alarms caused by background interference. To highlight dim and small targets, we use the scene semantic extractor (SSE) to guide the network using features extracted based on expert knowledge and the gradient-based module to enhance the edge and point features. We apply data augmentation to the networks and employ a training trick called soft fine-tuning to improve the network’s generalization and suppress the distortion caused by the augmentation process. Due to the unavailability of datasets with appropriate scene labels for scene perception, we have developed a new dataset called the infrared ship dataset with scene segmentation (IRSDSS). In addition, we have enhanced an existing dataset by adding scene masks and created the enhanced infrared ship detection dataset (EISDD). Our evaluations using both IRSDSS and EISDD demonstrate that SMPISD-MTPNet exceeds contemporary state-of-the-art (SOTA) methods in accuracy. The source code and dataset for this research can be available at:
https://github.com/greekinRoma/SMPISD-MTPNet
.
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.