Bin Wang;Haigang Sui;Guorui Ma;Yuan Zhou;Mingting Zhou
{"title":"GMODet:基于区域感知和语义-空间递进交互的光学遥感图像地面运动目标实时检测器","authors":"Bin Wang;Haigang Sui;Guorui Ma;Yuan Zhou;Mingting Zhou","doi":"10.1109/TGRS.2025.3526799","DOIUrl":null,"url":null,"abstract":"Military conflicts have a significant impact on national security and the ecological environment. Effective detection methods for ground-moving objects in complex scenarios can be further utilized to assess damage and provide recommendations for security and environmental restoration. Current ground-moving object detection in optical remote sensing imagery struggles with balancing detection accuracy and real-time performance, hindering timely threat assessment. To address this, the study proposes ground-moving object detector (GMODet), a real-time detection method incorporating region awareness and semantic-spatial interaction to enhance the detection of partially occluded and fine-grained objects in complex environments. The framework includes three modules: the region awareness module (RAW), cross-scale context-aware feature aggregator (CCFA), and semantic-spatial progressive interaction module (SPIM), focusing on extracting discriminative features for contextual, multiscale, and semantic-spatial information. A new dataset, ground-based moving object dataset (GMOD), is constructed with four object types and high scene complexity, alongside experiments on the publicly available military vehicle remote sensing dataset (MVRSD). GMODet achieves the state-of-the-art performance, with mAP50, mAP75, and mAP scores of 65.5%, 48.5%, and 42.3% on the GMOD, outperforming the second-best results by 1.9%, 5.1%, and 1.5%, respectively. On the MVRSD, it achieves mAP50, mAP75, and mAP scores of 88.2%, 75.2%, and 61.7%, respectively. Notably, with an inference time of just 25 s on large-scale images (<inline-formula> <tex-math>$9152\\times 9152$ </tex-math></inline-formula> pixels), GMODet showcases outstanding accuracy, speed, robustness, and generalization in ground-moving object detection.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-23"},"PeriodicalIF":8.6000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GMODet: A Real-Time Detector for Ground-Moving Objects in Optical Remote Sensing Images With Regional Awareness and Semantic–Spatial Progressive Interaction\",\"authors\":\"Bin Wang;Haigang Sui;Guorui Ma;Yuan Zhou;Mingting Zhou\",\"doi\":\"10.1109/TGRS.2025.3526799\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Military conflicts have a significant impact on national security and the ecological environment. Effective detection methods for ground-moving objects in complex scenarios can be further utilized to assess damage and provide recommendations for security and environmental restoration. Current ground-moving object detection in optical remote sensing imagery struggles with balancing detection accuracy and real-time performance, hindering timely threat assessment. To address this, the study proposes ground-moving object detector (GMODet), a real-time detection method incorporating region awareness and semantic-spatial interaction to enhance the detection of partially occluded and fine-grained objects in complex environments. The framework includes three modules: the region awareness module (RAW), cross-scale context-aware feature aggregator (CCFA), and semantic-spatial progressive interaction module (SPIM), focusing on extracting discriminative features for contextual, multiscale, and semantic-spatial information. A new dataset, ground-based moving object dataset (GMOD), is constructed with four object types and high scene complexity, alongside experiments on the publicly available military vehicle remote sensing dataset (MVRSD). GMODet achieves the state-of-the-art performance, with mAP50, mAP75, and mAP scores of 65.5%, 48.5%, and 42.3% on the GMOD, outperforming the second-best results by 1.9%, 5.1%, and 1.5%, respectively. On the MVRSD, it achieves mAP50, mAP75, and mAP scores of 88.2%, 75.2%, and 61.7%, respectively. Notably, with an inference time of just 25 s on large-scale images (<inline-formula> <tex-math>$9152\\\\times 9152$ </tex-math></inline-formula> pixels), GMODet showcases outstanding accuracy, speed, robustness, and generalization in ground-moving object detection.\",\"PeriodicalId\":13213,\"journal\":{\"name\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"volume\":\"63 \",\"pages\":\"1-23\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2025-01-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10830566/\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10830566/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
GMODet: A Real-Time Detector for Ground-Moving Objects in Optical Remote Sensing Images With Regional Awareness and Semantic–Spatial Progressive Interaction
Military conflicts have a significant impact on national security and the ecological environment. Effective detection methods for ground-moving objects in complex scenarios can be further utilized to assess damage and provide recommendations for security and environmental restoration. Current ground-moving object detection in optical remote sensing imagery struggles with balancing detection accuracy and real-time performance, hindering timely threat assessment. To address this, the study proposes ground-moving object detector (GMODet), a real-time detection method incorporating region awareness and semantic-spatial interaction to enhance the detection of partially occluded and fine-grained objects in complex environments. The framework includes three modules: the region awareness module (RAW), cross-scale context-aware feature aggregator (CCFA), and semantic-spatial progressive interaction module (SPIM), focusing on extracting discriminative features for contextual, multiscale, and semantic-spatial information. A new dataset, ground-based moving object dataset (GMOD), is constructed with four object types and high scene complexity, alongside experiments on the publicly available military vehicle remote sensing dataset (MVRSD). GMODet achieves the state-of-the-art performance, with mAP50, mAP75, and mAP scores of 65.5%, 48.5%, and 42.3% on the GMOD, outperforming the second-best results by 1.9%, 5.1%, and 1.5%, respectively. On the MVRSD, it achieves mAP50, mAP75, and mAP scores of 88.2%, 75.2%, and 61.7%, respectively. Notably, with an inference time of just 25 s on large-scale images ($9152\times 9152$ pixels), GMODet showcases outstanding accuracy, speed, robustness, and generalization in ground-moving object detection.
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.