{"title":"基于几何一致性约束和高斯分布对齐的多源遥感目标检测网络改进","authors":"Yungang Cao, Haibo Cheng, Baikai Sui, Yahui Zeng","doi":"10.1016/j.isprsjprs.2025.07.037","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-source remote sensing object detection, by combining data from different sensors, can comprehensively improve the accuracy and robustness of object detection. However, it faces challenges such as data inconsistency, domain shift, and scarcity of labeled data. Domain adaptation methods can address these challenges by aligning features between the source and target domains, reducing domain shift, and enhancing the model’s generalization ability, thus solving the discrepancies in multi-source data. However, existing domain adaptation object detection methods insufficiently utilize shallow geometric features that are important for geometric consistency, and traditional methods that use adversarial networks for feature alignment often leading to insufficient alignment capability and training instability. To address the insufficient utilization of geometric information in existing methods and considering that shallow features contain abundant geometric information (e.g., points, lines, and surfaces), this paper proposes a shallow feature alignment method based on geometric consistency (GCFA), using shallow features as alignment cues. This method achieves effective feature alignment through partition calculation and weighted loss processing. Furthermore, to tackle the problems of insufficient alignment capability and training instability in the network, we introduce a feature alignment method based on Gaussian distribution (GDFA). This method directly aligns the feature distributions of the source and target domains by leveraging the mean and standard deviation, thereby enhancing the alignment capability of the network. And we can update the network directly through the loss function, without the need for adversarial networks or gradient reversal layers, thus avoiding potential training instability issues. In addition, we design a pseudo-labels refinement module (PLRM) that combines dynamic threshold select and pseudo-labels class weighting to enhance the constraint ability of the model’s unsupervised branch. In order to verify the effectiveness of the method proposed in this paper, we conducted extensive experiments on datasets such as DOTA, DIOR, WHU, and Levir. On the DOTA and DIOR datasets, the proposed method achieves a 3.09 % improvement in mAP50 compared to the best baseline method. On the WHU dataset, it shows a 2.30 % improvement over the best method, and on the Levir and SSDD datasets, the proposed method outperforms the best method by 2.13 %.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"228 ","pages":"Pages 566-581"},"PeriodicalIF":12.2000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved multi-source remote sensing object detection network by geometry consistency constraint and Gaussian distribution alignment\",\"authors\":\"Yungang Cao, Haibo Cheng, Baikai Sui, Yahui Zeng\",\"doi\":\"10.1016/j.isprsjprs.2025.07.037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-source remote sensing object detection, by combining data from different sensors, can comprehensively improve the accuracy and robustness of object detection. However, it faces challenges such as data inconsistency, domain shift, and scarcity of labeled data. Domain adaptation methods can address these challenges by aligning features between the source and target domains, reducing domain shift, and enhancing the model’s generalization ability, thus solving the discrepancies in multi-source data. However, existing domain adaptation object detection methods insufficiently utilize shallow geometric features that are important for geometric consistency, and traditional methods that use adversarial networks for feature alignment often leading to insufficient alignment capability and training instability. To address the insufficient utilization of geometric information in existing methods and considering that shallow features contain abundant geometric information (e.g., points, lines, and surfaces), this paper proposes a shallow feature alignment method based on geometric consistency (GCFA), using shallow features as alignment cues. This method achieves effective feature alignment through partition calculation and weighted loss processing. Furthermore, to tackle the problems of insufficient alignment capability and training instability in the network, we introduce a feature alignment method based on Gaussian distribution (GDFA). This method directly aligns the feature distributions of the source and target domains by leveraging the mean and standard deviation, thereby enhancing the alignment capability of the network. And we can update the network directly through the loss function, without the need for adversarial networks or gradient reversal layers, thus avoiding potential training instability issues. In addition, we design a pseudo-labels refinement module (PLRM) that combines dynamic threshold select and pseudo-labels class weighting to enhance the constraint ability of the model’s unsupervised branch. In order to verify the effectiveness of the method proposed in this paper, we conducted extensive experiments on datasets such as DOTA, DIOR, WHU, and Levir. On the DOTA and DIOR datasets, the proposed method achieves a 3.09 % improvement in mAP50 compared to the best baseline method. On the WHU dataset, it shows a 2.30 % improvement over the best method, and on the Levir and SSDD datasets, the proposed method outperforms the best method by 2.13 %.</div></div>\",\"PeriodicalId\":50269,\"journal\":{\"name\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"volume\":\"228 \",\"pages\":\"Pages 566-581\"},\"PeriodicalIF\":12.2000,\"publicationDate\":\"2025-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0924271625003077\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOGRAPHY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625003077","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
Improved multi-source remote sensing object detection network by geometry consistency constraint and Gaussian distribution alignment
Multi-source remote sensing object detection, by combining data from different sensors, can comprehensively improve the accuracy and robustness of object detection. However, it faces challenges such as data inconsistency, domain shift, and scarcity of labeled data. Domain adaptation methods can address these challenges by aligning features between the source and target domains, reducing domain shift, and enhancing the model’s generalization ability, thus solving the discrepancies in multi-source data. However, existing domain adaptation object detection methods insufficiently utilize shallow geometric features that are important for geometric consistency, and traditional methods that use adversarial networks for feature alignment often leading to insufficient alignment capability and training instability. To address the insufficient utilization of geometric information in existing methods and considering that shallow features contain abundant geometric information (e.g., points, lines, and surfaces), this paper proposes a shallow feature alignment method based on geometric consistency (GCFA), using shallow features as alignment cues. This method achieves effective feature alignment through partition calculation and weighted loss processing. Furthermore, to tackle the problems of insufficient alignment capability and training instability in the network, we introduce a feature alignment method based on Gaussian distribution (GDFA). This method directly aligns the feature distributions of the source and target domains by leveraging the mean and standard deviation, thereby enhancing the alignment capability of the network. And we can update the network directly through the loss function, without the need for adversarial networks or gradient reversal layers, thus avoiding potential training instability issues. In addition, we design a pseudo-labels refinement module (PLRM) that combines dynamic threshold select and pseudo-labels class weighting to enhance the constraint ability of the model’s unsupervised branch. In order to verify the effectiveness of the method proposed in this paper, we conducted extensive experiments on datasets such as DOTA, DIOR, WHU, and Levir. On the DOTA and DIOR datasets, the proposed method achieves a 3.09 % improvement in mAP50 compared to the best baseline method. On the WHU dataset, it shows a 2.30 % improvement over the best method, and on the Levir and SSDD datasets, the proposed method outperforms the best method by 2.13 %.
期刊介绍:
The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive.
P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields.
In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.