Cross-Temporal Remote Sensing Image Change Captioning: A Manifold Mapping and Bayesian Diffusion Approach for Land Use Monitoring

IF 4.7 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Qingshan Bai;Xiaohua Wang
{"title":"Cross-Temporal Remote Sensing Image Change Captioning: A Manifold Mapping and Bayesian Diffusion Approach for Land Use Monitoring","authors":"Qingshan Bai;Xiaohua Wang","doi":"10.1109/JSTARS.2025.3575807","DOIUrl":null,"url":null,"abstract":"This study proposes a cross-temporal remote sensing image change captioning (RSICC) model named CTM, which is constructed based on manifold mapping and Bayesian diffusion techniques. The primary objective of CTM is to enhance the accuracy and robustness of captioning changes in multitemporal remote sensing images (RSIs). The model first employs manifold mapping to model illumination variations, reducing the impact of seasonal and lighting factors on image consistency. Subsequently, Bayesian diffusion is introduced to improve the modeling capability of cross-temporal image changes, enhancing robustness against noise and pseudo-changes. In addition, a dual-layer multicoding module is adopted to strengthen temporal feature representation, improving the perception of change regions. Finally, a difference enhancement and dual-attention based image-text captioning strategy is proposed to optimize feature selection and enhance the accuracy and detail of textual descriptions. Experimental results demonstrate that CTM exhibits greater robustness in handling long-span RSIs, effectively mitigating pseudo-changes caused by illumination and seasonal variations. On the LEVIR-CC dataset, CTM achieves a CIDEr score of 138.78, outperforming the best existing method by 7.38 points. On the WHU-CDC dataset, CTM achieves the highest performance in BLEU and METEOR metrics, with a CIDEr score of 153.29, showcasing its outstanding performance in RSICC tasks. Furthermore, visual analysis indicates that CTM accurately captures real change regions while significantly suppressing pseudo-changes, maintaining high descriptive accuracy even in complex environments. This study provides an efficient and precise solution for applications such as land use monitoring, environmental monitoring, and disaster response.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"14406-14415"},"PeriodicalIF":4.7000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11021286","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11021286/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

This study proposes a cross-temporal remote sensing image change captioning (RSICC) model named CTM, which is constructed based on manifold mapping and Bayesian diffusion techniques. The primary objective of CTM is to enhance the accuracy and robustness of captioning changes in multitemporal remote sensing images (RSIs). The model first employs manifold mapping to model illumination variations, reducing the impact of seasonal and lighting factors on image consistency. Subsequently, Bayesian diffusion is introduced to improve the modeling capability of cross-temporal image changes, enhancing robustness against noise and pseudo-changes. In addition, a dual-layer multicoding module is adopted to strengthen temporal feature representation, improving the perception of change regions. Finally, a difference enhancement and dual-attention based image-text captioning strategy is proposed to optimize feature selection and enhance the accuracy and detail of textual descriptions. Experimental results demonstrate that CTM exhibits greater robustness in handling long-span RSIs, effectively mitigating pseudo-changes caused by illumination and seasonal variations. On the LEVIR-CC dataset, CTM achieves a CIDEr score of 138.78, outperforming the best existing method by 7.38 points. On the WHU-CDC dataset, CTM achieves the highest performance in BLEU and METEOR metrics, with a CIDEr score of 153.29, showcasing its outstanding performance in RSICC tasks. Furthermore, visual analysis indicates that CTM accurately captures real change regions while significantly suppressing pseudo-changes, maintaining high descriptive accuracy even in complex environments. This study provides an efficient and precise solution for applications such as land use monitoring, environmental monitoring, and disaster response.
土地利用监测的流形映射与贝叶斯扩散方法
本文提出了一种基于流形映射和贝叶斯扩散技术构建的遥感影像变化字幕(RSICC)模型CTM。CTM的主要目标是提高多时相遥感图像标题变化的准确性和鲁棒性。该模型首先采用流形映射来模拟光照变化,减少了季节和光照因素对图像一致性的影响。随后,引入贝叶斯扩散来提高跨时间图像变化的建模能力,增强对噪声和伪变化的鲁棒性。此外,采用双层多编码模块加强时序特征表示,提高对变化区域的感知。最后,提出了一种基于差分增强和双关注的图像-文本字幕策略,以优化特征选择,提高文本描述的准确性和细节性。实验结果表明,CTM在处理大跨度rsi时表现出更强的鲁棒性,有效地缓解了光照和季节变化引起的伪变化。在LEVIR-CC数据集上,CTM的CIDEr得分为138.78,比现有的最佳方法高出7.38分。在WHU-CDC数据集上,CTM在BLEU和METEOR指标上的表现最高,CIDEr得分为153.29,在RSICC任务上表现出色。此外,视觉分析表明CTM能准确捕获真实变化区域,同时显著抑制伪变化,即使在复杂环境中也能保持较高的描述精度。该研究为土地利用监测、环境监测和灾害响应等应用提供了高效、精确的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.30
自引率
10.90%
发文量
563
审稿时长
4.7 months
期刊介绍: The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信