CLFDA：用于细粒度土地覆盖分类的连续低频分解架构

IF 4.4

IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society Pub Date : 2025-07-15 DOI:10.1109/LGRS.2025.3589242

Dongyang Hou;Junwu Xiang;Li Lei;Wenmin Qiu;Mengdi Zhao;Yingjun Luo

{"title":"CLFDA：用于细粒度土地覆盖分类的连续低频分解架构","authors":"Dongyang Hou;Junwu Xiang;Li Lei;Wenmin Qiu;Mengdi Zhao;Yingjun Luo","doi":"10.1109/LGRS.2025.3589242","DOIUrl":null,"url":null,"abstract":"Fine-grained land cover classification from high-resolution remote sensing imagery plays a vital role in urban and environmental monitoring. While the existing spatial-domain-based approaches achieve notable progress, their performance in complex scenarios remains constrained by insufficient modeling of characteristics. This letter proposes the continuous low-frequency decomposition architecture (CLFDA) to address insufficient cross-domain modeling of multiscale frequency characteristics in current methods. The architecture introduces frequency-domain features through continuous low-frequency decomposition, where each frequency decomposition and enhancement (FDE) module employ discrete wavelet transform (DWT) to separate spatial and low-frequency features into low-frequency and high-frequency subbands. Low-frequency features feedback into the encoder for global context, while high-frequency features are routed to the decoder via attention mechanisms for detail refinement, enabling bidirectional spatial–frequency fusion. By integrating convolutional neural networks (CNNs), vision transformer, and mamba backbones, our CLFDA achieves 2.0% and 3.46% average mIoU improvements on the GID-15 and the FUSU datasets, respectively. These consistent performance gains across heterogeneous backbones demonstrate the effectiveness and generalizability of our CLFDA in modeling frequency-domain features. The code is at <uri>https://github.com/GeoRSAI/CLFDA</uri>","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CLFDA: Continuous Low-Frequency Decomposition Architecture for Fine-Grained Land Cover Classification\",\"authors\":\"Dongyang Hou;Junwu Xiang;Li Lei;Wenmin Qiu;Mengdi Zhao;Yingjun Luo\",\"doi\":\"10.1109/LGRS.2025.3589242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fine-grained land cover classification from high-resolution remote sensing imagery plays a vital role in urban and environmental monitoring. While the existing spatial-domain-based approaches achieve notable progress, their performance in complex scenarios remains constrained by insufficient modeling of characteristics. This letter proposes the continuous low-frequency decomposition architecture (CLFDA) to address insufficient cross-domain modeling of multiscale frequency characteristics in current methods. The architecture introduces frequency-domain features through continuous low-frequency decomposition, where each frequency decomposition and enhancement (FDE) module employ discrete wavelet transform (DWT) to separate spatial and low-frequency features into low-frequency and high-frequency subbands. Low-frequency features feedback into the encoder for global context, while high-frequency features are routed to the decoder via attention mechanisms for detail refinement, enabling bidirectional spatial–frequency fusion. By integrating convolutional neural networks (CNNs), vision transformer, and mamba backbones, our CLFDA achieves 2.0% and 3.46% average mIoU improvements on the GID-15 and the FUSU datasets, respectively. These consistent performance gains across heterogeneous backbones demonstrate the effectiveness and generalizability of our CLFDA in modeling frequency-domain features. The code is at <uri>https://github.com/GeoRSAI/CLFDA</uri>\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11080475/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11080475/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

基于高分辨率遥感影像的细粒度土地覆盖分类在城市和环境监测中起着至关重要的作用。虽然现有的基于空间域的方法取得了显著的进展，但它们在复杂场景中的性能仍然受到特征建模不足的限制。本文提出了连续低频分解架构（CLFDA），以解决当前方法中对多尺度频率特性的跨域建模不足的问题。该架构通过连续低频分解引入频域特征，其中每个频率分解和增强（FDE）模块使用离散小波变换（DWT）将空间和低频特征分离为低频和高频子带。低频特征反馈到编码器以获取全局上下文，而高频特征通过注意机制路由到解码器以进行细节细化，从而实现双向空间频率融合。通过集成卷积神经网络（cnn）、视觉变压器和曼巴骨干网，CLFDA在id -15和FUSU数据集上分别实现了2.0%和3.46%的平均mIoU改进。这些跨异构骨干网的一致性能增益证明了CLFDA在建模频域特征方面的有效性和通用性。代码见https://github.com/GeoRSAI/CLFDA

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CLFDA: Continuous Low-Frequency Decomposition Architecture for Fine-Grained Land Cover Classification

Fine-grained land cover classification from high-resolution remote sensing imagery plays a vital role in urban and environmental monitoring. While the existing spatial-domain-based approaches achieve notable progress, their performance in complex scenarios remains constrained by insufficient modeling of characteristics. This letter proposes the continuous low-frequency decomposition architecture (CLFDA) to address insufficient cross-domain modeling of multiscale frequency characteristics in current methods. The architecture introduces frequency-domain features through continuous low-frequency decomposition, where each frequency decomposition and enhancement (FDE) module employ discrete wavelet transform (DWT) to separate spatial and low-frequency features into low-frequency and high-frequency subbands. Low-frequency features feedback into the encoder for global context, while high-frequency features are routed to the decoder via attention mechanisms for detail refinement, enabling bidirectional spatial–frequency fusion. By integrating convolutional neural networks (CNNs), vision transformer, and mamba backbones, our CLFDA achieves 2.0% and 3.46% average mIoU improvements on the GID-15 and the FUSU datasets, respectively. These consistent performance gains across heterogeneous backbones demonstrate the effectiveness and generalizability of our CLFDA in modeling frequency-domain features. The code is at https://github.com/GeoRSAI/CLFDA

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society

自引率

0.00%

发文量