Dongyang Hou;Junwu Xiang;Li Lei;Wenmin Qiu;Mengdi Zhao;Yingjun Luo
{"title":"CLFDA:用于细粒度土地覆盖分类的连续低频分解架构","authors":"Dongyang Hou;Junwu Xiang;Li Lei;Wenmin Qiu;Mengdi Zhao;Yingjun Luo","doi":"10.1109/LGRS.2025.3589242","DOIUrl":null,"url":null,"abstract":"Fine-grained land cover classification from high-resolution remote sensing imagery plays a vital role in urban and environmental monitoring. While the existing spatial-domain-based approaches achieve notable progress, their performance in complex scenarios remains constrained by insufficient modeling of characteristics. This letter proposes the continuous low-frequency decomposition architecture (CLFDA) to address insufficient cross-domain modeling of multiscale frequency characteristics in current methods. The architecture introduces frequency-domain features through continuous low-frequency decomposition, where each frequency decomposition and enhancement (FDE) module employ discrete wavelet transform (DWT) to separate spatial and low-frequency features into low-frequency and high-frequency subbands. Low-frequency features feedback into the encoder for global context, while high-frequency features are routed to the decoder via attention mechanisms for detail refinement, enabling bidirectional spatial–frequency fusion. By integrating convolutional neural networks (CNNs), vision transformer, and mamba backbones, our CLFDA achieves 2.0% and 3.46% average mIoU improvements on the GID-15 and the FUSU datasets, respectively. These consistent performance gains across heterogeneous backbones demonstrate the effectiveness and generalizability of our CLFDA in modeling frequency-domain features. The code is at <uri>https://github.com/GeoRSAI/CLFDA</uri>","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CLFDA: Continuous Low-Frequency Decomposition Architecture for Fine-Grained Land Cover Classification\",\"authors\":\"Dongyang Hou;Junwu Xiang;Li Lei;Wenmin Qiu;Mengdi Zhao;Yingjun Luo\",\"doi\":\"10.1109/LGRS.2025.3589242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fine-grained land cover classification from high-resolution remote sensing imagery plays a vital role in urban and environmental monitoring. While the existing spatial-domain-based approaches achieve notable progress, their performance in complex scenarios remains constrained by insufficient modeling of characteristics. This letter proposes the continuous low-frequency decomposition architecture (CLFDA) to address insufficient cross-domain modeling of multiscale frequency characteristics in current methods. The architecture introduces frequency-domain features through continuous low-frequency decomposition, where each frequency decomposition and enhancement (FDE) module employ discrete wavelet transform (DWT) to separate spatial and low-frequency features into low-frequency and high-frequency subbands. Low-frequency features feedback into the encoder for global context, while high-frequency features are routed to the decoder via attention mechanisms for detail refinement, enabling bidirectional spatial–frequency fusion. By integrating convolutional neural networks (CNNs), vision transformer, and mamba backbones, our CLFDA achieves 2.0% and 3.46% average mIoU improvements on the GID-15 and the FUSU datasets, respectively. These consistent performance gains across heterogeneous backbones demonstrate the effectiveness and generalizability of our CLFDA in modeling frequency-domain features. The code is at <uri>https://github.com/GeoRSAI/CLFDA</uri>\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11080475/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11080475/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CLFDA: Continuous Low-Frequency Decomposition Architecture for Fine-Grained Land Cover Classification
Fine-grained land cover classification from high-resolution remote sensing imagery plays a vital role in urban and environmental monitoring. While the existing spatial-domain-based approaches achieve notable progress, their performance in complex scenarios remains constrained by insufficient modeling of characteristics. This letter proposes the continuous low-frequency decomposition architecture (CLFDA) to address insufficient cross-domain modeling of multiscale frequency characteristics in current methods. The architecture introduces frequency-domain features through continuous low-frequency decomposition, where each frequency decomposition and enhancement (FDE) module employ discrete wavelet transform (DWT) to separate spatial and low-frequency features into low-frequency and high-frequency subbands. Low-frequency features feedback into the encoder for global context, while high-frequency features are routed to the decoder via attention mechanisms for detail refinement, enabling bidirectional spatial–frequency fusion. By integrating convolutional neural networks (CNNs), vision transformer, and mamba backbones, our CLFDA achieves 2.0% and 3.46% average mIoU improvements on the GID-15 and the FUSU datasets, respectively. These consistent performance gains across heterogeneous backbones demonstrate the effectiveness and generalizability of our CLFDA in modeling frequency-domain features. The code is at https://github.com/GeoRSAI/CLFDA