Wubiao Huang;Fei Deng;Haibing Liu;Mingtao Ding;Qi Yao
{"title":"基于边缘优化的遥感图像多尺度语义分割","authors":"Wubiao Huang;Fei Deng;Haibing Liu;Mingtao Ding;Qi Yao","doi":"10.1109/TGRS.2025.3553524","DOIUrl":null,"url":null,"abstract":"Semantic segmentation of remote sensing images is crucial for disaster monitoring, urban planning, and land use. Due to scene complexity and multiscale features of targets, semantic segmentation of remote sensing images has become a challenging task. Deep convolutional neural networks capture remote contextual dependencies that are limited. Meanwhile, restoring the image size quickly leads to undersampling at object edges, resulting in poor boundary prediction. Therefore, this article proposes a multiscale semantic segmentation network of remote sensing images based on edge optimization, namely, multiscale edge optimization network (MSEONet). The decoder of the network consists of a multiscale context aggregation (MSCA) module, a coarse edge extraction (CEE) module, and an edge point feature optimization (EPFO) module. The MSCA module is used to capture multiscale contextual information and global dependencies between pixels. The CEE module is used for boundary extraction of multiclass coarse segmentation results. The EPFO module is used to optimize edge point features during the upsampling process. We conducted extensive experiments on the International Society for Photogrammetry and Remote Sensing (ISPRS) Potsdam 2-D dataset, the ISPRS Vaihingen 2-D dataset, and the FLAIR #1 dataset. The results show the effectiveness and superiority of our proposed MSEONet model compared to most of the state-of-the-art models. The CEE and EPFO modules can enhance the edge segmentation effect without increasing the computational and parametric quantities too much. The code is publicly available at <uri>https://github.com/HuangWBill/MSEONet</uri>.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-13"},"PeriodicalIF":8.6000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiscale Semantic Segmentation of Remote Sensing Images Based on Edge Optimization\",\"authors\":\"Wubiao Huang;Fei Deng;Haibing Liu;Mingtao Ding;Qi Yao\",\"doi\":\"10.1109/TGRS.2025.3553524\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semantic segmentation of remote sensing images is crucial for disaster monitoring, urban planning, and land use. Due to scene complexity and multiscale features of targets, semantic segmentation of remote sensing images has become a challenging task. Deep convolutional neural networks capture remote contextual dependencies that are limited. Meanwhile, restoring the image size quickly leads to undersampling at object edges, resulting in poor boundary prediction. Therefore, this article proposes a multiscale semantic segmentation network of remote sensing images based on edge optimization, namely, multiscale edge optimization network (MSEONet). The decoder of the network consists of a multiscale context aggregation (MSCA) module, a coarse edge extraction (CEE) module, and an edge point feature optimization (EPFO) module. The MSCA module is used to capture multiscale contextual information and global dependencies between pixels. The CEE module is used for boundary extraction of multiclass coarse segmentation results. The EPFO module is used to optimize edge point features during the upsampling process. We conducted extensive experiments on the International Society for Photogrammetry and Remote Sensing (ISPRS) Potsdam 2-D dataset, the ISPRS Vaihingen 2-D dataset, and the FLAIR #1 dataset. The results show the effectiveness and superiority of our proposed MSEONet model compared to most of the state-of-the-art models. The CEE and EPFO modules can enhance the edge segmentation effect without increasing the computational and parametric quantities too much. The code is publicly available at <uri>https://github.com/HuangWBill/MSEONet</uri>.\",\"PeriodicalId\":13213,\"journal\":{\"name\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"volume\":\"63 \",\"pages\":\"1-13\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10937181/\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10937181/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Multiscale Semantic Segmentation of Remote Sensing Images Based on Edge Optimization
Semantic segmentation of remote sensing images is crucial for disaster monitoring, urban planning, and land use. Due to scene complexity and multiscale features of targets, semantic segmentation of remote sensing images has become a challenging task. Deep convolutional neural networks capture remote contextual dependencies that are limited. Meanwhile, restoring the image size quickly leads to undersampling at object edges, resulting in poor boundary prediction. Therefore, this article proposes a multiscale semantic segmentation network of remote sensing images based on edge optimization, namely, multiscale edge optimization network (MSEONet). The decoder of the network consists of a multiscale context aggregation (MSCA) module, a coarse edge extraction (CEE) module, and an edge point feature optimization (EPFO) module. The MSCA module is used to capture multiscale contextual information and global dependencies between pixels. The CEE module is used for boundary extraction of multiclass coarse segmentation results. The EPFO module is used to optimize edge point features during the upsampling process. We conducted extensive experiments on the International Society for Photogrammetry and Remote Sensing (ISPRS) Potsdam 2-D dataset, the ISPRS Vaihingen 2-D dataset, and the FLAIR #1 dataset. The results show the effectiveness and superiority of our proposed MSEONet model compared to most of the state-of-the-art models. The CEE and EPFO modules can enhance the edge segmentation effect without increasing the computational and parametric quantities too much. The code is publicly available at https://github.com/HuangWBill/MSEONet.
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.