基于门控全局-局部线性关注的遥感图像道路提取方法

IF 4.4

IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society Pub Date : 2025-08-22 DOI:10.1109/LGRS.2025.3601585

Zhilin Qu;Mingzhe Li;Chenggong Wang;Zehua Chen

{"title":"基于门控全局-局部线性关注的遥感图像道路提取方法","authors":"Zhilin Qu;Mingzhe Li;Chenggong Wang;Zehua Chen","doi":"10.1109/LGRS.2025.3601585","DOIUrl":null,"url":null,"abstract":"Road extraction from remote sensing imagery plays a pivotal role in a wide range of geospatial and urban applications. Nevertheless, this task remains inherently challenging due to the intricate morphological variations of roads and frequent occlusions or interference caused by complex background environments. To address these challenges, we propose a road extraction network based on gated global–local linear attention (G<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>L<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>Attention). First, we introduce a linear deformable convolution and design a linear input-dependent deformable convolution (LID2Conv), which adaptively modulates convolution offsets and weights in a content-aware manner. In addition, we design a top-K-based sparse gated weight (TGW). We use this gated mechanism as a shared weight to multiply with local and global information to achieve G2L2Attention. Local information is obtained by LID2Conv, and we gain global information by introducing 2-D selective scan (SS2D). These two pathways are integrated through the proposed G2L2Attention, enabling an efficient and consistent fusion of hierarchical spatial features. The extracted features are passed to the decoder. This approach improves road detail representation and provides accurate contextual information. Experiments conducted on three public road datasets demonstrate that G2L2Net outperforms the existing methods in various evaluation metrics. Our source code is available at <uri>https://github.com/ZehuaChenLab</uri>","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"G2L2Net: A Road Extraction Method for Remote Sensing Images via Gated Global–Local Linear Attention\",\"authors\":\"Zhilin Qu;Mingzhe Li;Chenggong Wang;Zehua Chen\",\"doi\":\"10.1109/LGRS.2025.3601585\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Road extraction from remote sensing imagery plays a pivotal role in a wide range of geospatial and urban applications. Nevertheless, this task remains inherently challenging due to the intricate morphological variations of roads and frequent occlusions or interference caused by complex background environments. To address these challenges, we propose a road extraction network based on gated global–local linear attention (G<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>L<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>Attention). First, we introduce a linear deformable convolution and design a linear input-dependent deformable convolution (LID2Conv), which adaptively modulates convolution offsets and weights in a content-aware manner. In addition, we design a top-K-based sparse gated weight (TGW). We use this gated mechanism as a shared weight to multiply with local and global information to achieve G2L2Attention. Local information is obtained by LID2Conv, and we gain global information by introducing 2-D selective scan (SS2D). These two pathways are integrated through the proposed G2L2Attention, enabling an efficient and consistent fusion of hierarchical spatial features. The extracted features are passed to the decoder. This approach improves road detail representation and provides accurate contextual information. Experiments conducted on three public road datasets demonstrate that G2L2Net outperforms the existing methods in various evaluation metrics. Our source code is available at <uri>https://github.com/ZehuaChenLab</uri>\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11134406/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11134406/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

从遥感影像中提取道路在广泛的地理空间和城市应用中起着关键作用。然而，由于道路复杂的形态变化和复杂背景环境引起的频繁遮挡或干扰，这项任务仍然具有固有的挑战性。为了解决这些挑战，我们提出了一个基于门控全局-局部线性关注（G $^2$ L $^2$ attention）的道路提取网络。首先，我们引入了一个线性可变形卷积，并设计了一个线性输入相关的可变形卷积（LID2Conv），该卷积以内容感知的方式自适应调节卷积偏移量和权重。此外，我们设计了一个基于顶部的稀疏门控权（TGW）。我们使用这种门控机制作为共享权值与局部和全局信息相乘来实现G2L2Attention。局部信息由LID2Conv获取，全局信息由二维选择性扫描（SS2D）获取。通过提出的G2L2Attention将这两条路径整合在一起，实现了分层空间特征的高效一致融合。提取的特征被传递给解码器。这种方法改进了道路细节表示，并提供了准确的上下文信息。在三个公共道路数据集上进行的实验表明，G2L2Net在各种评估指标上优于现有方法。我们的源代码可从https://github.com/ZehuaChenLab获得

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

G2L2Net: A Road Extraction Method for Remote Sensing Images via Gated Global–Local Linear Attention

Road extraction from remote sensing imagery plays a pivotal role in a wide range of geospatial and urban applications. Nevertheless, this task remains inherently challenging due to the intricate morphological variations of roads and frequent occlusions or interference caused by complex background environments. To address these challenges, we propose a road extraction network based on gated global–local linear attention (G

$^2$

Attention). First, we introduce a linear deformable convolution and design a linear input-dependent deformable convolution (LID2Conv), which adaptively modulates convolution offsets and weights in a content-aware manner. In addition, we design a top-K-based sparse gated weight (TGW). We use this gated mechanism as a shared weight to multiply with local and global information to achieve G2L2Attention. Local information is obtained by LID2Conv, and we gain global information by introducing 2-D selective scan (SS2D). These two pathways are integrated through the proposed G2L2Attention, enabling an efficient and consistent fusion of hierarchical spatial features. The extracted features are passed to the decoder. This approach improves road detail representation and provides accurate contextual information. Experiments conducted on three public road datasets demonstrate that G2L2Net outperforms the existing methods in various evaluation metrics. Our source code is available at https://github.com/ZehuaChenLab

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society

自引率

0.00%

发文量