{"title":"基于门控全局-局部线性关注的遥感图像道路提取方法","authors":"Zhilin Qu;Mingzhe Li;Chenggong Wang;Zehua Chen","doi":"10.1109/LGRS.2025.3601585","DOIUrl":null,"url":null,"abstract":"Road extraction from remote sensing imagery plays a pivotal role in a wide range of geospatial and urban applications. Nevertheless, this task remains inherently challenging due to the intricate morphological variations of roads and frequent occlusions or interference caused by complex background environments. To address these challenges, we propose a road extraction network based on gated global–local linear attention (G<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>L<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>Attention). First, we introduce a linear deformable convolution and design a linear input-dependent deformable convolution (LID2Conv), which adaptively modulates convolution offsets and weights in a content-aware manner. In addition, we design a top-K-based sparse gated weight (TGW). We use this gated mechanism as a shared weight to multiply with local and global information to achieve G2L2Attention. Local information is obtained by LID2Conv, and we gain global information by introducing 2-D selective scan (SS2D). These two pathways are integrated through the proposed G2L2Attention, enabling an efficient and consistent fusion of hierarchical spatial features. The extracted features are passed to the decoder. This approach improves road detail representation and provides accurate contextual information. Experiments conducted on three public road datasets demonstrate that G2L2Net outperforms the existing methods in various evaluation metrics. Our source code is available at <uri>https://github.com/ZehuaChenLab</uri>","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"G2L2Net: A Road Extraction Method for Remote Sensing Images via Gated Global–Local Linear Attention\",\"authors\":\"Zhilin Qu;Mingzhe Li;Chenggong Wang;Zehua Chen\",\"doi\":\"10.1109/LGRS.2025.3601585\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Road extraction from remote sensing imagery plays a pivotal role in a wide range of geospatial and urban applications. Nevertheless, this task remains inherently challenging due to the intricate morphological variations of roads and frequent occlusions or interference caused by complex background environments. To address these challenges, we propose a road extraction network based on gated global–local linear attention (G<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>L<inline-formula> <tex-math>$^2$ </tex-math></inline-formula>Attention). First, we introduce a linear deformable convolution and design a linear input-dependent deformable convolution (LID2Conv), which adaptively modulates convolution offsets and weights in a content-aware manner. In addition, we design a top-K-based sparse gated weight (TGW). We use this gated mechanism as a shared weight to multiply with local and global information to achieve G2L2Attention. Local information is obtained by LID2Conv, and we gain global information by introducing 2-D selective scan (SS2D). These two pathways are integrated through the proposed G2L2Attention, enabling an efficient and consistent fusion of hierarchical spatial features. The extracted features are passed to the decoder. This approach improves road detail representation and provides accurate contextual information. Experiments conducted on three public road datasets demonstrate that G2L2Net outperforms the existing methods in various evaluation metrics. Our source code is available at <uri>https://github.com/ZehuaChenLab</uri>\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11134406/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11134406/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
从遥感影像中提取道路在广泛的地理空间和城市应用中起着关键作用。然而,由于道路复杂的形态变化和复杂背景环境引起的频繁遮挡或干扰,这项任务仍然具有固有的挑战性。为了解决这些挑战,我们提出了一个基于门控全局-局部线性关注(G $^2$ L $^2$ attention)的道路提取网络。首先,我们引入了一个线性可变形卷积,并设计了一个线性输入相关的可变形卷积(LID2Conv),该卷积以内容感知的方式自适应调节卷积偏移量和权重。此外,我们设计了一个基于顶部的稀疏门控权(TGW)。我们使用这种门控机制作为共享权值与局部和全局信息相乘来实现G2L2Attention。局部信息由LID2Conv获取,全局信息由二维选择性扫描(SS2D)获取。通过提出的G2L2Attention将这两条路径整合在一起,实现了分层空间特征的高效一致融合。提取的特征被传递给解码器。这种方法改进了道路细节表示,并提供了准确的上下文信息。在三个公共道路数据集上进行的实验表明,G2L2Net在各种评估指标上优于现有方法。我们的源代码可从https://github.com/ZehuaChenLab获得
G2L2Net: A Road Extraction Method for Remote Sensing Images via Gated Global–Local Linear Attention
Road extraction from remote sensing imagery plays a pivotal role in a wide range of geospatial and urban applications. Nevertheless, this task remains inherently challenging due to the intricate morphological variations of roads and frequent occlusions or interference caused by complex background environments. To address these challenges, we propose a road extraction network based on gated global–local linear attention (G$^2$ L$^2$ Attention). First, we introduce a linear deformable convolution and design a linear input-dependent deformable convolution (LID2Conv), which adaptively modulates convolution offsets and weights in a content-aware manner. In addition, we design a top-K-based sparse gated weight (TGW). We use this gated mechanism as a shared weight to multiply with local and global information to achieve G2L2Attention. Local information is obtained by LID2Conv, and we gain global information by introducing 2-D selective scan (SS2D). These two pathways are integrated through the proposed G2L2Attention, enabling an efficient and consistent fusion of hierarchical spatial features. The extracted features are passed to the decoder. This approach improves road detail representation and provides accurate contextual information. Experiments conducted on three public road datasets demonstrate that G2L2Net outperforms the existing methods in various evaluation metrics. Our source code is available at https://github.com/ZehuaChenLab