A Transfer Learning Approach for Landslide Semantic Segmentation Based on Visual Foundation Model

IF 4.7 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Changhong Hou;Junchuan Yu;Daqing Ge;Liu Yang;Laidian Xi;Yunxuan Pang;Yi Wen
{"title":"A Transfer Learning Approach for Landslide Semantic Segmentation Based on Visual Foundation Model","authors":"Changhong Hou;Junchuan Yu;Daqing Ge;Liu Yang;Laidian Xi;Yunxuan Pang;Yi Wen","doi":"10.1109/JSTARS.2025.3559884","DOIUrl":null,"url":null,"abstract":"Landslides are one of the most destructive natural disasters in the world, threatening human life and safety. With excellent performance as a foundation model for image segmentation, the segment anything model (SAM) has provided a novel paradigm for semantic segmentation research. However, the lack of remote sensing images in the SAM training data limits its ability to recognize landslides. In addition, despite the transfer learning approach can transfer SAM feature extraction capability to the landslide segmentation task, but it will consume a lot of computational resources and training time. In order to solve these challenges, this study proposes a TransLandSeg model that transfers the segmentation capability of SAM while learning landslide features at a low training cost. To limit model training parameters, the adaptive transfer learning (ATL) module is purposely designed, the image encoder is frozen during model training, only the ATL module and mask decoder are trained, and the knowledge learned from the ATL module is input into the original network. Moreover, to select the best ATL module, we also designed 9 kinds of ATL modules and analyzed the accuracy of the TransLandSeg model with different ATL modules. We selected the Bijie landslide dataset and the Landslide4Sense dataset for model training and testing. The experiment results show that the TransLandSeg model increases the mean intersection over union by 1.48% –13.01% compared to other state-of-the-art semantic segmentation models. In addition, TransLandSeg requires only 1.3% of SAM parameters to enable SAM's powerful capabilities to transfer to landslide segmentation.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"11561-11572"},"PeriodicalIF":4.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10962290","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10962290/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Landslides are one of the most destructive natural disasters in the world, threatening human life and safety. With excellent performance as a foundation model for image segmentation, the segment anything model (SAM) has provided a novel paradigm for semantic segmentation research. However, the lack of remote sensing images in the SAM training data limits its ability to recognize landslides. In addition, despite the transfer learning approach can transfer SAM feature extraction capability to the landslide segmentation task, but it will consume a lot of computational resources and training time. In order to solve these challenges, this study proposes a TransLandSeg model that transfers the segmentation capability of SAM while learning landslide features at a low training cost. To limit model training parameters, the adaptive transfer learning (ATL) module is purposely designed, the image encoder is frozen during model training, only the ATL module and mask decoder are trained, and the knowledge learned from the ATL module is input into the original network. Moreover, to select the best ATL module, we also designed 9 kinds of ATL modules and analyzed the accuracy of the TransLandSeg model with different ATL modules. We selected the Bijie landslide dataset and the Landslide4Sense dataset for model training and testing. The experiment results show that the TransLandSeg model increases the mean intersection over union by 1.48% –13.01% compared to other state-of-the-art semantic segmentation models. In addition, TransLandSeg requires only 1.3% of SAM parameters to enable SAM's powerful capabilities to transfer to landslide segmentation.
基于视觉基础模型的滑坡语义分割迁移学习方法
山体滑坡是世界上最具破坏性的自然灾害之一,威胁着人类的生命和安全。作为图像分割的基础模型,SAM具有优异的性能,为语义分割研究提供了一种新的范式。然而,地空导弹训练数据中缺少遥感图像,限制了其识别滑坡的能力。此外,迁移学习方法虽然可以将SAM特征提取能力转移到滑坡分割任务中,但会消耗大量的计算资源和训练时间。为了解决这些挑战,本研究提出了一种TransLandSeg模型,该模型在以较低的训练成本学习滑坡特征的同时转移了SAM的分割能力。为了限制模型训练参数,特意设计了自适应迁移学习(ATL)模块,在模型训练时冻结图像编码器,只训练ATL模块和掩码解码器,并将从ATL模块学习到的知识输入到原网络中。此外,为了选择最佳的ATL模块,我们还设计了9种ATL模块,并分析了不同ATL模块对TransLandSeg模型的精度。我们选择毕节滑坡数据集和Landslide4Sense数据集进行模型训练和测试。实验结果表明,TransLandSeg模型比其他最先进的语义分割模型提高了1.48% -13.01%的平均交集比联合。此外,TransLandSeg只需要1.3%的SAM参数,就可以将SAM的强大功能转化为滑坡分割。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.30
自引率
10.90%
发文量
563
审稿时长
4.7 months
期刊介绍: The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信