Saied Pirasteh;Muhammad Yasir;Hong Fan;Fernando J. Aguilar;Md Sakaouth Hossain;Huxiong Li
{"title":"Enhanced Landslide Detection Using a Swin Transformer With Multiscale Feature Fusion and Local Information Aggregation Modules","authors":"Saied Pirasteh;Muhammad Yasir;Hong Fan;Fernando J. Aguilar;Md Sakaouth Hossain;Huxiong Li","doi":"10.1109/LGRS.2025.3560990","DOIUrl":null,"url":null,"abstract":"In recent years, detecting and monitoring landslides have become increasingly critical for disaster management and mitigation efforts. Here, we propose a model for landslide detection utilizing a combination of the Swin Transformer architecture with multiscale feature fusion lateral connection and local information aggregation modules (LIAMs). The Swin Transformer, known for its effectiveness in image understanding tasks, serves as the backbone of our detection system. By leveraging its hierarchical self-attention mechanism, the Swin Transformer can effectively capture both local and global contextual information from input images, facilitating accurate feature representation. To increase the performance of the Swin Transformer specifically for landslide detection, we introduce two additional modules: the multiscale feature fusion lateral connection module (MFFLCM) and the LIAM. The former module enables the integration of features across multiple scales, allowing the model to capture both fine-grained details and broader contextual information relevant to landslide characteristics. Meanwhile, the latter module focuses on aggregating local information within regions of interest, further refining the model’s ability to discriminate between landslide and non-landslide areas. Through extensive test and evaluation of benchmark datasets, our proposed method demonstrates promising results in detecting landslides with high mIoU, <inline-formula> <tex-math>$F1$ </tex-math></inline-formula> score, kappa, precision, and recall 84.2%, 90.7%, 82.6%, 89.9%, and 91.9%, respectively. Moreover, its robustness to variations in terrain and environmental conditions suggests its potential for real-world applications in landslide monitoring and early warning systems. Overall, our study highlights the effectiveness of integrating advanced transformer architectures with tailored modules for addressing complex geospatial challenges like landslide detection.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10973097/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, detecting and monitoring landslides have become increasingly critical for disaster management and mitigation efforts. Here, we propose a model for landslide detection utilizing a combination of the Swin Transformer architecture with multiscale feature fusion lateral connection and local information aggregation modules (LIAMs). The Swin Transformer, known for its effectiveness in image understanding tasks, serves as the backbone of our detection system. By leveraging its hierarchical self-attention mechanism, the Swin Transformer can effectively capture both local and global contextual information from input images, facilitating accurate feature representation. To increase the performance of the Swin Transformer specifically for landslide detection, we introduce two additional modules: the multiscale feature fusion lateral connection module (MFFLCM) and the LIAM. The former module enables the integration of features across multiple scales, allowing the model to capture both fine-grained details and broader contextual information relevant to landslide characteristics. Meanwhile, the latter module focuses on aggregating local information within regions of interest, further refining the model’s ability to discriminate between landslide and non-landslide areas. Through extensive test and evaluation of benchmark datasets, our proposed method demonstrates promising results in detecting landslides with high mIoU, $F1$ score, kappa, precision, and recall 84.2%, 90.7%, 82.6%, 89.9%, and 91.9%, respectively. Moreover, its robustness to variations in terrain and environmental conditions suggests its potential for real-world applications in landslide monitoring and early warning systems. Overall, our study highlights the effectiveness of integrating advanced transformer architectures with tailored modules for addressing complex geospatial challenges like landslide detection.