Zhilu Chang , Shui-Hua Jiang , Faming Huang , Lei Shi , Jinsong Huang , Jianhong Wan , Filippo Catani
{"title":"基于空间异质性和机器学习模型的坡地覆盖层厚度预测","authors":"Zhilu Chang , Shui-Hua Jiang , Faming Huang , Lei Shi , Jinsong Huang , Jianhong Wan , Filippo Catani","doi":"10.1016/j.gsf.2025.102109","DOIUrl":null,"url":null,"abstract":"<div><div>The spatial distribution of overburden layer thickness (OLT) is crucial for landslide susceptibility prediction and slope stability analysis. Due to OLT spatial heterogeneity in hillslope regions, combined with the difficulty and time consumption of OLT sample collection, accurately predicting OLT distribution remains a challenging. To address this, a novel framework has been developed. First, OLT samples are collected through field surveys, remote sensing, and geological drilling. Next, the heterogeneity of OLT’s spatial distribution is analyzed using the probability distribution of OLT samples and their horizontal and vertical distributions. The OLT samples are categorized and the small sample categories are expanded using the synthetic minority over-sampling technique (SMOTE). The slope position is selected as a key conditioning factor. Subsequently, 16 conditioning factors are applied to construct OLT prediction model using the random forest regression algorithm. Weights are assigned to each OLT sample category to balance the uneven distribution of sample sizes. Finally, the Pearson correlation coefficient, mean absolute error (MAE), root mean square error (RMSE), and Lin’s concordance correlation coefficient (Lin’s CCC) are employed to validate the OLT prediction results. The Huangtan town serves as the case study. Results show: (1) heterogeneity analysis, SMOTE-based OLT sample expansion strategy and slope position selection can significantly mitigate the effect of spatial heterogeneity on OLT prediction. (2) The Pearson correlation coefficient, RMSE, MAE and Lin’s CCC values are 0.84, 1.173, 1.378 and 0.804, respectively, indicating excellent prediction performance. This research provides an effective solution for predicting OLT distribution in hillslope regions.</div></div>","PeriodicalId":12711,"journal":{"name":"Geoscience frontiers","volume":"16 5","pages":"Article 102109"},"PeriodicalIF":8.5000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of overburden layer thickness based on spatial heterogeneity analysis and machine learning models in hillslope regions\",\"authors\":\"Zhilu Chang , Shui-Hua Jiang , Faming Huang , Lei Shi , Jinsong Huang , Jianhong Wan , Filippo Catani\",\"doi\":\"10.1016/j.gsf.2025.102109\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The spatial distribution of overburden layer thickness (OLT) is crucial for landslide susceptibility prediction and slope stability analysis. Due to OLT spatial heterogeneity in hillslope regions, combined with the difficulty and time consumption of OLT sample collection, accurately predicting OLT distribution remains a challenging. To address this, a novel framework has been developed. First, OLT samples are collected through field surveys, remote sensing, and geological drilling. Next, the heterogeneity of OLT’s spatial distribution is analyzed using the probability distribution of OLT samples and their horizontal and vertical distributions. The OLT samples are categorized and the small sample categories are expanded using the synthetic minority over-sampling technique (SMOTE). The slope position is selected as a key conditioning factor. Subsequently, 16 conditioning factors are applied to construct OLT prediction model using the random forest regression algorithm. Weights are assigned to each OLT sample category to balance the uneven distribution of sample sizes. Finally, the Pearson correlation coefficient, mean absolute error (MAE), root mean square error (RMSE), and Lin’s concordance correlation coefficient (Lin’s CCC) are employed to validate the OLT prediction results. The Huangtan town serves as the case study. Results show: (1) heterogeneity analysis, SMOTE-based OLT sample expansion strategy and slope position selection can significantly mitigate the effect of spatial heterogeneity on OLT prediction. (2) The Pearson correlation coefficient, RMSE, MAE and Lin’s CCC values are 0.84, 1.173, 1.378 and 0.804, respectively, indicating excellent prediction performance. This research provides an effective solution for predicting OLT distribution in hillslope regions.</div></div>\",\"PeriodicalId\":12711,\"journal\":{\"name\":\"Geoscience frontiers\",\"volume\":\"16 5\",\"pages\":\"Article 102109\"},\"PeriodicalIF\":8.5000,\"publicationDate\":\"2025-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geoscience frontiers\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1674987125001148\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscience frontiers","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1674987125001148","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
Prediction of overburden layer thickness based on spatial heterogeneity analysis and machine learning models in hillslope regions
The spatial distribution of overburden layer thickness (OLT) is crucial for landslide susceptibility prediction and slope stability analysis. Due to OLT spatial heterogeneity in hillslope regions, combined with the difficulty and time consumption of OLT sample collection, accurately predicting OLT distribution remains a challenging. To address this, a novel framework has been developed. First, OLT samples are collected through field surveys, remote sensing, and geological drilling. Next, the heterogeneity of OLT’s spatial distribution is analyzed using the probability distribution of OLT samples and their horizontal and vertical distributions. The OLT samples are categorized and the small sample categories are expanded using the synthetic minority over-sampling technique (SMOTE). The slope position is selected as a key conditioning factor. Subsequently, 16 conditioning factors are applied to construct OLT prediction model using the random forest regression algorithm. Weights are assigned to each OLT sample category to balance the uneven distribution of sample sizes. Finally, the Pearson correlation coefficient, mean absolute error (MAE), root mean square error (RMSE), and Lin’s concordance correlation coefficient (Lin’s CCC) are employed to validate the OLT prediction results. The Huangtan town serves as the case study. Results show: (1) heterogeneity analysis, SMOTE-based OLT sample expansion strategy and slope position selection can significantly mitigate the effect of spatial heterogeneity on OLT prediction. (2) The Pearson correlation coefficient, RMSE, MAE and Lin’s CCC values are 0.84, 1.173, 1.378 and 0.804, respectively, indicating excellent prediction performance. This research provides an effective solution for predicting OLT distribution in hillslope regions.
Geoscience frontiersEarth and Planetary Sciences-General Earth and Planetary Sciences
CiteScore
17.80
自引率
3.40%
发文量
147
审稿时长
35 days
期刊介绍:
Geoscience Frontiers (GSF) is the Journal of China University of Geosciences (Beijing) and Peking University. It publishes peer-reviewed research articles and reviews in interdisciplinary fields of Earth and Planetary Sciences. GSF covers various research areas including petrology and geochemistry, lithospheric architecture and mantle dynamics, global tectonics, economic geology and fuel exploration, geophysics, stratigraphy and paleontology, environmental and engineering geology, astrogeology, and the nexus of resources-energy-emissions-climate under Sustainable Development Goals. The journal aims to bridge innovative, provocative, and challenging concepts and models in these fields, providing insights on correlations and evolution.