Bo Liu, Haijia Wen, Mingrui Di, Mingyong Liao, Junhao Huang, Long Qian, Yongbo Chu
{"title":"基于多源数据的地震危险性评估模型稳健性研究","authors":"Bo Liu, Haijia Wen, Mingrui Di, Mingyong Liao, Junhao Huang, Long Qian, Yongbo Chu","doi":"10.1016/j.gr.2025.04.003","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional seismic hazard assessment models mainly focus on seismic conditions, often neglecting topography, geology, and environmental factors. The limited robustness and “black-box” nature of single machine learning algorithms pose challenges in reliably assessing seismic hazard. This study leverages multi-source data and five hyperparameter optimization (HPO) algorithms to enhance the robustness of seismic hazard mapping (SHM) models in Jiuzhaigou County, China, by fine-tuning the traditional random forest (RF) model. The SHapley Additive exPlanations (SHAP) method was used to reveal decision-making mechanisms in seismic hazard models, supported by local interpretations and on-site investigations. This study considered 23 factors, including seismic factors, terrain morphology, geological conditions, and environmental conditions. We evaluated the performance of each hybrid model using metrics such as Coefficient of Determination (R<sup>2</sup> score) and various validation methods. The results show that the Random Forest-Tree-structured Parzen Estimators (RF-TPE) model achieves the highest R<sup>2</sup> score (0.9877). The RF-TPE outperforms other models in terms of robustness, making the SHM constructed with this model more reliable. The study also found that, in modeling SHM in the region, factors such as Peak Ground Velocity (PGV), hypocentral distance, Peak Spectral Acceleration (PSA), distance to fault, slope, and elevation are the most influential.</div></div>","PeriodicalId":12761,"journal":{"name":"Gondwana Research","volume":"144 ","pages":"Pages 87-108"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robustness study of seismic hazard assessment models based on multi-source data\",\"authors\":\"Bo Liu, Haijia Wen, Mingrui Di, Mingyong Liao, Junhao Huang, Long Qian, Yongbo Chu\",\"doi\":\"10.1016/j.gr.2025.04.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Traditional seismic hazard assessment models mainly focus on seismic conditions, often neglecting topography, geology, and environmental factors. The limited robustness and “black-box” nature of single machine learning algorithms pose challenges in reliably assessing seismic hazard. This study leverages multi-source data and five hyperparameter optimization (HPO) algorithms to enhance the robustness of seismic hazard mapping (SHM) models in Jiuzhaigou County, China, by fine-tuning the traditional random forest (RF) model. The SHapley Additive exPlanations (SHAP) method was used to reveal decision-making mechanisms in seismic hazard models, supported by local interpretations and on-site investigations. This study considered 23 factors, including seismic factors, terrain morphology, geological conditions, and environmental conditions. We evaluated the performance of each hybrid model using metrics such as Coefficient of Determination (R<sup>2</sup> score) and various validation methods. The results show that the Random Forest-Tree-structured Parzen Estimators (RF-TPE) model achieves the highest R<sup>2</sup> score (0.9877). The RF-TPE outperforms other models in terms of robustness, making the SHM constructed with this model more reliable. The study also found that, in modeling SHM in the region, factors such as Peak Ground Velocity (PGV), hypocentral distance, Peak Spectral Acceleration (PSA), distance to fault, slope, and elevation are the most influential.</div></div>\",\"PeriodicalId\":12761,\"journal\":{\"name\":\"Gondwana Research\",\"volume\":\"144 \",\"pages\":\"Pages 87-108\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Gondwana Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1342937X25001157\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gondwana Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1342937X25001157","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
Robustness study of seismic hazard assessment models based on multi-source data
Traditional seismic hazard assessment models mainly focus on seismic conditions, often neglecting topography, geology, and environmental factors. The limited robustness and “black-box” nature of single machine learning algorithms pose challenges in reliably assessing seismic hazard. This study leverages multi-source data and five hyperparameter optimization (HPO) algorithms to enhance the robustness of seismic hazard mapping (SHM) models in Jiuzhaigou County, China, by fine-tuning the traditional random forest (RF) model. The SHapley Additive exPlanations (SHAP) method was used to reveal decision-making mechanisms in seismic hazard models, supported by local interpretations and on-site investigations. This study considered 23 factors, including seismic factors, terrain morphology, geological conditions, and environmental conditions. We evaluated the performance of each hybrid model using metrics such as Coefficient of Determination (R2 score) and various validation methods. The results show that the Random Forest-Tree-structured Parzen Estimators (RF-TPE) model achieves the highest R2 score (0.9877). The RF-TPE outperforms other models in terms of robustness, making the SHM constructed with this model more reliable. The study also found that, in modeling SHM in the region, factors such as Peak Ground Velocity (PGV), hypocentral distance, Peak Spectral Acceleration (PSA), distance to fault, slope, and elevation are the most influential.
期刊介绍:
Gondwana Research (GR) is an International Journal aimed to promote high quality research publications on all topics related to solid Earth, particularly with reference to the origin and evolution of continents, continental assemblies and their resources. GR is an "all earth science" journal with no restrictions on geological time, terrane or theme and covers a wide spectrum of topics in geosciences such as geology, geomorphology, palaeontology, structure, petrology, geochemistry, stable isotopes, geochronology, economic geology, exploration geology, engineering geology, geophysics, and environmental geology among other themes, and provides an appropriate forum to integrate studies from different disciplines and different terrains. In addition to regular articles and thematic issues, the journal invites high profile state-of-the-art reviews on thrust area topics for its column, ''GR FOCUS''. Focus articles include short biographies and photographs of the authors. Short articles (within ten printed pages) for rapid publication reporting important discoveries or innovative models of global interest will be considered under the category ''GR LETTERS''.