机器学习模拟高分辨率淹没图

E. B. Storrøsten, Naveen Ragu Ramalingam, S. Lorito, M. Volpe, C. Sánchez-Linares, F. Løvholt, Steven J. Gibbons
{"title":"机器学习模拟高分辨率淹没图","authors":"E. B. Storrøsten, Naveen Ragu Ramalingam, S. Lorito, M. Volpe, C. Sánchez-Linares, F. Løvholt, Steven J. Gibbons","doi":"10.1093/gji/ggae151","DOIUrl":null,"url":null,"abstract":"\n Estimating coastal tsunami impact for early-warning or long-term hazard analysis requires the calculation of inundation metrics such as flow-depth or momentum flux. Both applications require the simulation of large numbers of scenarios to capture both the aleatory variability and the epistemic tsunami uncertainty. A computationally demanding step in simulating inundation is solving the nonlinear shallow water (NLSW) equations on meshes with sufficiently high resolution to represent the local elevation accurately enough to capture the physics governing the flow. This computational expense is particularly challenging in the context of Tsunami Early Warning where strict time constraints apply. A Machine Learning (ML) model that predicts inundation maps from offshore simulation results with acceptable accuracy, trained on an acceptably small training set of full simulations, could replace the computationally expensive NLSW part of the simulations for vast numbers of scenarios and predict inundation rapidly and with reduced computational demands. We consider the application of an encoder-decoder based neural network to predict high-resolution inundation maps based only on more cheaply calculated simulated time-series at a limited number of offshore locations. The network needs to be trained using input offshore time-series and the corresponding inundation maps from previously calculated full simulations. We develop and evaluate the ML model on a comprehensive set of inundation simulations for the coast of eastern Sicily for tens of thousands of subduction earthquake sources in the Mediterranean Sea. We find good performance for this case study even using relatively small training sets (order of hundreds) provided that appropriate choices are made in the specification of model parameters, the specification of the loss function, and the selection of training events. The uncertainty in the prediction for any given location decreases with the number of training events that inundate that location, with a good range of flow depths needed for accurate predictions. This means that care is needed to ensure that rarer high-inundation scenarios are well-represented in the training sets. The importance of applying regularization techniques increases as the size of the training sets decreases. The computational gain of the proposed methodology depends on the number of complete simulations needed to train the neural network, ranging between 164 and 4196 scenarios in this study. The cost of training the network is small in comparison with the cost of the numerical simulations and, for an ensemble of around 28000 scenarios, this represents a 6 to 170-fold reduction in computing costs.","PeriodicalId":502458,"journal":{"name":"Geophysical Journal International","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Emulation of High Resolution Inundation Maps\",\"authors\":\"E. B. Storrøsten, Naveen Ragu Ramalingam, S. Lorito, M. Volpe, C. Sánchez-Linares, F. Løvholt, Steven J. Gibbons\",\"doi\":\"10.1093/gji/ggae151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Estimating coastal tsunami impact for early-warning or long-term hazard analysis requires the calculation of inundation metrics such as flow-depth or momentum flux. Both applications require the simulation of large numbers of scenarios to capture both the aleatory variability and the epistemic tsunami uncertainty. A computationally demanding step in simulating inundation is solving the nonlinear shallow water (NLSW) equations on meshes with sufficiently high resolution to represent the local elevation accurately enough to capture the physics governing the flow. This computational expense is particularly challenging in the context of Tsunami Early Warning where strict time constraints apply. A Machine Learning (ML) model that predicts inundation maps from offshore simulation results with acceptable accuracy, trained on an acceptably small training set of full simulations, could replace the computationally expensive NLSW part of the simulations for vast numbers of scenarios and predict inundation rapidly and with reduced computational demands. We consider the application of an encoder-decoder based neural network to predict high-resolution inundation maps based only on more cheaply calculated simulated time-series at a limited number of offshore locations. The network needs to be trained using input offshore time-series and the corresponding inundation maps from previously calculated full simulations. We develop and evaluate the ML model on a comprehensive set of inundation simulations for the coast of eastern Sicily for tens of thousands of subduction earthquake sources in the Mediterranean Sea. We find good performance for this case study even using relatively small training sets (order of hundreds) provided that appropriate choices are made in the specification of model parameters, the specification of the loss function, and the selection of training events. The uncertainty in the prediction for any given location decreases with the number of training events that inundate that location, with a good range of flow depths needed for accurate predictions. This means that care is needed to ensure that rarer high-inundation scenarios are well-represented in the training sets. The importance of applying regularization techniques increases as the size of the training sets decreases. The computational gain of the proposed methodology depends on the number of complete simulations needed to train the neural network, ranging between 164 and 4196 scenarios in this study. The cost of training the network is small in comparison with the cost of the numerical simulations and, for an ensemble of around 28000 scenarios, this represents a 6 to 170-fold reduction in computing costs.\",\"PeriodicalId\":502458,\"journal\":{\"name\":\"Geophysical Journal International\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geophysical Journal International\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/gji/ggae151\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geophysical Journal International","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/gji/ggae151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

估算沿海海啸的影响,用于预警或长期灾害分析,需要计算淹没指标,如水流深度 或动量通量。这两种应用都需要模拟大量的情况,以捕捉已知的可变性和认识上的海啸不确定性。模拟淹没的一个计算要求很高的步骤是在分辨率足够高的网格上求解非线性浅水(NLSW)方程,以足够准确地表示局部海拔高度,从而捕捉到控制水流的物理规律。在海啸预警方面,时间非常紧迫,因此计算费用尤其具有挑战性。机器学习(ML)模型可以从离岸模拟结果中预测淹没地图,其准确度可以接受,并且是在可接受的小规模全模拟训练集上训练出来的,它可以取代计算成本高昂的 NLSW 部分模拟,适用于大量场景,并能快速预测淹没情况,同时降低计算需求。我们考虑应用基于编码器-解码器的神经网络来预测高分辨率的淹没图,该网络仅基于数量有限的近海地点的更廉价的模拟时间序列计算。该网络需要使用输入的离岸时间序列和先前计算的完整模拟的相应淹没图进行训练。我们针对地中海数万个俯冲地震震源,在西西里岛东部海岸的一整套淹没模拟中开发并评估了 ML 模型。我们发现,即使使用相对较小的训练集(数量级为数百),只要在模型参数的指定、损失函数的指定以及训练事件的选择方面做出适当的选择,该案例研究也能获得良好的性能。任何给定地点预测结果的不确定性都会随着淹没该地点的训练事件数量的增加而减小,准确预测所需的水流深度范围较大。这意味着需要注意确保在训练集中充分反映较罕见的高淹没情景。应用正则化技术的重要性随着训练集规模的减小而增加。建议方法的计算收益取决于训练神经网络所需的完整模拟次数,在本研究中,模拟次数从 164 到 4196 次不等。与数值模拟的成本相比,训练网络的成本很小,对于大约 28000 个场景的集合而言,这意味着计算成本减少了 6 到 170 倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning Emulation of High Resolution Inundation Maps
Estimating coastal tsunami impact for early-warning or long-term hazard analysis requires the calculation of inundation metrics such as flow-depth or momentum flux. Both applications require the simulation of large numbers of scenarios to capture both the aleatory variability and the epistemic tsunami uncertainty. A computationally demanding step in simulating inundation is solving the nonlinear shallow water (NLSW) equations on meshes with sufficiently high resolution to represent the local elevation accurately enough to capture the physics governing the flow. This computational expense is particularly challenging in the context of Tsunami Early Warning where strict time constraints apply. A Machine Learning (ML) model that predicts inundation maps from offshore simulation results with acceptable accuracy, trained on an acceptably small training set of full simulations, could replace the computationally expensive NLSW part of the simulations for vast numbers of scenarios and predict inundation rapidly and with reduced computational demands. We consider the application of an encoder-decoder based neural network to predict high-resolution inundation maps based only on more cheaply calculated simulated time-series at a limited number of offshore locations. The network needs to be trained using input offshore time-series and the corresponding inundation maps from previously calculated full simulations. We develop and evaluate the ML model on a comprehensive set of inundation simulations for the coast of eastern Sicily for tens of thousands of subduction earthquake sources in the Mediterranean Sea. We find good performance for this case study even using relatively small training sets (order of hundreds) provided that appropriate choices are made in the specification of model parameters, the specification of the loss function, and the selection of training events. The uncertainty in the prediction for any given location decreases with the number of training events that inundate that location, with a good range of flow depths needed for accurate predictions. This means that care is needed to ensure that rarer high-inundation scenarios are well-represented in the training sets. The importance of applying regularization techniques increases as the size of the training sets decreases. The computational gain of the proposed methodology depends on the number of complete simulations needed to train the neural network, ranging between 164 and 4196 scenarios in this study. The cost of training the network is small in comparison with the cost of the numerical simulations and, for an ensemble of around 28000 scenarios, this represents a 6 to 170-fold reduction in computing costs.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信