{"title":"Machine learning for safety distances prediction during emergency response of toxic dispersion accidental scenarios","authors":"Artemis Papadaki, Alba Àgueda, Eulàlia Planas","doi":"10.1016/j.jlp.2025.105604","DOIUrl":null,"url":null,"abstract":"<div><div>The present work investigates the prediction of safety distances (SDs) for toxic dispersion scenarios using machine learning (ML) models and examines an application to emergency response cases. PHAST software was used to generate multiple dispersion scenarios over a wide range of ambient conditions, leak and catastrophic rupture loss of containment focusing on nine toxic substances in varying storage conditions. The database underwent exploratory analysis and dependency testing to thoroughly analyze its content and the interdependencies among the input features. A correlation matrix facilitated feature reduction by identifying redundant features which were disregarded. The nine input features used were molecular weight (Mw), mass, storage phase, storage temperature, air temperature, stability class, wind, ground roughness and hole size. Following, four ML models, namely Random Forest, AdaBoost, HistGradientBoosting and LightGBM, were trained and optimized on the database. Their performance was compared across three sets (validation, test and extrapolation) using two common statistical metrics (RMSE and R<sup>2</sup>) and a custom metric (CS) designed to address the specific characteristics of SDs. Among the four models, LightGBM showed the best performance with an RMSE of 20.00 m and a CS of 3.65 m in the test set. Sensitivity analysis techniques (i.e., permutation importance, Sobol S1 and DMIM indices) were used to rank the importance of the features of the LightGBM model. This model was then integrated into a logic diagram capable of handling cases where some input features are unknown. Seven hypothetical cases were defined to demonstrate the practicability of such tool for emergency response.</div></div>","PeriodicalId":16291,"journal":{"name":"Journal of Loss Prevention in The Process Industries","volume":"95 ","pages":"Article 105604"},"PeriodicalIF":3.6000,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Loss Prevention in The Process Industries","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950423025000622","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The present work investigates the prediction of safety distances (SDs) for toxic dispersion scenarios using machine learning (ML) models and examines an application to emergency response cases. PHAST software was used to generate multiple dispersion scenarios over a wide range of ambient conditions, leak and catastrophic rupture loss of containment focusing on nine toxic substances in varying storage conditions. The database underwent exploratory analysis and dependency testing to thoroughly analyze its content and the interdependencies among the input features. A correlation matrix facilitated feature reduction by identifying redundant features which were disregarded. The nine input features used were molecular weight (Mw), mass, storage phase, storage temperature, air temperature, stability class, wind, ground roughness and hole size. Following, four ML models, namely Random Forest, AdaBoost, HistGradientBoosting and LightGBM, were trained and optimized on the database. Their performance was compared across three sets (validation, test and extrapolation) using two common statistical metrics (RMSE and R2) and a custom metric (CS) designed to address the specific characteristics of SDs. Among the four models, LightGBM showed the best performance with an RMSE of 20.00 m and a CS of 3.65 m in the test set. Sensitivity analysis techniques (i.e., permutation importance, Sobol S1 and DMIM indices) were used to rank the importance of the features of the LightGBM model. This model was then integrated into a logic diagram capable of handling cases where some input features are unknown. Seven hypothetical cases were defined to demonstrate the practicability of such tool for emergency response.
期刊介绍:
The broad scope of the journal is process safety. Process safety is defined as the prevention and mitigation of process-related injuries and damage arising from process incidents involving fire, explosion and toxic release. Such undesired events occur in the process industries during the use, storage, manufacture, handling, and transportation of highly hazardous chemicals.