Mujungu L. Museru , Rouzbeh Nazari , Mohammad Reza Nikoo , Maryam Karimi
{"title":"Improving disaster resilience with causal machine learning for flood damage estimation","authors":"Mujungu L. Museru , Rouzbeh Nazari , Mohammad Reza Nikoo , Maryam Karimi","doi":"10.1016/j.scitotenv.2025.180121","DOIUrl":null,"url":null,"abstract":"<div><div>Machine Learning (ML) models have become a pivotal tool in the scientific community, successfully addressing complex problems across various domains, including flood risk management. Despite these advancements, traditional data-driven models often struggle when training data is scarce and primarily rely on correlation rather than causal relationships, making them vulnerable to shifts in data distribution. This paper introduces a Causally Informed Neural Network (CINN) framework that integrates causal prior knowledge as an inductive bias to improve flood damage predictions for residential properties and address these limitations. The proposed approach enhances model adaptability to unseen data distributions—an essential requirement for flood damage modeling. First, Deep End-to-End Causal Inference (DECI) is used to discover causal relationships and estimate their average treatment effects. These causal insights are then embedded into the neural network through causal weight initialization and causal regularization. The framework is validated using an enhanced National Flood Insurance Program (NFIP) claims dataset from Hurricane Katrina, and its performance is benchmarked against six widely used ML models from previous studies. Results show that the discovered causal relationships align with domain knowledge, reinforcing the approach's credibility. The proposed CINN model achieves an average 22 % error reduction compared to traditional ML models, demonstrating its superior robustness and predictive accuracy. Additionally, a feature attribution experiment confirms that the model's decision-making process is consistent with the identified causal relationships, increasing interpretability and trust in its predictions. These findings highlight the potential of integrating causality into ML-based flood damage modeling, paving the way for more resilient and generalizable disaster risk assessment frameworks.</div></div>","PeriodicalId":422,"journal":{"name":"Science of the Total Environment","volume":"995 ","pages":"Article 180121"},"PeriodicalIF":8.0000,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science of the Total Environment","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0048969725017619","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Machine Learning (ML) models have become a pivotal tool in the scientific community, successfully addressing complex problems across various domains, including flood risk management. Despite these advancements, traditional data-driven models often struggle when training data is scarce and primarily rely on correlation rather than causal relationships, making them vulnerable to shifts in data distribution. This paper introduces a Causally Informed Neural Network (CINN) framework that integrates causal prior knowledge as an inductive bias to improve flood damage predictions for residential properties and address these limitations. The proposed approach enhances model adaptability to unseen data distributions—an essential requirement for flood damage modeling. First, Deep End-to-End Causal Inference (DECI) is used to discover causal relationships and estimate their average treatment effects. These causal insights are then embedded into the neural network through causal weight initialization and causal regularization. The framework is validated using an enhanced National Flood Insurance Program (NFIP) claims dataset from Hurricane Katrina, and its performance is benchmarked against six widely used ML models from previous studies. Results show that the discovered causal relationships align with domain knowledge, reinforcing the approach's credibility. The proposed CINN model achieves an average 22 % error reduction compared to traditional ML models, demonstrating its superior robustness and predictive accuracy. Additionally, a feature attribution experiment confirms that the model's decision-making process is consistent with the identified causal relationships, increasing interpretability and trust in its predictions. These findings highlight the potential of integrating causality into ML-based flood damage modeling, paving the way for more resilient and generalizable disaster risk assessment frameworks.
期刊介绍:
The Science of the Total Environment is an international journal dedicated to scientific research on the environment and its interaction with humanity. It covers a wide range of disciplines and seeks to publish innovative, hypothesis-driven, and impactful research that explores the entire environment, including the atmosphere, lithosphere, hydrosphere, biosphere, and anthroposphere.
The journal's updated Aims & Scope emphasizes the importance of interdisciplinary environmental research with broad impact. Priority is given to studies that advance fundamental understanding and explore the interconnectedness of multiple environmental spheres. Field studies are preferred, while laboratory experiments must demonstrate significant methodological advancements or mechanistic insights with direct relevance to the environment.