Pitfalls of XAI interpretation in environmental modeling: A warning on model bias in air quality data analysis

Souichi Oka, Takuma Yamazaki, Yoshiyasu Takefuji

Environmental Modelling & Software, Volume 194, Article 106700
DOI: 10.1016/j.envsoft.2025.106700
Published: 2025-09-19
Citations: 0
Abstract
Jung et al. (2025) achieved high predictive accuracy in interpolating missing ozone data using graph machine learning (ML) and conducted feature importance analysis with explainable AI (XAI). This correspondence acknowledges their significant contribution but discusses the limitations and biases inherent in ML models and XAI methods (e.g., Random Forest/Bootstrap Test, SHapley Additive exPlanations (SHAP)) and their impact on the reliability of derived feature importance. High predictive accuracy does not necessarily guarantee trustworthy interpretation of feature relevance, as evidenced by inconsistent importance rankings across models and XAI techniques. To enhance interpretability and scientific reliability, we advocate a validation strategy integrating ML with rigorous statistical analysis. It combines model-driven insights with statistical measures such as Spearman's rho and Kendall's tau, and information-theoretic metrics like Mutual Information and Total Correlation to capture complex, non-linear dependencies. Such integration improves the robustness of feature importance assessments and supports more reliable interpretations in environmental modeling.
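The validation strategy advocated above can be sketched in code. The example below is a minimal illustration (not the authors' implementation) of cross-checking model-driven importances against model-agnostic measures: a random forest's feature importances are compared with Spearman's rho, Kendall's tau, and mutual information per feature. The synthetic predictors (temperature, wind, NO2) and their relationship to the target are hypothetical stand-ins for air quality data.

```python
# Minimal sketch of cross-validating feature importance: compare
# random-forest importances with rank-correlation and information-
# theoretic relevance measures for each feature.
import numpy as np
from scipy.stats import spearmanr, kendalltau
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_selection import mutual_info_regression

rng = np.random.default_rng(0)
n = 500
# Hypothetical predictors for an ozone-like target.
X = rng.normal(size=(n, 3))
names = ["temperature", "wind", "no2"]
# Strong linear effect of feature 0, non-linear effect of feature 1,
# feature 2 is pure noise.
y = 2.0 * X[:, 0] + np.sin(X[:, 1]) + 0.1 * rng.normal(size=n)

model = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)

for i, name in enumerate(names):
    rho, _ = spearmanr(X[:, i], y)
    tau, _ = kendalltau(X[:, i], y)
    mi = mutual_info_regression(X[:, [i]], y, random_state=0)[0]
    print(f"{name}: RF importance={model.feature_importances_[i]:.3f}, "
          f"rho={rho:.3f}, tau={tau:.3f}, MI={mi:.3f}")
```

If a feature ranks highly under the model's importance but shows negligible rank correlation and mutual information with the target (or vice versa), that disagreement is exactly the kind of inconsistency the correspondence warns about, and it flags the interpretation for further scrutiny rather than acceptance at face value.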
Journal description:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.