Cross-validation strategy impacts the performance and interpretation of machine learning models

Artificial intelligence for the earth systems Pub Date : 2023-07-10 DOI:10.1175/aies-d-23-0026.1

Lily-belle Sweet, Christoph Müller, Mohit Anand, J. Zscheischler

{"title":"Cross-validation strategy impacts the performance and interpretation of machine learning models","authors":"Lily-belle Sweet, Christoph Müller, Mohit Anand, J. Zscheischler","doi":"10.1175/aies-d-23-0026.1","DOIUrl":null,"url":null,"abstract":"\nMachine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"32 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence for the earth systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1175/aies-d-23-0026.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Machine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.

查看原文本刊更多论文

交叉验证策略影响机器学习模型的性能和解释

机器学习算法能够捕捉复杂的非线性相互作用关系，并越来越多地用于预测区域和国家尺度上的产量变化。将可解释的人工智能(XAI)方法应用于此类算法，可以更好地科学理解产量变化的驱动因素。然而，当应用于时空相关数据集时，XAI方法可能会提供误导性的结果。在本研究中，通过训练机器学习模型来根据气候指数预测模拟作物产量，并评估模型评估策略对结果模型的解释和性能的影响。使用来自基于过程的裁剪模型的数据，我们可以对XAI方法提供的“解释”的合理性进行评论。我们的研究结果表明，在使用评估策略进行超参数调整和特征选择后，评估策略的选择会影响(i)模型的解释和(ii)模型技能对保留年份和地区的影响。我们发现，在特征空间中使用基于聚类的交叉验证策略可以获得最合理的解释，以及在滞留年份和区域上的最佳模型性能。我们的结果为在时空农业或气候数据上使用XAI工具确定特定领域的“最佳实践”提供了第一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial intelligence for the earth systems

自引率

0.00%

发文量