Lily-belle Sweet, Christoph Müller, Mohit Anand, J. Zscheischler
{"title":"交叉验证策略影响机器学习模型的性能和解释","authors":"Lily-belle Sweet, Christoph Müller, Mohit Anand, J. Zscheischler","doi":"10.1175/aies-d-23-0026.1","DOIUrl":null,"url":null,"abstract":"\nMachine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"32 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Cross-validation strategy impacts the performance and interpretation of machine learning models\",\"authors\":\"Lily-belle Sweet, Christoph Müller, Mohit Anand, J. Zscheischler\",\"doi\":\"10.1175/aies-d-23-0026.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\nMachine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.\",\"PeriodicalId\":94369,\"journal\":{\"name\":\"Artificial intelligence for the earth systems\",\"volume\":\"32 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence for the earth systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1175/aies-d-23-0026.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence for the earth systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1175/aies-d-23-0026.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cross-validation strategy impacts the performance and interpretation of machine learning models
Machine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.