可解释的机器学习如何促进地质科学的过程理解

IF 7.3 1区 地球科学 Q1 ENVIRONMENTAL SCIENCES
Earths Future Pub Date : 2024-07-15 DOI:10.1029/2024EF004540
Shijie Jiang, Lily-belle Sweet, Georgios Blougouras, Alexander Brenning, Wantong Li, Markus Reichstein, Joachim Denzler, Wei Shangguan, Guo Yu, Feini Huang, Jakob Zscheischler
{"title":"可解释的机器学习如何促进地质科学的过程理解","authors":"Shijie Jiang,&nbsp;Lily-belle Sweet,&nbsp;Georgios Blougouras,&nbsp;Alexander Brenning,&nbsp;Wantong Li,&nbsp;Markus Reichstein,&nbsp;Joachim Denzler,&nbsp;Wei Shangguan,&nbsp;Guo Yu,&nbsp;Feini Huang,&nbsp;Jakob Zscheischler","doi":"10.1029/2024EF004540","DOIUrl":null,"url":null,"abstract":"<p>Interpretable Machine Learning (IML) has rapidly advanced in recent years, offering new opportunities to improve our understanding of the complex Earth system. IML goes beyond conventional machine learning by not only making predictions but also seeking to elucidate the reasoning behind those predictions. The combination of predictive power and enhanced transparency makes IML a promising approach for uncovering relationships in data that may be overlooked by traditional analysis. Despite its potential, the broader implications for the field have yet to be fully appreciated. Meanwhile, the rapid proliferation of IML, still in its early stages, has been accompanied by instances of careless application. In response to these challenges, this paper focuses on how IML can effectively and appropriately aid geoscientists in advancing process understanding—areas that are often underexplored in more technical discussions of IML. Specifically, we identify pragmatic application scenarios for IML in typical geoscientific studies, such as quantifying relationships in specific contexts, generating hypotheses about potential mechanisms, and evaluating process-based models. Moreover, we present a general and practical workflow for using IML to address specific research questions. In particular, we identify several critical and common pitfalls in the use of IML that can lead to misleading conclusions, and propose corresponding good practices. Our goal is to facilitate a broader, yet more careful and thoughtful integration of IML into Earth science research, positioning it as a valuable data science tool capable of enhancing our current understanding of the Earth system.</p>","PeriodicalId":48748,"journal":{"name":"Earths Future","volume":null,"pages":null},"PeriodicalIF":7.3000,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024EF004540","citationCount":"0","resultStr":"{\"title\":\"How Interpretable Machine Learning Can Benefit Process Understanding in the Geosciences\",\"authors\":\"Shijie Jiang,&nbsp;Lily-belle Sweet,&nbsp;Georgios Blougouras,&nbsp;Alexander Brenning,&nbsp;Wantong Li,&nbsp;Markus Reichstein,&nbsp;Joachim Denzler,&nbsp;Wei Shangguan,&nbsp;Guo Yu,&nbsp;Feini Huang,&nbsp;Jakob Zscheischler\",\"doi\":\"10.1029/2024EF004540\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Interpretable Machine Learning (IML) has rapidly advanced in recent years, offering new opportunities to improve our understanding of the complex Earth system. IML goes beyond conventional machine learning by not only making predictions but also seeking to elucidate the reasoning behind those predictions. The combination of predictive power and enhanced transparency makes IML a promising approach for uncovering relationships in data that may be overlooked by traditional analysis. Despite its potential, the broader implications for the field have yet to be fully appreciated. Meanwhile, the rapid proliferation of IML, still in its early stages, has been accompanied by instances of careless application. In response to these challenges, this paper focuses on how IML can effectively and appropriately aid geoscientists in advancing process understanding—areas that are often underexplored in more technical discussions of IML. Specifically, we identify pragmatic application scenarios for IML in typical geoscientific studies, such as quantifying relationships in specific contexts, generating hypotheses about potential mechanisms, and evaluating process-based models. Moreover, we present a general and practical workflow for using IML to address specific research questions. In particular, we identify several critical and common pitfalls in the use of IML that can lead to misleading conclusions, and propose corresponding good practices. Our goal is to facilitate a broader, yet more careful and thoughtful integration of IML into Earth science research, positioning it as a valuable data science tool capable of enhancing our current understanding of the Earth system.</p>\",\"PeriodicalId\":48748,\"journal\":{\"name\":\"Earths Future\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2024-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024EF004540\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Earths Future\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1029/2024EF004540\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earths Future","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2024EF004540","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

可解释机器学习(IML)近年来发展迅速,为增进我们对复杂地球系统的了解提供了新的机遇。IML 超越了传统的机器学习,它不仅能做出预测,还能阐明这些预测背后的推理。预测能力和更高的透明度相结合,使 IML 成为一种很有前途的方法,可以揭示传统分析可能忽略的数据关系。尽管 IML 潜力巨大,但其对该领域的广泛影响仍有待充分认识。与此同时,IML 仍处于早期阶段,在其迅速普及的同时,也出现了应用不慎的情况。为了应对这些挑战,本文重点讨论了 IML 如何有效、适当地帮助地球科学家加深对过程的理解--而这些领域在关于 IML 的技术讨论中往往未得到充分探讨。具体来说,我们确定了 IML 在典型地球科学研究中的实用应用场景,如量化特定环境中的关系、生成潜在机制假设以及评估基于过程的模型。此外,我们还介绍了使用 IML 解决特定研究问题的通用实用工作流程。特别是,我们指出了使用 IML 时可能导致误导性结论的几个关键和常见陷阱,并提出了相应的良好做法。我们的目标是促进 IML 更广泛、更仔细、更周到地融入地球科学研究,将其定位为一种有价值的数据科学工具,能够增强我们目前对地球系统的了解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

How Interpretable Machine Learning Can Benefit Process Understanding in the Geosciences

How Interpretable Machine Learning Can Benefit Process Understanding in the Geosciences

Interpretable Machine Learning (IML) has rapidly advanced in recent years, offering new opportunities to improve our understanding of the complex Earth system. IML goes beyond conventional machine learning by not only making predictions but also seeking to elucidate the reasoning behind those predictions. The combination of predictive power and enhanced transparency makes IML a promising approach for uncovering relationships in data that may be overlooked by traditional analysis. Despite its potential, the broader implications for the field have yet to be fully appreciated. Meanwhile, the rapid proliferation of IML, still in its early stages, has been accompanied by instances of careless application. In response to these challenges, this paper focuses on how IML can effectively and appropriately aid geoscientists in advancing process understanding—areas that are often underexplored in more technical discussions of IML. Specifically, we identify pragmatic application scenarios for IML in typical geoscientific studies, such as quantifying relationships in specific contexts, generating hypotheses about potential mechanisms, and evaluating process-based models. Moreover, we present a general and practical workflow for using IML to address specific research questions. In particular, we identify several critical and common pitfalls in the use of IML that can lead to misleading conclusions, and propose corresponding good practices. Our goal is to facilitate a broader, yet more careful and thoughtful integration of IML into Earth science research, positioning it as a valuable data science tool capable of enhancing our current understanding of the Earth system.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Earths Future
Earths Future ENVIRONMENTAL SCIENCESGEOSCIENCES, MULTIDI-GEOSCIENCES, MULTIDISCIPLINARY
CiteScore
11.00
自引率
7.30%
发文量
260
审稿时长
16 weeks
期刊介绍: Earth’s Future: A transdisciplinary open access journal, Earth’s Future focuses on the state of the Earth and the prediction of the planet’s future. By publishing peer-reviewed articles as well as editorials, essays, reviews, and commentaries, this journal will be the preeminent scholarly resource on the Anthropocene. It will also help assess the risks and opportunities associated with environmental changes and challenges.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信