GRRIEn分析:供地球科学家从全球地球观测中学习的数据科学小抄

Elizabeth Carter, C. Hultquist, T. Wen
{"title":"GRRIEn分析:供地球科学家从全球地球观测中学习的数据科学小抄","authors":"Elizabeth Carter, C. Hultquist, T. Wen","doi":"10.1175/aies-d-22-0065.1","DOIUrl":null,"url":null,"abstract":"\nGlobally available environmental observations (EOs), specifically from satellites and coupled earth systems models, represent some of the largest datasets of the digital age. As the volume of global EOs continues to grow, so does the potential of this data to help earth scientists discover trends and patterns in earth systems at large spatial scales. To leverage global EOs for scientific insight, earth scientists need targeted and accessible exposure to skills in reproducible scientific computing and spatiotemporal data science, and to be empowered to apply their domain understanding to interpret data-driven models for knowledge discovery. The GRRIEn (Generalizable, Reproducible, Robust, and Interpreted Environmental) analysis framework was developed to prepare earth scientists with an introductory statistics background and limited/no understanding of programming and computational methods to use global EOs to successfully generalize insights from local/regional field measurements across unsampled times and locations. GRRIEn analysis is generalizable, meaning results from a sample are translated to landscape scales by combining direct environmental measurements with global EOs using supervised machine learning; robust, meaning that model shows good performance on data with scale-dependent feature and observation dependence; reproducible, based on a standard repository structure so that other scientists can quickly and easily replicate the analysis with a few computational tools; and interpreted, meaning that earth scientists apply domain expertise to ensure that model parameters reflect a physically plausible diagnosis of the environmental system. This tutorial presents standard steps for achieving GRRIEn analysis by combining conventions of rigor in traditional experimental design with the open-science movement.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"33 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GRRIEn analysis: a data science cheat sheet for earth scientists learning from global earth observations\",\"authors\":\"Elizabeth Carter, C. Hultquist, T. Wen\",\"doi\":\"10.1175/aies-d-22-0065.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\nGlobally available environmental observations (EOs), specifically from satellites and coupled earth systems models, represent some of the largest datasets of the digital age. As the volume of global EOs continues to grow, so does the potential of this data to help earth scientists discover trends and patterns in earth systems at large spatial scales. To leverage global EOs for scientific insight, earth scientists need targeted and accessible exposure to skills in reproducible scientific computing and spatiotemporal data science, and to be empowered to apply their domain understanding to interpret data-driven models for knowledge discovery. The GRRIEn (Generalizable, Reproducible, Robust, and Interpreted Environmental) analysis framework was developed to prepare earth scientists with an introductory statistics background and limited/no understanding of programming and computational methods to use global EOs to successfully generalize insights from local/regional field measurements across unsampled times and locations. GRRIEn analysis is generalizable, meaning results from a sample are translated to landscape scales by combining direct environmental measurements with global EOs using supervised machine learning; robust, meaning that model shows good performance on data with scale-dependent feature and observation dependence; reproducible, based on a standard repository structure so that other scientists can quickly and easily replicate the analysis with a few computational tools; and interpreted, meaning that earth scientists apply domain expertise to ensure that model parameters reflect a physically plausible diagnosis of the environmental system. This tutorial presents standard steps for achieving GRRIEn analysis by combining conventions of rigor in traditional experimental design with the open-science movement.\",\"PeriodicalId\":94369,\"journal\":{\"name\":\"Artificial intelligence for the earth systems\",\"volume\":\"33 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence for the earth systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1175/aies-d-22-0065.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence for the earth systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1175/aies-d-22-0065.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

全球可获得的环境观测(EOs),特别是来自卫星和耦合地球系统模型的观测,代表了数字时代一些最大的数据集。随着全球生态系统的数量持续增长,这些数据在帮助地球科学家在大空间尺度上发现地球系统的趋势和模式方面的潜力也在不断增加。为了利用全球EOs获得科学洞察力,地球科学家需要有针对性和可访问的可重复科学计算和时空数据科学技能,并被授权应用他们的领域理解来解释数据驱动的模型,以进行知识发现。GRRIEn (generizable, reproducibility, Robust, and interpret environment)分析框架的开发是为了让具有入门统计学背景和对编程和计算方法有限或没有理解的地球科学家准备好使用全球EOs来成功地概括来自未采样时间和地点的局部/区域现场测量的见解。GRRIEn分析是可推广的,这意味着通过使用监督机器学习将直接环境测量与全球EOs相结合,将样本结果转化为景观尺度;鲁棒性,即模型对具有尺度依赖特征和观测依赖的数据表现出良好的性能;可重复性,基于标准存储库结构,以便其他科学家可以使用一些计算工具快速轻松地复制分析;这意味着地球科学家运用该领域的专业知识来确保模型参数反映了对环境系统的物理上合理的诊断。本教程通过将传统实验设计中的严格惯例与开放科学运动相结合,介绍了实现GRRIEn分析的标准步骤。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
GRRIEn analysis: a data science cheat sheet for earth scientists learning from global earth observations
Globally available environmental observations (EOs), specifically from satellites and coupled earth systems models, represent some of the largest datasets of the digital age. As the volume of global EOs continues to grow, so does the potential of this data to help earth scientists discover trends and patterns in earth systems at large spatial scales. To leverage global EOs for scientific insight, earth scientists need targeted and accessible exposure to skills in reproducible scientific computing and spatiotemporal data science, and to be empowered to apply their domain understanding to interpret data-driven models for knowledge discovery. The GRRIEn (Generalizable, Reproducible, Robust, and Interpreted Environmental) analysis framework was developed to prepare earth scientists with an introductory statistics background and limited/no understanding of programming and computational methods to use global EOs to successfully generalize insights from local/regional field measurements across unsampled times and locations. GRRIEn analysis is generalizable, meaning results from a sample are translated to landscape scales by combining direct environmental measurements with global EOs using supervised machine learning; robust, meaning that model shows good performance on data with scale-dependent feature and observation dependence; reproducible, based on a standard repository structure so that other scientists can quickly and easily replicate the analysis with a few computational tools; and interpreted, meaning that earth scientists apply domain expertise to ensure that model parameters reflect a physically plausible diagnosis of the environmental system. This tutorial presents standard steps for achieving GRRIEn analysis by combining conventions of rigor in traditional experimental design with the open-science movement.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信