Modeling network of research sites for monitoring carbon flows by Random Forest method

V. Dobryakova, N. Moskvina, Andrey B. Dobryakov, L. Zhegalina
{"title":"Modeling network of research sites for monitoring carbon flows by Random Forest method","authors":"V. Dobryakova, N. Moskvina, Andrey B. Dobryakov, L. Zhegalina","doi":"10.35595/2414-9179-2022-1-28-645-658","DOIUrl":null,"url":null,"abstract":"Environmental observing networks provide information for understanding and predicting the spatial and temporal dynamics of Earth biophysical processes. The optimization of resources for large-scale environmental monitoring activities is required. The paper describes and then tests spatial structure of Tyumen region research sites network. The network is based on principles of landscape approach, taking into account cost minimization. At the baseline of research, two testing sets of 40 and 105 points were determined. Proposed locations were evaluated using Random Forest (RF) method. The study accomplished in two stages for each test set. At the first stage, the model was trained; its capacity and indicators of additional diagnostics were studied. At the second stage, the trained model was used to predict the points formed of regular grid covering entire territory of this region (544 points). In conclusion, the obtained results were compared with similar point sets of the same volume but generated randomly. Primary Productivity Gross (GPP) was chosen as predictable variable because it is one of the major complex environmental indicators associated with carbon production in this area. The ability of an area to absorb or produce carbon is one of the main parameters that determine climate processes. As independent variables characterizing geosystemic processes, a set of indicators associated with climate, terrain parameters, and variability of soil resources has been selected. The problem was solved using Forest-Based Classification and Regression tool from Spatial Statistics—Modeling Spatial Relationships toolkit of ArcGIS Pro software package. As the result of the study, a high forecast accuracy and reliability for both approaches to research sites locations was obtained. The study was based on open source data.","PeriodicalId":31498,"journal":{"name":"InterCarto InterGIS","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"InterCarto InterGIS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35595/2414-9179-2022-1-28-645-658","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Environmental observing networks provide information for understanding and predicting the spatial and temporal dynamics of Earth biophysical processes. The optimization of resources for large-scale environmental monitoring activities is required. The paper describes and then tests spatial structure of Tyumen region research sites network. The network is based on principles of landscape approach, taking into account cost minimization. At the baseline of research, two testing sets of 40 and 105 points were determined. Proposed locations were evaluated using Random Forest (RF) method. The study accomplished in two stages for each test set. At the first stage, the model was trained; its capacity and indicators of additional diagnostics were studied. At the second stage, the trained model was used to predict the points formed of regular grid covering entire territory of this region (544 points). In conclusion, the obtained results were compared with similar point sets of the same volume but generated randomly. Primary Productivity Gross (GPP) was chosen as predictable variable because it is one of the major complex environmental indicators associated with carbon production in this area. The ability of an area to absorb or produce carbon is one of the main parameters that determine climate processes. As independent variables characterizing geosystemic processes, a set of indicators associated with climate, terrain parameters, and variability of soil resources has been selected. The problem was solved using Forest-Based Classification and Regression tool from Spatial Statistics—Modeling Spatial Relationships toolkit of ArcGIS Pro software package. As the result of the study, a high forecast accuracy and reliability for both approaches to research sites locations was obtained. The study was based on open source data.
基于随机森林方法的碳流监测研究站点网络建模
环境观测网络为理解和预测地球生物物理过程的时空动态提供了信息。需要为大规模环境监测活动优化资源。本文对秋明地区研究站点网络的空间结构进行了描述和检验。该网络基于景观方法的原则,考虑到成本最小化。在研究基线时,确定了40分和105分两个测试集。采用随机森林(RF)方法对建议的地点进行评估。每个测试集的研究分两个阶段完成。第一阶段,对模型进行训练;对其附加诊断能力和指标进行了研究。第二阶段,使用训练好的模型预测覆盖整个区域的规则网格(544个点)所形成的点。最后,将得到的结果与随机生成的相同体积的相似点集进行比较。选择初级生产力总值(GPP)作为可预测变量,是因为它是与该地区碳产量相关的主要复杂环境指标之一。一个地区吸收或产生碳的能力是决定气候过程的主要参数之一。作为表征地球系统过程的自变量,选择了一套与气候、地形参数和土壤资源变异相关的指标。利用ArcGIS Pro软件包中的空间统计-空间关系建模工具包中的基于森林的分类与回归工具解决了这一问题。研究结果表明,这两种方法对研究地点的预测精度和可靠性都很高。这项研究基于开源数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.90
自引率
0.00%
发文量
2
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信