基于随机森林的三江源区土壤温度数据集。

IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Xiaoqing Tan, Siqiong Luo, Hongmei Li, Zhuoqun Li, Qingxue Dong
{"title":"基于随机森林的三江源区土壤温度数据集。","authors":"Xiaoqing Tan, Siqiong Luo, Hongmei Li, Zhuoqun Li, Qingxue Dong","doi":"10.1038/s41597-025-04910-3","DOIUrl":null,"url":null,"abstract":"<p><p>Changes in soil temperature (ST) in the Three River Source Region (TRSR) significantly influence regional climate, ecology, and hydrological processes. However, existing models and reanalysis data exhibit considerable deviations in ST due to limitations in physical processes and parameterization schemes. To address this issue, we developed a new ST dataset using the Random Forest method (RFST), integrating observed ST data with relevant gridded datasets. RFST provides monthly ST data at nine layers with a spatial resolution of 0.01° × 0.01° from 1982 to 2015. Validation against two soil observation networks and six meteorological stations shows that the Nash-Sutcliffe Efficiency (NSE) of RFST exceeds 0.7 at all depths. Compared to ERA5 and CRA40, RFST corrects the cold bias, improves NSE, and reduces RMSE from 4 °C-8 °C to 1 °C-2 °C. RFST not only corrects the underestimation of ST and its warming rate but also aligns more closely with observed values for surface freezing and thawing indices as well as soil freeze-thaw periods, providing a more accurate representation of soil thermal conditions in the TRSR.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"882"},"PeriodicalIF":6.9000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12116792/pdf/","citationCount":"0","resultStr":"{\"title\":\"A soil temperature dataset based on random forest in the Three River Source Region.\",\"authors\":\"Xiaoqing Tan, Siqiong Luo, Hongmei Li, Zhuoqun Li, Qingxue Dong\",\"doi\":\"10.1038/s41597-025-04910-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Changes in soil temperature (ST) in the Three River Source Region (TRSR) significantly influence regional climate, ecology, and hydrological processes. However, existing models and reanalysis data exhibit considerable deviations in ST due to limitations in physical processes and parameterization schemes. To address this issue, we developed a new ST dataset using the Random Forest method (RFST), integrating observed ST data with relevant gridded datasets. RFST provides monthly ST data at nine layers with a spatial resolution of 0.01° × 0.01° from 1982 to 2015. Validation against two soil observation networks and six meteorological stations shows that the Nash-Sutcliffe Efficiency (NSE) of RFST exceeds 0.7 at all depths. Compared to ERA5 and CRA40, RFST corrects the cold bias, improves NSE, and reduces RMSE from 4 °C-8 °C to 1 °C-2 °C. RFST not only corrects the underestimation of ST and its warming rate but also aligns more closely with observed values for surface freezing and thawing indices as well as soil freeze-thaw periods, providing a more accurate representation of soil thermal conditions in the TRSR.</p>\",\"PeriodicalId\":21597,\"journal\":{\"name\":\"Scientific Data\",\"volume\":\"12 1\",\"pages\":\"882\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12116792/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Data\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41597-025-04910-3\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-04910-3","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

三江源区土壤温度变化对区域气候、生态和水文过程具有重要影响。然而,由于物理过程和参数化方案的限制,现有模型和再分析数据在温度上显示出相当大的偏差。为了解决这个问题,我们使用随机森林方法(RFST)开发了一个新的ST数据集,将观测到的ST数据与相关的网格数据集整合在一起。RFST提供1982 - 2015年9层逐月温度数据,空间分辨率为0.01°× 0.01°。对2个土壤观测网和6个气象站的验证表明,RFST在所有深度的NSE均超过0.7。与ERA5和CRA40相比,RFST纠正了冷偏置,提高了NSE,并将RMSE从4°C-8°C降低到1°C-2°C。RFST不仅修正了对温度及其升温速率的低估,而且与地表冻融指数和土壤冻融期的观测值更接近,从而更准确地反映了TRSR中土壤热条件。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A soil temperature dataset based on random forest in the Three River Source Region.

Changes in soil temperature (ST) in the Three River Source Region (TRSR) significantly influence regional climate, ecology, and hydrological processes. However, existing models and reanalysis data exhibit considerable deviations in ST due to limitations in physical processes and parameterization schemes. To address this issue, we developed a new ST dataset using the Random Forest method (RFST), integrating observed ST data with relevant gridded datasets. RFST provides monthly ST data at nine layers with a spatial resolution of 0.01° × 0.01° from 1982 to 2015. Validation against two soil observation networks and six meteorological stations shows that the Nash-Sutcliffe Efficiency (NSE) of RFST exceeds 0.7 at all depths. Compared to ERA5 and CRA40, RFST corrects the cold bias, improves NSE, and reduces RMSE from 4 °C-8 °C to 1 °C-2 °C. RFST not only corrects the underestimation of ST and its warming rate but also aligns more closely with observed values for surface freezing and thawing indices as well as soil freeze-thaw periods, providing a more accurate representation of soil thermal conditions in the TRSR.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Scientific Data
Scientific Data Social Sciences-Education
CiteScore
11.20
自引率
4.10%
发文量
689
审稿时长
16 weeks
期刊介绍: Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data. The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信