Yue Yin , Bingbo Gao , Hao Xu , Yuxue Wang , Dongkai Xie , Yanqing Liu , Chenyi Wang
{"title":"考虑空间异质性的复杂地形土壤有机质制图","authors":"Yue Yin , Bingbo Gao , Hao Xu , Yuxue Wang , Dongkai Xie , Yanqing Liu , Chenyi Wang","doi":"10.1016/j.envsoft.2025.106569","DOIUrl":null,"url":null,"abstract":"<div><div>The intricate topography and weak spatial autocorrelation in mountainous areas contribute to strong local and directional heterogeneity in the spatial distribution of soil organic matter (SOM). The relationships between SOM and auxiliary variables also exhibit spatial disparities. This mixed heterogeneity seriously affects the prediction accuracy of SOM's spatial distribution. Furthermore, the high cost and challenges associated with sampling in mountainous areas result in limited availability, sparseness, and uneven spatial distribution of soil samples, thereby intensifying the difficulty of precise spatial prediction. The newly developed two-point machine learning method (TPML) adeptly manages local heterogeneity and heterogeneous relationships by a two-step modeling approach, but its application in addressing directional heterogeneity remains unexplored. This study investigates whether explicitly integrating directional information between two points as an auxiliary variable in the TPML modeling process can enhance the prediction accuracy of SOM in complex terrains characterized by small sample sizes. In this study, multiple sets of comparative experiments were conducted to assess the accuracy of various methodologies, including TPML, ordinary kriging, random forest, and random forest regression kriging. The results indicate that (1) TPML can capture the local and directional heterogeneity in the distribution of SOM in mountainous areas, addressing the spatially varying relationship between SOM and auxiliary variables. (2) TPML demonstrates the capacity to characterize the directional heterogeneity of SOM even without the inclusion of directional information as an auxiliary variable. (3) Through cross-validation, TPML emerges as the most accurate predictive method. Mapping outcomes reveal that TPML can produce precise and coherent spatial distribution maps of SOM with fine spatial details.</div></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"192 ","pages":"Article 106569"},"PeriodicalIF":4.6000,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Soil organic matter mapping in complex terrains considering spatial heterogeneity\",\"authors\":\"Yue Yin , Bingbo Gao , Hao Xu , Yuxue Wang , Dongkai Xie , Yanqing Liu , Chenyi Wang\",\"doi\":\"10.1016/j.envsoft.2025.106569\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The intricate topography and weak spatial autocorrelation in mountainous areas contribute to strong local and directional heterogeneity in the spatial distribution of soil organic matter (SOM). The relationships between SOM and auxiliary variables also exhibit spatial disparities. This mixed heterogeneity seriously affects the prediction accuracy of SOM's spatial distribution. Furthermore, the high cost and challenges associated with sampling in mountainous areas result in limited availability, sparseness, and uneven spatial distribution of soil samples, thereby intensifying the difficulty of precise spatial prediction. The newly developed two-point machine learning method (TPML) adeptly manages local heterogeneity and heterogeneous relationships by a two-step modeling approach, but its application in addressing directional heterogeneity remains unexplored. This study investigates whether explicitly integrating directional information between two points as an auxiliary variable in the TPML modeling process can enhance the prediction accuracy of SOM in complex terrains characterized by small sample sizes. In this study, multiple sets of comparative experiments were conducted to assess the accuracy of various methodologies, including TPML, ordinary kriging, random forest, and random forest regression kriging. The results indicate that (1) TPML can capture the local and directional heterogeneity in the distribution of SOM in mountainous areas, addressing the spatially varying relationship between SOM and auxiliary variables. (2) TPML demonstrates the capacity to characterize the directional heterogeneity of SOM even without the inclusion of directional information as an auxiliary variable. (3) Through cross-validation, TPML emerges as the most accurate predictive method. Mapping outcomes reveal that TPML can produce precise and coherent spatial distribution maps of SOM with fine spatial details.</div></div>\",\"PeriodicalId\":310,\"journal\":{\"name\":\"Environmental Modelling & Software\",\"volume\":\"192 \",\"pages\":\"Article 106569\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Modelling & Software\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1364815225002531\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1364815225002531","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Soil organic matter mapping in complex terrains considering spatial heterogeneity
The intricate topography and weak spatial autocorrelation in mountainous areas contribute to strong local and directional heterogeneity in the spatial distribution of soil organic matter (SOM). The relationships between SOM and auxiliary variables also exhibit spatial disparities. This mixed heterogeneity seriously affects the prediction accuracy of SOM's spatial distribution. Furthermore, the high cost and challenges associated with sampling in mountainous areas result in limited availability, sparseness, and uneven spatial distribution of soil samples, thereby intensifying the difficulty of precise spatial prediction. The newly developed two-point machine learning method (TPML) adeptly manages local heterogeneity and heterogeneous relationships by a two-step modeling approach, but its application in addressing directional heterogeneity remains unexplored. This study investigates whether explicitly integrating directional information between two points as an auxiliary variable in the TPML modeling process can enhance the prediction accuracy of SOM in complex terrains characterized by small sample sizes. In this study, multiple sets of comparative experiments were conducted to assess the accuracy of various methodologies, including TPML, ordinary kriging, random forest, and random forest regression kriging. The results indicate that (1) TPML can capture the local and directional heterogeneity in the distribution of SOM in mountainous areas, addressing the spatially varying relationship between SOM and auxiliary variables. (2) TPML demonstrates the capacity to characterize the directional heterogeneity of SOM even without the inclusion of directional information as an auxiliary variable. (3) Through cross-validation, TPML emerges as the most accurate predictive method. Mapping outcomes reveal that TPML can produce precise and coherent spatial distribution maps of SOM with fine spatial details.
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.