{"title":"基于机器学习和深度学习算法的不同土地利用类型土壤有机碳含量变异性分析及其数字制图","authors":"Mounir Oukhattar, Sébastien Gadal, Yannick Robert, Nicolas Saby, Ismaguil Hanadé Houmma, Catherine Keller","doi":"10.1007/s10661-025-13972-0","DOIUrl":null,"url":null,"abstract":"<div><p>Soil organic carbon (SOC) plays a crucial role in carbon cycle management and soil fertility. Understanding the spatial variations in SOC content is vital for supporting sustainable soil resource management. In this study, we analyzed the variability in SOC content across eleven different types of land use in the mining basin of Provence in southeastern France. We modelled this variability spatially using machine and deep learning regression. Four algorithms were tested: random forest (RF), support vector machine (SVM), extreme gradient boosting (XGBoost), and deep neural networks (DNNs). These integrated 162 soil samples and 21 environmental covariates, including climatic parameters, lithology, topographical features, land cover, remote sensing data, and soil physicochemical parameters. The results clearly show a large variability in SOC content across land use types, with forests revealing the highest values (mean of 69.3 g/kg) and arable land the lowest (mean of 8.9 g/kg). The Pearson correlation coefficients (<i>R</i>) indicate that land cover, topography, lithology, environmental indices, and clay content are the main factors influencing the SOC content. The XGBoost model generated the best result (<i>R</i><sup>2</sup> = 0.73), closely followed by RF (<i>R</i><sup>2</sup> = 0.68) and DNN (<i>R</i><sup>2</sup> = 0.60), while SVM showed the weakest performance (<i>R</i><sup>2</sup> = 0.36). XGBoost and RF remain the best options for obtaining reliable results with a limited number of soil samples and reduced calculation time. The results of this study provide vital insights for managing soil organic carbon in southeastern France and for climate change mitigation in sustainable land management.</p></div>","PeriodicalId":544,"journal":{"name":"Environmental Monitoring and Assessment","volume":"197 5","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Variability analysis of soil organic carbon content across land use types and its digital mapping using machine learning and deep learning algorithms\",\"authors\":\"Mounir Oukhattar, Sébastien Gadal, Yannick Robert, Nicolas Saby, Ismaguil Hanadé Houmma, Catherine Keller\",\"doi\":\"10.1007/s10661-025-13972-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Soil organic carbon (SOC) plays a crucial role in carbon cycle management and soil fertility. Understanding the spatial variations in SOC content is vital for supporting sustainable soil resource management. In this study, we analyzed the variability in SOC content across eleven different types of land use in the mining basin of Provence in southeastern France. We modelled this variability spatially using machine and deep learning regression. Four algorithms were tested: random forest (RF), support vector machine (SVM), extreme gradient boosting (XGBoost), and deep neural networks (DNNs). These integrated 162 soil samples and 21 environmental covariates, including climatic parameters, lithology, topographical features, land cover, remote sensing data, and soil physicochemical parameters. The results clearly show a large variability in SOC content across land use types, with forests revealing the highest values (mean of 69.3 g/kg) and arable land the lowest (mean of 8.9 g/kg). The Pearson correlation coefficients (<i>R</i>) indicate that land cover, topography, lithology, environmental indices, and clay content are the main factors influencing the SOC content. The XGBoost model generated the best result (<i>R</i><sup>2</sup> = 0.73), closely followed by RF (<i>R</i><sup>2</sup> = 0.68) and DNN (<i>R</i><sup>2</sup> = 0.60), while SVM showed the weakest performance (<i>R</i><sup>2</sup> = 0.36). XGBoost and RF remain the best options for obtaining reliable results with a limited number of soil samples and reduced calculation time. The results of this study provide vital insights for managing soil organic carbon in southeastern France and for climate change mitigation in sustainable land management.</p></div>\",\"PeriodicalId\":544,\"journal\":{\"name\":\"Environmental Monitoring and Assessment\",\"volume\":\"197 5\",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Monitoring and Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10661-025-13972-0\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Monitoring and Assessment","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s10661-025-13972-0","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Variability analysis of soil organic carbon content across land use types and its digital mapping using machine learning and deep learning algorithms
Soil organic carbon (SOC) plays a crucial role in carbon cycle management and soil fertility. Understanding the spatial variations in SOC content is vital for supporting sustainable soil resource management. In this study, we analyzed the variability in SOC content across eleven different types of land use in the mining basin of Provence in southeastern France. We modelled this variability spatially using machine and deep learning regression. Four algorithms were tested: random forest (RF), support vector machine (SVM), extreme gradient boosting (XGBoost), and deep neural networks (DNNs). These integrated 162 soil samples and 21 environmental covariates, including climatic parameters, lithology, topographical features, land cover, remote sensing data, and soil physicochemical parameters. The results clearly show a large variability in SOC content across land use types, with forests revealing the highest values (mean of 69.3 g/kg) and arable land the lowest (mean of 8.9 g/kg). The Pearson correlation coefficients (R) indicate that land cover, topography, lithology, environmental indices, and clay content are the main factors influencing the SOC content. The XGBoost model generated the best result (R2 = 0.73), closely followed by RF (R2 = 0.68) and DNN (R2 = 0.60), while SVM showed the weakest performance (R2 = 0.36). XGBoost and RF remain the best options for obtaining reliable results with a limited number of soil samples and reduced calculation time. The results of this study provide vital insights for managing soil organic carbon in southeastern France and for climate change mitigation in sustainable land management.
期刊介绍:
Environmental Monitoring and Assessment emphasizes technical developments and data arising from environmental monitoring and assessment, the use of scientific principles in the design of monitoring systems at the local, regional and global scales, and the use of monitoring data in assessing the consequences of natural resource management actions and pollution risks to man and the environment.