{"title":"Assessing the performance of machine learning models for predicting soil organic carbon variability across diverse landforms","authors":"Maryam Dadgar, Seyedeh Ensieh Faramarzi","doi":"10.1007/s12665-024-11960-0","DOIUrl":null,"url":null,"abstract":"<div><p>Soil organic carbon (SOC) is an essential soil property that plays an important role in sustainable agricultural production. Recently, there has been considerable interest in utilizing data mining and spatial modeling techniques for SOC estimation through machine learning methods, leveraging remote sensing data and terrain attributes. This study aimed to evaluate and compare several machine learning techniques, specifically Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost), for predicting SOC levels across various landforms in northwestern Iran. A total of 402 soil samples were collected, and their SOC content was measured. Furthermore, remote sensing indices obtained from Landsat 8 satellite imagery and terrain attributes from digital elevation models were used. The measured and predicted SOC values generated from the machine learning methods were compared across different landforms. The results indicated that the RF method achieved the highest accuracy in predicting SOC (R² = 0.84, RMSE = 0.04, AIC = −825, BIC = −840). Spatial distribution analysis revealed that only a small portion of the study area exhibited high SOC content, while most of the region had SOC content below 1%. Moreover, a comparison means values of SOC across different landforms indicated that SOC content in upper slope landforms were significantly lower than those in other landforms. Finally, the comparison of measured and predicted values across the three models showed that the RF method provided results closely aligned with the actual SOC content across all examined landforms. This study emphasizes that enhanced techniques for evaluating soil properties mark a notable progression in soil modeling, facilitating better management of soil resources.</p></div>","PeriodicalId":542,"journal":{"name":"Environmental Earth Sciences","volume":"83 23","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Earth Sciences","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s12665-024-11960-0","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Soil organic carbon (SOC) is an essential soil property that plays an important role in sustainable agricultural production. Recently, there has been considerable interest in utilizing data mining and spatial modeling techniques for SOC estimation through machine learning methods, leveraging remote sensing data and terrain attributes. This study aimed to evaluate and compare several machine learning techniques, specifically Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost), for predicting SOC levels across various landforms in northwestern Iran. A total of 402 soil samples were collected, and their SOC content was measured. Furthermore, remote sensing indices obtained from Landsat 8 satellite imagery and terrain attributes from digital elevation models were used. The measured and predicted SOC values generated from the machine learning methods were compared across different landforms. The results indicated that the RF method achieved the highest accuracy in predicting SOC (R² = 0.84, RMSE = 0.04, AIC = −825, BIC = −840). Spatial distribution analysis revealed that only a small portion of the study area exhibited high SOC content, while most of the region had SOC content below 1%. Moreover, a comparison means values of SOC across different landforms indicated that SOC content in upper slope landforms were significantly lower than those in other landforms. Finally, the comparison of measured and predicted values across the three models showed that the RF method provided results closely aligned with the actual SOC content across all examined landforms. This study emphasizes that enhanced techniques for evaluating soil properties mark a notable progression in soil modeling, facilitating better management of soil resources.
期刊介绍:
Environmental Earth Sciences is an international multidisciplinary journal concerned with all aspects of interaction between humans, natural resources, ecosystems, special climates or unique geographic zones, and the earth:
Water and soil contamination caused by waste management and disposal practices
Environmental problems associated with transportation by land, air, or water
Geological processes that may impact biosystems or humans
Man-made or naturally occurring geological or hydrological hazards
Environmental problems associated with the recovery of materials from the earth
Environmental problems caused by extraction of minerals, coal, and ores, as well as oil and gas, water and alternative energy sources
Environmental impacts of exploration and recultivation – Environmental impacts of hazardous materials
Management of environmental data and information in data banks and information systems
Dissemination of knowledge on techniques, methods, approaches and experiences to improve and remediate the environment
In pursuit of these topics, the geoscientific disciplines are invited to contribute their knowledge and experience. Major disciplines include: hydrogeology, hydrochemistry, geochemistry, geophysics, engineering geology, remediation science, natural resources management, environmental climatology and biota, environmental geography, soil science and geomicrobiology.