{"title":"Spatial and temporal classification and prediction of aspen probability in boreal forests using machine learning algorithms","authors":"Dmitriy Troshin, Maksim Fayzulin, Denis Mirin","doi":"10.1007/s10661-025-13985-9","DOIUrl":null,"url":null,"abstract":"<div><p>Mapping and classifying the probability of occurrence of <i>Populus tremula</i> L. (aspen) in boreal forests is a complex task for sustainable forest management and biodiversity conservation. As a key broadleaved species in the taiga region, aspen supports both forestry and local biodiversity habitats. This study introduces a methodology to predict aspen presence using Sentinel- 2 satellite data and machine learning, combining binary classification (presence/absence) with probability estimation. We utilized spectral features (e.g., NDVI, EVI, SAVI) extracted from Sentinel- 2 imagery, employing a logistic regression model to classify aspen occurrence. To assess feature importance, we applied Permutation Importance (PI) and SHAP, enhancing model interpretability and ensuring transparency in identifying influential factors for forest management applications. Results revealed the significant role of spectral features in determining aspen growth probability. SAVI exhibited a strong effect on classification accuracy due to its soil correction capability, while EVI and NDVI proved highly important in summer, reflecting seasonal vegetation dynamics. High EVI values often indicate complex vegetation and conifer biomass, whereas aspen, with its distinct canopy and phenology, shows lower EVI compared to conifers. NDVI, tied to aspen’s photosynthetic activity, remained a reliable indicator in mixed taiga forests. The model achieved an overall accuracy of 94.77% with XGBoost and 95.03% with Random Forest across all seasons, demonstrating robust performance. This reliable aspen distribution data aids forestry planning, such as harvesting, and the algorithm automates inventorying and mapping aspen stands, reducing reliance on labor-intensive ground surveys.</p></div>","PeriodicalId":544,"journal":{"name":"Environmental Monitoring and Assessment","volume":"197 5","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Monitoring and Assessment","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s10661-025-13985-9","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Mapping and classifying the probability of occurrence of Populus tremula L. (aspen) in boreal forests is a complex task for sustainable forest management and biodiversity conservation. As a key broadleaved species in the taiga region, aspen supports both forestry and local biodiversity habitats. This study introduces a methodology to predict aspen presence using Sentinel- 2 satellite data and machine learning, combining binary classification (presence/absence) with probability estimation. We utilized spectral features (e.g., NDVI, EVI, SAVI) extracted from Sentinel- 2 imagery, employing a logistic regression model to classify aspen occurrence. To assess feature importance, we applied Permutation Importance (PI) and SHAP, enhancing model interpretability and ensuring transparency in identifying influential factors for forest management applications. Results revealed the significant role of spectral features in determining aspen growth probability. SAVI exhibited a strong effect on classification accuracy due to its soil correction capability, while EVI and NDVI proved highly important in summer, reflecting seasonal vegetation dynamics. High EVI values often indicate complex vegetation and conifer biomass, whereas aspen, with its distinct canopy and phenology, shows lower EVI compared to conifers. NDVI, tied to aspen’s photosynthetic activity, remained a reliable indicator in mixed taiga forests. The model achieved an overall accuracy of 94.77% with XGBoost and 95.03% with Random Forest across all seasons, demonstrating robust performance. This reliable aspen distribution data aids forestry planning, such as harvesting, and the algorithm automates inventorying and mapping aspen stands, reducing reliance on labor-intensive ground surveys.
期刊介绍:
Environmental Monitoring and Assessment emphasizes technical developments and data arising from environmental monitoring and assessment, the use of scientific principles in the design of monitoring systems at the local, regional and global scales, and the use of monitoring data in assessing the consequences of natural resource management actions and pollution risks to man and the environment.