{"title":"基于哨兵2的机器学习模型在捷克国家森林清查中的验证","authors":"Richard Kovárník, Jitka Janová","doi":"10.1016/j.ecoinf.2025.103133","DOIUrl":null,"url":null,"abstract":"<div><div>The National Forest Inventory (NFI) of the Czech Republic provides essential data for forest management but requires significant time and resources. This study highlights the critical role of validating Sentinel-2-based machine learning models against real NFI data to ensure their reliability for forest monitoring. While satellite-based models offer a cost-effective alternative, their practical applicability depends on rigorous validation. We applied four commonly used machine learning models—Classification and Regression Trees, Random Forest, Support Vector Machine, and Naive Bayes—to Sentinel-2 imagery to estimate forest cover conditions. The Random Forest model achieved the highest overall accuracy (98.3 %). By systematically comparing model predictions with official NFI data, we address a key gap in remote sensing applications: the need for real-world validation beyond training datasets. Our findings demonstrate that properly validated Sentinel-2-based models can enhance large-scale forest monitoring, reducing the financial and labor burdens of traditional field surveys while ensuring data accuracy for sustainable forest management.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"87 ","pages":"Article 103133"},"PeriodicalIF":5.8000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Validation of sentinel 2 based machine learning models for Czech National Forest Inventory\",\"authors\":\"Richard Kovárník, Jitka Janová\",\"doi\":\"10.1016/j.ecoinf.2025.103133\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The National Forest Inventory (NFI) of the Czech Republic provides essential data for forest management but requires significant time and resources. This study highlights the critical role of validating Sentinel-2-based machine learning models against real NFI data to ensure their reliability for forest monitoring. While satellite-based models offer a cost-effective alternative, their practical applicability depends on rigorous validation. We applied four commonly used machine learning models—Classification and Regression Trees, Random Forest, Support Vector Machine, and Naive Bayes—to Sentinel-2 imagery to estimate forest cover conditions. The Random Forest model achieved the highest overall accuracy (98.3 %). By systematically comparing model predictions with official NFI data, we address a key gap in remote sensing applications: the need for real-world validation beyond training datasets. Our findings demonstrate that properly validated Sentinel-2-based models can enhance large-scale forest monitoring, reducing the financial and labor burdens of traditional field surveys while ensuring data accuracy for sustainable forest management.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"87 \",\"pages\":\"Article 103133\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125001426\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125001426","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
Validation of sentinel 2 based machine learning models for Czech National Forest Inventory
The National Forest Inventory (NFI) of the Czech Republic provides essential data for forest management but requires significant time and resources. This study highlights the critical role of validating Sentinel-2-based machine learning models against real NFI data to ensure their reliability for forest monitoring. While satellite-based models offer a cost-effective alternative, their practical applicability depends on rigorous validation. We applied four commonly used machine learning models—Classification and Regression Trees, Random Forest, Support Vector Machine, and Naive Bayes—to Sentinel-2 imagery to estimate forest cover conditions. The Random Forest model achieved the highest overall accuracy (98.3 %). By systematically comparing model predictions with official NFI data, we address a key gap in remote sensing applications: the need for real-world validation beyond training datasets. Our findings demonstrate that properly validated Sentinel-2-based models can enhance large-scale forest monitoring, reducing the financial and labor burdens of traditional field surveys while ensuring data accuracy for sustainable forest management.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.