Thomas Prantl , Til Barthel , Dennis Kaiser , Maximilian Schwinger , André Bauer , Samuel Kounev
{"title":"Challenges in the evaluation of earth observation products: Accuracy assessment case study using convolutional neural networks","authors":"Thomas Prantl , Til Barthel , Dennis Kaiser , Maximilian Schwinger , André Bauer , Samuel Kounev","doi":"10.1016/j.rsase.2024.101420","DOIUrl":null,"url":null,"abstract":"<div><div>Earth observation is essential for monitoring natural resources and the Earth’s climate. However, the accuracy of earth observation products is sometimes not adequately evaluated, mainly when deep learning (DL) is used. One reason is that the DP and remote sensing communities have different ways of evaluating accuracy. The DL community tends to use single metrics to summarize results, which the remote sensing community overlooks. On the other hand, the remote sensing community emphasizes transparency in map creation, documenting sampling methods, and error matrices, which is not a priority for the DL community. Therefore, a significant challenge in applying DP methods for earth observation is the lack of evaluation using the established metrics of the remote sensing community, which are not commonly used in the DL community. In addition, assessing the accuracy of deep learning models adds another layer of complexity to the process. To tackle this challenge, we conducted a case study on creating a map of settlements in Bavaria using CNNs and satellite images. We then evaluated the resulting map according to the recommendations found in remote sensing literature. Our evaluation revealed that the CNNs we trained had an Overall Accuracy of over 97%. Since the remote sensing literature recommends not reporting only Overall Accuracy as a metric, especially for class imbalances, we also specified the confusion matrices with sample and proportion counts and the estimators for a 95% confidence interval as additional metrics and performed a visual evaluation. Our evaluation is based on our previous examination of recommended assessment practices. Our aim in presenting this overview, along with the case study, is to help readers identify potential issues and offer guidance for evaluating their earth observation products.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"37 ","pages":"Article 101420"},"PeriodicalIF":3.8000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938524002842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Earth observation is essential for monitoring natural resources and the Earth’s climate. However, the accuracy of earth observation products is sometimes not adequately evaluated, mainly when deep learning (DL) is used. One reason is that the DP and remote sensing communities have different ways of evaluating accuracy. The DL community tends to use single metrics to summarize results, which the remote sensing community overlooks. On the other hand, the remote sensing community emphasizes transparency in map creation, documenting sampling methods, and error matrices, which is not a priority for the DL community. Therefore, a significant challenge in applying DP methods for earth observation is the lack of evaluation using the established metrics of the remote sensing community, which are not commonly used in the DL community. In addition, assessing the accuracy of deep learning models adds another layer of complexity to the process. To tackle this challenge, we conducted a case study on creating a map of settlements in Bavaria using CNNs and satellite images. We then evaluated the resulting map according to the recommendations found in remote sensing literature. Our evaluation revealed that the CNNs we trained had an Overall Accuracy of over 97%. Since the remote sensing literature recommends not reporting only Overall Accuracy as a metric, especially for class imbalances, we also specified the confusion matrices with sample and proportion counts and the estimators for a 95% confidence interval as additional metrics and performed a visual evaluation. Our evaluation is based on our previous examination of recommended assessment practices. Our aim in presenting this overview, along with the case study, is to help readers identify potential issues and offer guidance for evaluating their earth observation products.
期刊介绍:
The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems