Comparison of neural networks and k-nearest neighbors methods in forest stand variable estimation using airborne laser data

ISPRS Open Journal of Photogrammetry and Remote Sensing Pub Date : 2022-04-01 DOI:10.1016/j.ophoto.2022.100012

Andras Balazs, Eero Liski, Sakari Tuominen, Annika Kangas

{"title":"Comparison of neural networks and k-nearest neighbors methods in forest stand variable estimation using airborne laser data","authors":"Andras Balazs, Eero Liski, Sakari Tuominen, Annika Kangas","doi":"10.1016/j.ophoto.2022.100012","DOIUrl":null,"url":null,"abstract":"<div><p>In the remote sensing of forests, point cloud data from airborne laser scanning contains high-value information for predicting the volume of growing stock and the size of trees. At the same time, laser scanning data allows a very high number of potential features that can be extracted from the point cloud data for predicting the forest variables. In some methods, the features are first extracted by user-defined algorithms and the best features are selected based on supervised learning, whereas both tasks can be carried out automatically by deep learning methods typically based on deep neural networks. In this study we tested k-nearest neighbor method combined with genetic algorithm (k-NN), artificial neural network (ANN), 2-dimensional convolutional neural network (2D-CNN) and 3-dimensional CNN (3D-CNN) for estimating the following forest variables: volume of growing stock, stand mean height and mean diameter. The results indicate that there were no major differences in the accuracy of the tested methods, but the ANN and 3D-CNN generally resulted in the lowest RMSE values for the predicted forest variables and the highest R<sup>2</sup> values between the predicted and observed forest variables. The lowest RMSE scores were 20.3% (3D-CNN), 6.4% (3D-CNN) and 11.2% (ANN) and the highest R<sup>2</sup> results 0.90 (3D-CNN), 0.95 (3D-CNN) and 0.85 (ANN) for volume of growing stock, stand mean height and mean diameter, respectively. Covariances of all response variable combinations and all predictions methods were lower than corresponding covariances of the field observations. ANN predictions had the highest covariances for mean height vs. mean diameter and total growing stock vs. mean diameter combinations and 3D-CNN for mean height vs. total growing stock. CNNs have distinct theoretical advantage over the other methods in complex recognition or classification tasks, but the utilization of their full potential may possibly require higher point density clouds than applied here. Thus, the relatively low density of the point clouds data may have been a contributing factor to the somewhat inconclusive ranking of the methods in this study. The input data and computer codes are available at: <span>https://github.com/balazsan/ALS_NNs</span><svg><path></path></svg>.</p></div>","PeriodicalId":100730,"journal":{"name":"ISPRS Open Journal of Photogrammetry and Remote Sensing","volume":"4 ","pages":"Article 100012"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667393222000011/pdfft?md5=975880c7524bbf3b179db86e3e611df1&pid=1-s2.0-S2667393222000011-main.pdf","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Open Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667393222000011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

In the remote sensing of forests, point cloud data from airborne laser scanning contains high-value information for predicting the volume of growing stock and the size of trees. At the same time, laser scanning data allows a very high number of potential features that can be extracted from the point cloud data for predicting the forest variables. In some methods, the features are first extracted by user-defined algorithms and the best features are selected based on supervised learning, whereas both tasks can be carried out automatically by deep learning methods typically based on deep neural networks. In this study we tested k-nearest neighbor method combined with genetic algorithm (k-NN), artificial neural network (ANN), 2-dimensional convolutional neural network (2D-CNN) and 3-dimensional CNN (3D-CNN) for estimating the following forest variables: volume of growing stock, stand mean height and mean diameter. The results indicate that there were no major differences in the accuracy of the tested methods, but the ANN and 3D-CNN generally resulted in the lowest RMSE values for the predicted forest variables and the highest R² values between the predicted and observed forest variables. The lowest RMSE scores were 20.3% (3D-CNN), 6.4% (3D-CNN) and 11.2% (ANN) and the highest R² results 0.90 (3D-CNN), 0.95 (3D-CNN) and 0.85 (ANN) for volume of growing stock, stand mean height and mean diameter, respectively. Covariances of all response variable combinations and all predictions methods were lower than corresponding covariances of the field observations. ANN predictions had the highest covariances for mean height vs. mean diameter and total growing stock vs. mean diameter combinations and 3D-CNN for mean height vs. total growing stock. CNNs have distinct theoretical advantage over the other methods in complex recognition or classification tasks, but the utilization of their full potential may possibly require higher point density clouds than applied here. Thus, the relatively low density of the point clouds data may have been a contributing factor to the somewhat inconclusive ranking of the methods in this study. The input data and computer codes are available at: https://github.com/balazsan/ALS_NNs.

查看原文本刊更多论文

神经网络与k近邻方法在机载激光林分变量估计中的比较

在森林遥感中，机载激光扫描的点云数据包含了预测生长蓄积量和树木大小的高价值信息。同时，激光扫描数据允许从点云数据中提取非常多的潜在特征，用于预测森林变量。在一些方法中，首先通过用户自定义算法提取特征，然后基于监督学习选择最佳特征，而这两项任务都可以通过通常基于深度神经网络的深度学习方法自动完成。本文采用遗传算法(k-NN)、人工神经网络(ANN)、二维卷积神经网络(2D-CNN)和三维神经网络(3D-CNN)相结合的k-最近邻方法来估计林分蓄积量、林分平均高度和林分平均直径。结果表明，两种方法的准确率差异不大，但ANN和3D-CNN预测森林变量的RMSE值最低，预测森林变量与观测森林变量的R2值最高。生长期蓄积量、林分平均高度和林分平均直径的RMSE评分最低的分别为20.3% (3D-CNN)、6.4% (3D-CNN)和11.2% (ANN)， R2最高的分别为0.90 (3D-CNN)、0.95 (3D-CNN)和0.85 (ANN)。所有响应变量组合和所有预测方法的协方差均低于相应的现场观测协方差。ANN预测的平均高度与平均直径、总生长量与平均直径组合的协方差最高，3D-CNN预测的平均高度与总生长量组合的协方差最高。在复杂的识别或分类任务中，cnn比其他方法具有明显的理论优势，但要充分发挥其潜力，可能需要比本文更高的点密度云。因此，相对较低的点云数据密度可能是导致本研究中方法排名不确定的一个因素。输入数据和计算机代码可在https://github.com/balazsan/ALS_NNs上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ISPRS Open Journal of Photogrammetry and Remote Sensing

CiteScore

5.10

自引率

0.00%

发文量