{"title":"回归分析与人工神经网络预测群落林松种量","authors":"Wenceslao Santiago-García","doi":"10.1016/j.ecoinf.2025.103203","DOIUrl":null,"url":null,"abstract":"<div><div>Volume prediction models are fundamental in forestry, as they support forest inventories, sustainable forest management strategies, and comprehensive environmental planning. The main objective of this study was to implement and compare two prominent approaches—regression and machine learning—for modeling whole-tree volume and stem volume in two <em>Pinus</em> species in community forests of southern Mexico. Destructive sampling provided data from 56 <em>P. patula</em> and 51 <em>P. pseudostrobus</em> trees, covering a wide range of diameters and heights. The regression approach employed seemingly unrelated nonlinear regression (NSUR) to fit simultaneous additive volume systems using both one- and two-variable models. In this approach, volume was modeled as a function of diameter at breast height (<em>d</em>) alone and as a function of both <em>d</em> and total tree height (<em>h</em>). Species and volume type were implicitly accounted for within the structure of the additive systems structure. For the machine learning approach, multilayer perceptron (MLP) artificial neural networks (ANNs) were trained using four input variables: diameter at breast height, total tree height, species, and volume type. These variables were explicitly incorporated into the ANN structure, enabling the model to learn complex, non-linear interactions. The ANN was optimized using L1 regularization and the Adam optimizer. The quantitative variables were diameter at breast height and total tree height, while the qualitative variables were species (<em>P. patula</em> and <em>P. pseudostrobus</em>) and volume type (whole-tree volume and stem volume), both coded as 1 and 0, respectively. The relative rank method was used to identify the most effective models based on goodness-of-fit statistics, including the coefficient of determination (R<sup>2</sup>), average absolute error (AAE), total relative error (TRE), average systematic error (ASE), and mean percent standard error (MPSE). The ANN approach consistently outperformed the regression model, achieving higher R<sup>2</sup> values and lower error metrics across all evaluations. Specifically, the ANN model reduced AAE, TRE, and ASE while maintaining biologically consistent predictions. This proposed ANN model represents a significant advancement in modeling both whole-tree and stem volume simultaneously and independently across different species, providing reliable and precise estimates. Given its ability to handle complex, non-linear relationships and its superior accuracy, we recommend the use of ANN as a practical tool in forestry applications, including forest resource evaluation and the development of sustainable forest management plans.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"89 ","pages":"Article 103203"},"PeriodicalIF":5.8000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Regression analysis and artificial neural networks for predicting pine species volume in community forests\",\"authors\":\"Wenceslao Santiago-García\",\"doi\":\"10.1016/j.ecoinf.2025.103203\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Volume prediction models are fundamental in forestry, as they support forest inventories, sustainable forest management strategies, and comprehensive environmental planning. The main objective of this study was to implement and compare two prominent approaches—regression and machine learning—for modeling whole-tree volume and stem volume in two <em>Pinus</em> species in community forests of southern Mexico. Destructive sampling provided data from 56 <em>P. patula</em> and 51 <em>P. pseudostrobus</em> trees, covering a wide range of diameters and heights. The regression approach employed seemingly unrelated nonlinear regression (NSUR) to fit simultaneous additive volume systems using both one- and two-variable models. In this approach, volume was modeled as a function of diameter at breast height (<em>d</em>) alone and as a function of both <em>d</em> and total tree height (<em>h</em>). Species and volume type were implicitly accounted for within the structure of the additive systems structure. For the machine learning approach, multilayer perceptron (MLP) artificial neural networks (ANNs) were trained using four input variables: diameter at breast height, total tree height, species, and volume type. These variables were explicitly incorporated into the ANN structure, enabling the model to learn complex, non-linear interactions. The ANN was optimized using L1 regularization and the Adam optimizer. The quantitative variables were diameter at breast height and total tree height, while the qualitative variables were species (<em>P. patula</em> and <em>P. pseudostrobus</em>) and volume type (whole-tree volume and stem volume), both coded as 1 and 0, respectively. The relative rank method was used to identify the most effective models based on goodness-of-fit statistics, including the coefficient of determination (R<sup>2</sup>), average absolute error (AAE), total relative error (TRE), average systematic error (ASE), and mean percent standard error (MPSE). The ANN approach consistently outperformed the regression model, achieving higher R<sup>2</sup> values and lower error metrics across all evaluations. Specifically, the ANN model reduced AAE, TRE, and ASE while maintaining biologically consistent predictions. This proposed ANN model represents a significant advancement in modeling both whole-tree and stem volume simultaneously and independently across different species, providing reliable and precise estimates. Given its ability to handle complex, non-linear relationships and its superior accuracy, we recommend the use of ANN as a practical tool in forestry applications, including forest resource evaluation and the development of sustainable forest management plans.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"89 \",\"pages\":\"Article 103203\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125002122\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125002122","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
Regression analysis and artificial neural networks for predicting pine species volume in community forests
Volume prediction models are fundamental in forestry, as they support forest inventories, sustainable forest management strategies, and comprehensive environmental planning. The main objective of this study was to implement and compare two prominent approaches—regression and machine learning—for modeling whole-tree volume and stem volume in two Pinus species in community forests of southern Mexico. Destructive sampling provided data from 56 P. patula and 51 P. pseudostrobus trees, covering a wide range of diameters and heights. The regression approach employed seemingly unrelated nonlinear regression (NSUR) to fit simultaneous additive volume systems using both one- and two-variable models. In this approach, volume was modeled as a function of diameter at breast height (d) alone and as a function of both d and total tree height (h). Species and volume type were implicitly accounted for within the structure of the additive systems structure. For the machine learning approach, multilayer perceptron (MLP) artificial neural networks (ANNs) were trained using four input variables: diameter at breast height, total tree height, species, and volume type. These variables were explicitly incorporated into the ANN structure, enabling the model to learn complex, non-linear interactions. The ANN was optimized using L1 regularization and the Adam optimizer. The quantitative variables were diameter at breast height and total tree height, while the qualitative variables were species (P. patula and P. pseudostrobus) and volume type (whole-tree volume and stem volume), both coded as 1 and 0, respectively. The relative rank method was used to identify the most effective models based on goodness-of-fit statistics, including the coefficient of determination (R2), average absolute error (AAE), total relative error (TRE), average systematic error (ASE), and mean percent standard error (MPSE). The ANN approach consistently outperformed the regression model, achieving higher R2 values and lower error metrics across all evaluations. Specifically, the ANN model reduced AAE, TRE, and ASE while maintaining biologically consistent predictions. This proposed ANN model represents a significant advancement in modeling both whole-tree and stem volume simultaneously and independently across different species, providing reliable and precise estimates. Given its ability to handle complex, non-linear relationships and its superior accuracy, we recommend the use of ANN as a practical tool in forestry applications, including forest resource evaluation and the development of sustainable forest management plans.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.