Validation of appropriate estimation criteria for the number of components for separating a polymodal grain-size distribution into lognormal distributions
Author: Naofumi Yamaguchi
Journal: Progress in Earth and Planetary Science, 10(1), published 2023-12-13
DOI: https://doi.org/10.1186/s40645-023-00601-y
Abstract
Polymodal particle size distributions are generally analyzed by separating them into lognormal distributions, but estimating the precise number of lognormal components required remains a considerable problem. In the present study, appropriate evaluation criteria for the estimation of the number of components were examined by using artificial data for which the true number of components was known. The characteristics of estimations of the number of components by four evaluation criteria, the mean square error (MSE), Akaike information criterion (AIC), Bayesian information criterion (BIC), and adjusted R-squared (ARS), were investigated. The results showed that the MSE and ARS were less sensitive to the true number of components and tended to overestimate the number of components. By contrast, the AIC and BIC tended to underestimate the number of components, and their correct answer rates decreased as the true number of components increased. The BIC tended to include the true number of components among its higher ranked models. The present evaluation results suggest that the MSE, although frequently used, is not necessarily the most appropriate evaluation criterion, and that the AIC and ARS may be more appropriate criteria. Furthermore, checking whether the number of components estimated by the AIC or ARS is included among higher ranked BIC models might prevent overestimation and thereby allow for more valid estimation of the number of components. When the criteria were applied to grain-size distributions of lacustrine sediments, it was possible to estimate the number of components that reflected differences in grain-size distribution characteristics.
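As a rough illustration of how the four criteria named in the abstract can be compared, the sketch below computes them from the residuals of a fitted curve. The abstract does not give the formulas the study used, so this assumes the common least-squares forms: AIC = n ln(RSS/n) + 2k, BIC = n ln(RSS/n) + k ln n, and adjusted R-squared = 1 - (1 - R²)(n - 1)/(n - k - 1). The function names (`selection_criteria`, `best_by`, `top_bic`), the parameter count `n_params`, and the `top` cutoff for "higher ranked" BIC models are all hypothetical. For m lognormal components, one plausible parameter count is 3m - 1 (a mean, a standard deviation, and a weight per component, with the weights summing to one), but that is also an assumption.

```python
import math


def selection_criteria(observed, fitted, n_params):
    """Return MSE, AIC, BIC and adjusted R-squared (ARS) for one fitted model.

    Assumes least-squares forms of AIC and BIC; the source does not state
    which forms the study used.
    """
    n = len(observed)
    if n != len(fitted):
        raise ValueError("observed and fitted must have the same length")
    if n - n_params - 1 <= 0:
        raise ValueError("too few data points for this many parameters")

    rss = sum((o - f) ** 2 for o, f in zip(observed, fitted))
    if rss == 0:
        raise ValueError("perfect fit: log(RSS/n) is undefined")
    mean_obs = sum(observed) / n
    tss = sum((o - mean_obs) ** 2 for o in observed)

    r2 = 1.0 - rss / tss
    return {
        "MSE": rss / n,
        "AIC": n * math.log(rss / n) + 2 * n_params,
        "BIC": n * math.log(rss / n) + n_params * math.log(n),
        "ARS": 1.0 - (1.0 - r2) * (n - 1) / (n - n_params - 1),
    }


def best_by(scores, criterion):
    """Pick the number of components preferred by one criterion.

    scores maps a candidate number of components to its criteria dict.
    ARS is maximized; MSE, AIC and BIC are minimized.
    """
    pick = max if criterion == "ARS" else min
    return pick(scores, key=lambda k: scores[k][criterion])


def top_bic(scores, top=3):
    """Return the candidate numbers of components with the lowest BIC.

    The cutoff `top` is an illustrative choice, not taken from the source.
    """
    return sorted(scores, key=lambda k: scores[k]["BIC"])[:top]
```

With `scores` built by calling `selection_criteria` once per candidate number of components, the cross-check suggested in the abstract could be written as `best_by(scores, "AIC") in top_bic(scores)`.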
About the journal:
Progress in Earth and Planetary Science (PEPS), a peer-reviewed open access e-journal, was launched by the Japan Geoscience Union (JpGU) in 2014. This international journal is devoted to high-quality original articles, reviews and papers with full data attached in the research fields of space and planetary sciences, atmospheric and hydrospheric sciences, human geosciences, solid earth sciences, and biogeosciences. PEPS promotes excellent review articles and welcomes articles with electronic attachments including videos, animations, and large original data files. PEPS also encourages papers with full data attached, that is, scientific articles that preserve the full detailed raw research data and metadata gathered in their preparation and make these data freely available to the research community for further analysis.