Correction to “Predicting intraspecific trait variation among California's grasses”

IF 5.3 1区 环境科学与生态学 Q1 ECOLOGY
{"title":"Correction to “Predicting intraspecific trait variation among California's grasses”","authors":"","doi":"10.1111/1365-2745.14466","DOIUrl":null,"url":null,"abstract":"<p>Sandel, B., Pavelka, C., Hayashi, T., et al. (2021) Predicting intraspecific trait variation among California's grasses. <i>Journal of Ecology</i>, <i>109</i>, 2662–2677. https://doi.org/10.1111/1365-2745.13673.</p>\n<p>In the paper by Sandel et al. (2021), an error has been identified in the code.</p>\n<p>The error was in generating the testing data subset for assessing random forest fit, causing it to not be independent of the training dataset. This affects Figures 3-5, and the corrected versions of these are included below. Table S3 has been updated in the article. The updated text referring to these figures in the section ‘3.2 Modelling ITV’ is also included below. The changes do not fundamentally alter the message of the paper.</p>\n<figure><picture>\n<source media=\"(min-width: 1650px)\" srcset=\"/cms/asset/a95ce570-b963-458a-ba8f-8032e33fc08f/jec14466-fig-0001-m.jpg\"/><img alt=\"Details are in the caption following the image\" data-lg-src=\"/cms/asset/a95ce570-b963-458a-ba8f-8032e33fc08f/jec14466-fig-0001-m.jpg\" loading=\"lazy\" src=\"/cms/asset/58c707eb-7764-4f13-910e-032b1da04aae/jec14466-fig-0001-m.png\" title=\"Details are in the caption following the image\"/></picture><figcaption>\n<div><strong>FIGURE 3<span style=\"font-weight:normal\"></span></strong><div>Open in figure viewer<i aria-hidden=\"true\"></i><span>PowerPoint</span></div>\n</div>\n<div>Improvements in model performance when adding variable groupings. Model performance was measured as the correlation between observed and predicted delta-trait values in the testing dataset. For each variable group, we take the mean performance of all models that included that variable group minus the mean performance for all models that excluded that variable. Climate variables were mean annual temperature and annual precipitation; local traits were local measures of specific leaf area (SLA), height or leaf area (LA) at a site, excluding the predicted measures (e.g. models predicting SLA were trained on Height and LA); species traits were the overall species means of SLA, height and LA; phylogeny was the first five phylogenetic Eigenvector maps; and species name is a categorical variable giving the species name.</div>\n</figcaption>\n</figure>\n<p><b>3.2 Modelling</b> <b>ITV</b></p>\n<p>Across all specifications of the random forest models, performance scores were very similar on the training and testing data subsets (on average, differing by &lt;0.09, Table S3), suggesting little overfitting. When applied to the testing dataset, random forests containing all five predictor groups predicted values that were well correlated with the observed trait values (for delta-SLA: 0.74, SLA: 0.82, delta-Height: 0.67, Height = 0.88, delta-LA: 0.72, LA: 0.89, Table S3). Across all subsets of variable groups, other local traits (values of the non-focal trait from the local population, e.g. when predicting SLA, the Height of the plants) and climate were the most important groups for model performance (Figure 3). Species mean traits, name and phylogeny had smaller contributions to model fit. The performance of one such random forest, excluding the species predictor variable, is shown in Figure 4. The correlation between observed and predicted values is strong for both training and testing datasets. However, the observed–predicted relationships deviated somewhat from the 1:1 line, particularly for the delta-trait predictions. Standard major axis (SMA) regression slopes were less than 1, ranging from 0.65 to 0.73 for delta-trait models and 0.82 and 0.92 for the final local trait predictions. These deviations indicate that these models tend to predict less extreme values for the most extreme trait observations.</p>\n<figure><picture>\n<source media=\"(min-width: 1650px)\" srcset=\"/cms/asset/8ace7565-cf11-44c5-9200-82aacdd197ae/jec14466-fig-0002-m.jpg\"/><img alt=\"Details are in the caption following the image\" data-lg-src=\"/cms/asset/8ace7565-cf11-44c5-9200-82aacdd197ae/jec14466-fig-0002-m.jpg\" loading=\"lazy\" src=\"/cms/asset/cfba3354-6dd3-4a32-a1f2-7cedbb9aaef7/jec14466-fig-0002-m.png\" title=\"Details are in the caption following the image\"/></picture><figcaption>\n<div><strong>FIGURE 4<span style=\"font-weight:normal\"></span></strong><div>Open in figure viewer<i aria-hidden=\"true\"></i><span>PowerPoint</span></div>\n</div>\n<div>Model fit for random forests predicting local trait values from climate, other local traits, species mean traits and phylogenetic position. Each point represents a sample of a grass species from a particular location. Error bars indicate standard errors for the predictions. Models predicting delta-trait values are attempting to predict deviation of an individual from its species mean (left column). Adding the species means to these predictions gives an overall estimate of the trait value for an individual (right column).</div>\n</figcaption>\n</figure>\n<p>A model including other local trait measurements and species names would be of limited use for predicting trait values of a plant in an unmeasured location. In contrast, the climate of that location is readily available, and phylogenetic relationships are known for most species. Thus, we focused on a reduced model including just these two variable groups and two species-level traits: the species mean value for the focal trait and its life span. Removing species names from the model had little impact (Table S3), but removing other local traits reduced model performance (Figure 5). For example, predicted-observed correlations for SLA, height and LA dropped to 0.79, 0.88 and 0.89. This likely reflects the fact that other local trait measurements can provide insight into local conditions that are not captured by our two broad climate predictors. Despite this modest reduction, model performance for this simplified model was still fairly high.</p>\n<figure><picture>\n<source media=\"(min-width: 1650px)\" srcset=\"/cms/asset/0aaccbcf-6519-4c8f-b8c8-d12a4561c51a/jec14466-fig-0003-m.jpg\"/><img alt=\"Details are in the caption following the image\" data-lg-src=\"/cms/asset/0aaccbcf-6519-4c8f-b8c8-d12a4561c51a/jec14466-fig-0003-m.jpg\" loading=\"lazy\" src=\"/cms/asset/97eb7986-b0cc-4b66-a4d2-2971f66b120e/jec14466-fig-0003-m.png\" title=\"Details are in the caption following the image\"/></picture><figcaption>\n<div><strong>FIGURE 5<span style=\"font-weight:normal\"></span></strong><div>Open in figure viewer<i aria-hidden=\"true\"></i><span>PowerPoint</span></div>\n</div>\n<div>Model fit for random forests using only mean traits and phylogeny and trained on the entire dataset. Each point represents a sample of a grass species from a particular location. Error bars indicate standard errors. Models predicting delta-trait values are attempting to predict deviation of an individual from its species mean (left column). Adding the species means to these predictions gives an overall estimate of the trait value for an individual (right column).</div>\n</figcaption>\n</figure>\n<p>We apologise for this error.</p>","PeriodicalId":191,"journal":{"name":"Journal of Ecology","volume":"111 3S 1","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Ecology","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1111/1365-2745.14466","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Sandel, B., Pavelka, C., Hayashi, T., et al. (2021) Predicting intraspecific trait variation among California's grasses. Journal of Ecology, 109, 2662–2677. https://doi.org/10.1111/1365-2745.13673.

In the paper by Sandel et al. (2021), an error has been identified in the code.

The error was in generating the testing data subset for assessing random forest fit, causing it to not be independent of the training dataset. This affects Figures 3-5, and the corrected versions of these are included below. Table S3 has been updated in the article. The updated text referring to these figures in the section ‘3.2 Modelling ITV’ is also included below. The changes do not fundamentally alter the message of the paper.

Abstract Image
FIGURE 3
Open in figure viewerPowerPoint
Improvements in model performance when adding variable groupings. Model performance was measured as the correlation between observed and predicted delta-trait values in the testing dataset. For each variable group, we take the mean performance of all models that included that variable group minus the mean performance for all models that excluded that variable. Climate variables were mean annual temperature and annual precipitation; local traits were local measures of specific leaf area (SLA), height or leaf area (LA) at a site, excluding the predicted measures (e.g. models predicting SLA were trained on Height and LA); species traits were the overall species means of SLA, height and LA; phylogeny was the first five phylogenetic Eigenvector maps; and species name is a categorical variable giving the species name.

3.2 Modelling ITV

Across all specifications of the random forest models, performance scores were very similar on the training and testing data subsets (on average, differing by <0.09, Table S3), suggesting little overfitting. When applied to the testing dataset, random forests containing all five predictor groups predicted values that were well correlated with the observed trait values (for delta-SLA: 0.74, SLA: 0.82, delta-Height: 0.67, Height = 0.88, delta-LA: 0.72, LA: 0.89, Table S3). Across all subsets of variable groups, other local traits (values of the non-focal trait from the local population, e.g. when predicting SLA, the Height of the plants) and climate were the most important groups for model performance (Figure 3). Species mean traits, name and phylogeny had smaller contributions to model fit. The performance of one such random forest, excluding the species predictor variable, is shown in Figure 4. The correlation between observed and predicted values is strong for both training and testing datasets. However, the observed–predicted relationships deviated somewhat from the 1:1 line, particularly for the delta-trait predictions. Standard major axis (SMA) regression slopes were less than 1, ranging from 0.65 to 0.73 for delta-trait models and 0.82 and 0.92 for the final local trait predictions. These deviations indicate that these models tend to predict less extreme values for the most extreme trait observations.

Abstract Image
FIGURE 4
Open in figure viewerPowerPoint
Model fit for random forests predicting local trait values from climate, other local traits, species mean traits and phylogenetic position. Each point represents a sample of a grass species from a particular location. Error bars indicate standard errors for the predictions. Models predicting delta-trait values are attempting to predict deviation of an individual from its species mean (left column). Adding the species means to these predictions gives an overall estimate of the trait value for an individual (right column).

A model including other local trait measurements and species names would be of limited use for predicting trait values of a plant in an unmeasured location. In contrast, the climate of that location is readily available, and phylogenetic relationships are known for most species. Thus, we focused on a reduced model including just these two variable groups and two species-level traits: the species mean value for the focal trait and its life span. Removing species names from the model had little impact (Table S3), but removing other local traits reduced model performance (Figure 5). For example, predicted-observed correlations for SLA, height and LA dropped to 0.79, 0.88 and 0.89. This likely reflects the fact that other local trait measurements can provide insight into local conditions that are not captured by our two broad climate predictors. Despite this modest reduction, model performance for this simplified model was still fairly high.

Abstract Image
FIGURE 5
Open in figure viewerPowerPoint
Model fit for random forests using only mean traits and phylogeny and trained on the entire dataset. Each point represents a sample of a grass species from a particular location. Error bars indicate standard errors. Models predicting delta-trait values are attempting to predict deviation of an individual from its species mean (left column). Adding the species means to these predictions gives an overall estimate of the trait value for an individual (right column).

We apologise for this error.

求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Ecology
Journal of Ecology 环境科学-生态学
CiteScore
10.90
自引率
5.50%
发文量
207
审稿时长
3.0 months
期刊介绍: Journal of Ecology publishes original research papers on all aspects of the ecology of plants (including algae), in both aquatic and terrestrial ecosystems. We do not publish papers concerned solely with cultivated plants and agricultural ecosystems. Studies of plant communities, populations or individual species are accepted, as well as studies of the interactions between plants and animals, fungi or bacteria, providing they focus on the ecology of the plants. We aim to bring important work using any ecological approach (including molecular techniques) to a wide international audience and therefore only publish papers with strong and ecological messages that advance our understanding of ecological principles.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信