对Raunkiæran植物短缺的全球评估:我们对植物性状认识的地理偏见

IF 9.4 1区 生物学 Q1 Agricultural and Biological Sciences
New Phytologist Pub Date : 2023-06-27 DOI:10.1111/nph.18999
Brian Maitner, Rachael Gallagher, Jens-Christian Svenning, Melanie Tietje, Elizabeth H. Wenk, Wolf L. Eiserhardt
{"title":"对Raunkiæran植物短缺的全球评估:我们对植物性状认识的地理偏见","authors":"Brian Maitner,&nbsp;Rachael Gallagher,&nbsp;Jens-Christian Svenning,&nbsp;Melanie Tietje,&nbsp;Elizabeth H. Wenk,&nbsp;Wolf L. Eiserhardt","doi":"10.1111/nph.18999","DOIUrl":null,"url":null,"abstract":"<p>The functional traits (measured attributes) of organisms result from interactions with their biotic and abiotic environment. Traits allow us to understand both how individuals and the communities they form will respond to environmental change and how these changes will impact ecosystem services and processes (Lavorel &amp; Garnier, <span>2002</span>). Plants constitute most of the biomass on Earth (<i>c</i>. 82%; Bar-On <i>et al</i>., <span>2018</span>), and their traits are the predominant drivers of terrestrial ecosystem functioning (Migliavacca <i>et al</i>., <span>2021</span>; Fricke <i>et al</i>., <span>2022</span>). Thus, to a first-order approximation, understanding the traits of plants means understanding terrestrial ecosystems.</p><p>There remains a sustained interest in both trait-based ecology (e.g. Lavorel &amp; Garnier, <span>2002</span>; McGill <i>et al</i>., <span>2006</span>; Violle <i>et al</i>., <span>2007</span>; Mouillot <i>et al</i>., <span>2021</span>) and Open Science (Cheruvelil &amp; Soranno, <span>2018</span>; Gallagher <i>et al</i>., <span>2020b</span>; Geange <i>et al</i>., <span>2021</span>), both of which have contributed to the creation and sharing of large compilations of plant traits constituting millions of observations (e.g. Kattge <i>et al</i>., <span>2011</span>; Maitner <i>et al</i>., <span>2017</span>; Sauquet <i>et al</i>., <span>2017</span>; Weigelt <i>et al</i>., <span>2020</span>; Falster <i>et al</i>., <span>2021</span>). However, despite this growing wealth of data, our knowledge of plant traits remains far from complete (the ‘Raunkiæran shortfall’; Hortal <i>et al</i>., <span>2015</span>).</p><p>In addition to trait data being incomplete, recent work by Cornwell <i>et al</i>. (<span>2019</span>) suggests our knowledge of plant traits is also spatially biased, with marked latitudinal variation in coverage. The causes of these biases have not been rigorously tested, but may be driven by: wealthier countries being able to collect and disseminate more data (Meyer <i>et al</i>., <span>2015</span>); smaller and more-accessible countries being able to sample proportionally more species (Hijmans <i>et al</i>., <span>2000</span>; Kadmon <i>et al</i>., <span>2004</span>; Hughes <i>et al</i>., <span>2021</span>); and countries with few species and low endemism reaching higher completeness more easily. These spatial biases in turn may limit our ability to respond to urgent global changes, particularly if there are discrepancies between where the data are most urgently needed (e.g. where changes are highly uncertain or projected to be severe) and where they are being collected.</p><p>Cornwell <i>et al</i>. (<span>2019</span>) examined the coverage of a range of attributes, including traits, in the global flora with a focus on assessing the completeness (fraction of plant species with data available) of information using The Plant List as a taxonomic backbone. Here, we expand on this work by mapping trait completeness (the fraction of species with freely available trait data for a given trait) globally, using the geographic and taxonomic information in the recently completed World Checklist of Vascular Plants (WCVP; Govaerts <i>et al</i>., <span>2021</span>) and trait information from the widely used TRY database (the most commonly used global plant trait database; Kattge <i>et al</i>., <span>2011</span>). We test hypothesized drivers of variation in plant trait data availability across the globe, including factors related to: wealth, research expenditure, and educational expenditure; region size and accessibility; and biogeography. We compare spatial patterns of trait data completeness with those of phylogenetic and distributional data completeness (Rudbeck <i>et al</i>., <span>2022</span>). We test for correlations between trait data completeness and impacts of global change. Finally, we identify solutions for filling significant regional gaps in trait data completeness using Open Science approaches, highlighting the model developed for the <span>AusTraits</span> database (Falster <i>et al</i>., <span>2021</span>).</p><p>The set of public trait data we received from TRY contained 1.5 million unique species × trait combinations across 10.4 million trait observations. Of the 2027 unique traits, the most complete was plant growth form (32.5%; 113 620 species). Most traits had poor coverage (Fig. S1), with a mean global completeness across all traits of 0.21% and a median of 0.0051%. Surprisingly, some traits calculated from multiple measurements had higher coverage than their components (e.g. SLA was available for 14 127 species and leaf dry mass for 6064). After excluding traits with data for &lt; 1% of species (3500), which could not be applied to all species, or which were not properties of individuals, our focal dataset included 5.1 million records across 122 230 species (34.9% of vascular plants) and 53 traits (Table S1). Trait coverage ranged between 1.01% and 32.5% global coverage (mean = 3.48%, median = 1.88%). For a single trait, within a single botanical country, trait completeness varied widely, ranging between 0 and 100% (mean = 19.4%, median = 12.7%), with entirely complete or incomplete data found on Antarctica and small islands. Mean completeness across traits ranged between 2.8% (New Guinea) and 58.7% (Føroyar) across countries (mean = 19.4%, median = 17.3%, Fig. 1). Correlations in completeness across botanical countries were positive and significant for the majority of focal traits, with the exception of four leaf morphological traits (width, length, margin type, and venation type; Fig. S2).</p><p>We found that the effects of four of our predictor variables had 95% confidence intervals that excluded zero (Table S2). Trait completeness was positively associated with mean species range size (0.41, 95% CI [0.37, 0.45]) and research expenditure (0.06, 95% CI [0.03, 0.09]). Trait completeness was negatively associated with endemism (−0.13, 95% CI [−0.15, −0.10]) and vascular plant species richness (−0.09, 95% CI [−0.12, −0.06]).</p><p>Despite the massive amount of trait data collated to date, we are a long way from fully capturing some of the simplest traits for the majority of plant species across the world's botanical countries. All 53 focal traits examined here were below the 40% coverage threshold noted by Penone <i>et al</i>. (<span>2014</span>) as being needed to impute missing traits with confidence, suggesting that imputation is not yet an option at a global level (but may be of use within certain regions). Our mapping shows that what information we do have is spatially biased, with trait coverage being higher in the Global North (in the socioeconomic sense, which includes Australia). Completeness is generally consistent across traits (see also Notes S1 for flower, wood, and seed traits), showing that we have a general lack of trait data in the Global South, as also noted by Cornwell <i>et al</i>. (<span>2019</span>), rather than simply a different set of traits being measured. However, we note that 19% of traits show a negative correlation with mean focal trait completeness (Fig. S7), but only four of these were above our 1% threshold for inclusion: leaf width, length, margin type, and venation type. These exceptions were driven by regional data aggregation efforts focused on Africa and China (Kirkup <i>et al</i>., <span>2005</span>; Prentice <i>et al</i>., <span>2011</span>; Dressler <i>et al</i>., <span>2014</span>). The geographic biases we observe are similar to those observed for phylogenetic data by Rudbeck <i>et al</i>. (<span>2022</span>), with the factors we identified as driving trait data completeness (range size, endemism, species richness, and research expenditure) being a subset of the factors they identified as driving phylogenetic data completeness, suggesting similar mechanisms underlie the acquisition of both data types. Unfortunately, this correspondence between phylogenetic and trait data completeness (Figs 2, S8) likely means that regions with low trait completeness have relatively less to gain via phylogenetic trait imputation.</p><p>Due to the positive correlations between trait data availability and global changes related to temperature and human footprint, many of the regions undergoing the most severe changes may be best positioned to predict those changes. However, this correlation also means that trait data have been disproportionately collected from anthropogenically disturbed regions, which may bias inferences. Conversely, the negative correlation between trait data availability and predicted changes and uncertainty in precipitation may hinder our ability to predict plant responses to altered precipitation. We also note that trait coverage is low in tropical regions which may be approaching climatic thresholds, beyond which irreversible changes in ecosystems may occur (e.g. conversion of rainforest to fire-dominated forests; Malhi <i>et al</i>., <span>2009</span>).</p><p>Our main analysis, which included traits measured anywhere, and our analysis focusing only on traits known to have been measured within particular botanical countries, differed substantially in the magnitude, direction, and significance of socioeconomic and biogeographic predictors. We argue that by removing the confounding influence of shared trait data, the georeferenced data provide a better picture of the drivers of trait data availability at a botanical country level. As we expected, we found trait data completeness is positively associated with national wealth and spending, with wealthier regions that spend more on research and education typically having better coverage. Also as expected, we found that trait coverage is negatively associated with endemism, likely driven by endemics having relatively small ranges and occurring in less-accessible areas, making them less likely to be sampled (Steinbauer <i>et al</i>., <span>2016</span>; Enquist <i>et al</i>., <span>2019</span>). Contrary to our expectations, however, we found that larger, less accessible, less secure, and more species-rich regions tended to have higher trait coverage. One potential reason for these unexpected relationships may be that scientists disproportionately choose to work in regions with these characteristics because of the relatively high species richness they harbor.</p><p>While we identify several factors that are associated with trait data completeness, we caution that relationships with socioeconomic factors can often be inconsistent and context-dependent (Rydén <i>et al</i>., <span>2020</span>; Zizka <i>et al</i>., <span>2021</span>). Furthermore, there are many socioeconomic factors beyond these which may be relevant, and different regions may differ in which factors are relevant (Meyer <i>et al</i>., <span>2015</span>; Zizka <i>et al</i>., <span>2021</span>). While much previous work has focused on the correlates of geographic data availability (e.g. Meyer <i>et al</i>., <span>2015</span>; Hughes <i>et al</i>., <span>2021</span>), less emphasis has been placed on the availability of trait data (but see Cornwell <i>et al</i>., <span>2019</span>). We found no significant correlation between trait completeness and geographic completeness, suggesting that the drivers of these two may differ, although some variables will likely be relevant for both (e.g. research funding; Meyer <i>et al</i>., <span>2015</span>).</p><p>The accuracy of our completeness estimates depends on our estimates of country-level species pools, which are hindered by both the Linnean shortfall (number of undescribed species) and the Wallacean shortfall, which are spatially biased (Meyer <i>et al</i>., <span>2015</span>; Freeman &amp; Pennell, <span>2021</span>; Hughes <i>et al</i>., <span>2021</span>). Thus, our analyses will overestimate the coverage for regions with many undescribed or unrecorded species. This provides a potential alternative explanation for our unexpected finding that relatively inaccessible, large, and insecure regions have higher trait coverage: We may be underestimating the species richness in these regions, thereby increasing completeness estimates. We also caution that our estimates of species range sizes are biased by the available data. Species were assumed to occupy the entirety of the regions they occur in, which will lead to overestimates of range sizes, particularly in large regions.</p><p>We note that in our main analysis (Fig. 1), some countries with high levels of wealth and spending appear relatively data poor (e.g. Australia and New Zealand). However, when we combine AusTraits with TRY, the level of trait completeness in Australia falls in line with levels seen elsewhere in the Global North (Fig. 3), suggesting that a lack of data integration may underlie some perceived knowledge gaps. In addition to relatively high rates of funding, countries in the temperate and polar regions of the Northern Hemisphere also have relatively low species diversity and large species ranges, allowing them to share data across boundaries. By contrast, many tropical countries have low rates of funding, high endemism, and high species diversity, factors which work against trait completeness. Although the focus of this study was botanical countries, the bias we observe will extend to other geographic classifications (e.g. biogeographic realms, biomes, and ecosystems; Olson <i>et al</i>., <span>2001</span>) such that our knowledge of those occurring predominantly in the Global South will tend to be relatively poor.</p><p>The analyses presented here focus on the most widely used global plant trait database, TRY (Kattge <i>et al</i>., <span>2011</span>, <span>2020</span>). However, numerous other plant trait databases and datasets exist, some of which will not have been incorporated in TRY yet, and others which may not be able to be incorporated due to licensing issues. However, by integrating TRY with one open access resource (AusTraits), we were able to more than double trait data completeness for all Australian states (Fig. 3). This supports recent work by Feng <i>et al</i>. (<span>2022</span>) showing that database integration can lead to rapid gains in available data. The AusTraits model demonstrates how the creation and integration of regional databases can rapidly expand trait completeness. With a more limited geographic scope, AusTraits could target its data collection, allowing it to achieve similar completeness to the northern hemisphere despite high endemism and a low population density. This includes personally interacting with researchers to create a sense of community, developing a workflow where a database manager leads the input of datasets, reducing contributor effort, and contacting researchers and repositories with known large datasets. In particular, AusTraits makes use of expertise held within the systematics community, mining data embedded in taxonomic descriptions. This approach provides depth of coverage for key traits such as growth, leaf dimensions, and plant height aiding in the filling of regional gaps for modeling and conservation management. Due to socioeconomic limitations, regional efforts in some areas (particularly in the Global South) may only be feasible with South–South and South–North collaborative efforts. Data source integration is currently hindered by variation in data structure, data availability, taxonomy, and even trait names (Gallagher <i>et al</i>., <span>2020b</span>), so new, regional efforts would benefit from adopting existing tools, taxonomies, and data structures (e.g. Boyle <i>et al</i>., <span>2013</span>; Schneider <i>et al</i>., <span>2019</span>; Falster <i>et al</i>., <span>2021</span>).</p><p>In this study, we quantified trait data completeness relative to a subset of traits provided in the TRY database, but there are an infinite number of possible traits: Measurements can be taken at any point, organ, or developmental stage on an individual, at multiple levels of organization, and combined in any way (e.g. dry leaf mass per leaf area and above ground biomass divided by belowground biomass). A lack of standard trait definitions makes the integration of different databases challenging (Garnier <i>et al</i>., <span>2016</span>). Thankfully, plant traits are often strongly correlated (Westoby <i>et al</i>., <span>2002</span>; Wright <i>et al</i>., <span>2004</span>; Díaz <i>et al</i>., <span>2016</span>; Zeballos <i>et al</i>., <span>2017</span>), and even sparse trait coverage may allow us to say something about the overall phenotypes of species (Mouillot <i>et al</i>., <span>2021</span>), particularly when combined with phylogenetic information (Penone <i>et al</i>., <span>2014</span>) or geographic information (Sandel <i>et al</i>., <span>2021</span>). Imputation methods based on trait and phylogenetic correlations also provide estimates of uncertainty which can be used in sampling prioritization. Thus, global collection efforts focused on key traits representing known trait spectra (e.g. Westoby <i>et al</i>., <span>2002</span>; Wright <i>et al</i>., <span>2004</span>; Díaz <i>et al</i>., <span>2016</span>; Zeballos <i>et al</i>., <span>2017</span>), particularly in taxa or regions of high uncertainty, may be a reasonable path forward. For such large-scale efforts, traits that may be especially important to focus on are those that are: broadly relevant and apply to most or all plant species; of importance for many ecological processes; and fast and affordable to measure. For example, plant height, leaf mass per area, and leaf dry matter content are all strong candidates. However, we also acknowledge that the choice of which traits to measure is ultimately driven by the research question, and a diversity of research questions necessitates a diversity of traits.</p><p>While plant traits are critically important across disciplines and are urgently needed to allow us to predict responses to global change, the current state of our knowledge is both incomplete and geographically biased. Given the current state of available data, large scale (e.g. country, biome, and global) analyses of plant traits must be interpreted with caution and should attempt to quantify uncertainty caused by these massive data gaps. Moving forward, researchers can help by publishing their data and metadata openly, including the full set of raw and derived trait data (Keller <i>et al</i>., <span>2023</span>). At larger scales, efforts to mobilize and integrate existing datasets, particularly those focused on particular geographic regions (e.g. Tavşanoğlu &amp; Pausas, <span>2018</span>; Falster <i>et al</i>., <span>2021</span>; Báez <i>et al</i>., <span>2022</span>) or types of traits (e.g. Iversen <i>et al</i>., <span>2017</span>; LeBauer <i>et al</i>., <span>2018</span>; Guerrero-Ramírez <i>et al</i>., <span>2021</span>), hold promise to rapidly advance the state of our knowledge (Feng <i>et al</i>., <span>2022</span>). However, what is ultimately needed is more data collection and research, particularly in the Global South, which requires funding, infrastructure, and capacity building.</p><p>None declared.</p><p>BSM and WLE conceived the initial idea. BSM, WLE, RG, J-CS and MT planned and designed the research. RG, J-CS, MT, EHW and WLE provided data. BSM conducted analyses with feedback from all authors. BSM led the writing with contributions from all authors.</p>","PeriodicalId":48887,"journal":{"name":"New Phytologist","volume":"240 4","pages":"1345-1354"},"PeriodicalIF":9.4000,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/nph.18999","citationCount":"0","resultStr":"{\"title\":\"A global assessment of the Raunkiæran shortfall in plants: geographic biases in our knowledge of plant traits\",\"authors\":\"Brian Maitner,&nbsp;Rachael Gallagher,&nbsp;Jens-Christian Svenning,&nbsp;Melanie Tietje,&nbsp;Elizabeth H. Wenk,&nbsp;Wolf L. Eiserhardt\",\"doi\":\"10.1111/nph.18999\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The functional traits (measured attributes) of organisms result from interactions with their biotic and abiotic environment. Traits allow us to understand both how individuals and the communities they form will respond to environmental change and how these changes will impact ecosystem services and processes (Lavorel &amp; Garnier, <span>2002</span>). Plants constitute most of the biomass on Earth (<i>c</i>. 82%; Bar-On <i>et al</i>., <span>2018</span>), and their traits are the predominant drivers of terrestrial ecosystem functioning (Migliavacca <i>et al</i>., <span>2021</span>; Fricke <i>et al</i>., <span>2022</span>). Thus, to a first-order approximation, understanding the traits of plants means understanding terrestrial ecosystems.</p><p>There remains a sustained interest in both trait-based ecology (e.g. Lavorel &amp; Garnier, <span>2002</span>; McGill <i>et al</i>., <span>2006</span>; Violle <i>et al</i>., <span>2007</span>; Mouillot <i>et al</i>., <span>2021</span>) and Open Science (Cheruvelil &amp; Soranno, <span>2018</span>; Gallagher <i>et al</i>., <span>2020b</span>; Geange <i>et al</i>., <span>2021</span>), both of which have contributed to the creation and sharing of large compilations of plant traits constituting millions of observations (e.g. Kattge <i>et al</i>., <span>2011</span>; Maitner <i>et al</i>., <span>2017</span>; Sauquet <i>et al</i>., <span>2017</span>; Weigelt <i>et al</i>., <span>2020</span>; Falster <i>et al</i>., <span>2021</span>). However, despite this growing wealth of data, our knowledge of plant traits remains far from complete (the ‘Raunkiæran shortfall’; Hortal <i>et al</i>., <span>2015</span>).</p><p>In addition to trait data being incomplete, recent work by Cornwell <i>et al</i>. (<span>2019</span>) suggests our knowledge of plant traits is also spatially biased, with marked latitudinal variation in coverage. The causes of these biases have not been rigorously tested, but may be driven by: wealthier countries being able to collect and disseminate more data (Meyer <i>et al</i>., <span>2015</span>); smaller and more-accessible countries being able to sample proportionally more species (Hijmans <i>et al</i>., <span>2000</span>; Kadmon <i>et al</i>., <span>2004</span>; Hughes <i>et al</i>., <span>2021</span>); and countries with few species and low endemism reaching higher completeness more easily. These spatial biases in turn may limit our ability to respond to urgent global changes, particularly if there are discrepancies between where the data are most urgently needed (e.g. where changes are highly uncertain or projected to be severe) and where they are being collected.</p><p>Cornwell <i>et al</i>. (<span>2019</span>) examined the coverage of a range of attributes, including traits, in the global flora with a focus on assessing the completeness (fraction of plant species with data available) of information using The Plant List as a taxonomic backbone. Here, we expand on this work by mapping trait completeness (the fraction of species with freely available trait data for a given trait) globally, using the geographic and taxonomic information in the recently completed World Checklist of Vascular Plants (WCVP; Govaerts <i>et al</i>., <span>2021</span>) and trait information from the widely used TRY database (the most commonly used global plant trait database; Kattge <i>et al</i>., <span>2011</span>). We test hypothesized drivers of variation in plant trait data availability across the globe, including factors related to: wealth, research expenditure, and educational expenditure; region size and accessibility; and biogeography. We compare spatial patterns of trait data completeness with those of phylogenetic and distributional data completeness (Rudbeck <i>et al</i>., <span>2022</span>). We test for correlations between trait data completeness and impacts of global change. Finally, we identify solutions for filling significant regional gaps in trait data completeness using Open Science approaches, highlighting the model developed for the <span>AusTraits</span> database (Falster <i>et al</i>., <span>2021</span>).</p><p>The set of public trait data we received from TRY contained 1.5 million unique species × trait combinations across 10.4 million trait observations. Of the 2027 unique traits, the most complete was plant growth form (32.5%; 113 620 species). Most traits had poor coverage (Fig. S1), with a mean global completeness across all traits of 0.21% and a median of 0.0051%. Surprisingly, some traits calculated from multiple measurements had higher coverage than their components (e.g. SLA was available for 14 127 species and leaf dry mass for 6064). After excluding traits with data for &lt; 1% of species (3500), which could not be applied to all species, or which were not properties of individuals, our focal dataset included 5.1 million records across 122 230 species (34.9% of vascular plants) and 53 traits (Table S1). Trait coverage ranged between 1.01% and 32.5% global coverage (mean = 3.48%, median = 1.88%). For a single trait, within a single botanical country, trait completeness varied widely, ranging between 0 and 100% (mean = 19.4%, median = 12.7%), with entirely complete or incomplete data found on Antarctica and small islands. Mean completeness across traits ranged between 2.8% (New Guinea) and 58.7% (Føroyar) across countries (mean = 19.4%, median = 17.3%, Fig. 1). Correlations in completeness across botanical countries were positive and significant for the majority of focal traits, with the exception of four leaf morphological traits (width, length, margin type, and venation type; Fig. S2).</p><p>We found that the effects of four of our predictor variables had 95% confidence intervals that excluded zero (Table S2). Trait completeness was positively associated with mean species range size (0.41, 95% CI [0.37, 0.45]) and research expenditure (0.06, 95% CI [0.03, 0.09]). Trait completeness was negatively associated with endemism (−0.13, 95% CI [−0.15, −0.10]) and vascular plant species richness (−0.09, 95% CI [−0.12, −0.06]).</p><p>Despite the massive amount of trait data collated to date, we are a long way from fully capturing some of the simplest traits for the majority of plant species across the world's botanical countries. All 53 focal traits examined here were below the 40% coverage threshold noted by Penone <i>et al</i>. (<span>2014</span>) as being needed to impute missing traits with confidence, suggesting that imputation is not yet an option at a global level (but may be of use within certain regions). Our mapping shows that what information we do have is spatially biased, with trait coverage being higher in the Global North (in the socioeconomic sense, which includes Australia). Completeness is generally consistent across traits (see also Notes S1 for flower, wood, and seed traits), showing that we have a general lack of trait data in the Global South, as also noted by Cornwell <i>et al</i>. (<span>2019</span>), rather than simply a different set of traits being measured. However, we note that 19% of traits show a negative correlation with mean focal trait completeness (Fig. S7), but only four of these were above our 1% threshold for inclusion: leaf width, length, margin type, and venation type. These exceptions were driven by regional data aggregation efforts focused on Africa and China (Kirkup <i>et al</i>., <span>2005</span>; Prentice <i>et al</i>., <span>2011</span>; Dressler <i>et al</i>., <span>2014</span>). The geographic biases we observe are similar to those observed for phylogenetic data by Rudbeck <i>et al</i>. (<span>2022</span>), with the factors we identified as driving trait data completeness (range size, endemism, species richness, and research expenditure) being a subset of the factors they identified as driving phylogenetic data completeness, suggesting similar mechanisms underlie the acquisition of both data types. Unfortunately, this correspondence between phylogenetic and trait data completeness (Figs 2, S8) likely means that regions with low trait completeness have relatively less to gain via phylogenetic trait imputation.</p><p>Due to the positive correlations between trait data availability and global changes related to temperature and human footprint, many of the regions undergoing the most severe changes may be best positioned to predict those changes. However, this correlation also means that trait data have been disproportionately collected from anthropogenically disturbed regions, which may bias inferences. Conversely, the negative correlation between trait data availability and predicted changes and uncertainty in precipitation may hinder our ability to predict plant responses to altered precipitation. We also note that trait coverage is low in tropical regions which may be approaching climatic thresholds, beyond which irreversible changes in ecosystems may occur (e.g. conversion of rainforest to fire-dominated forests; Malhi <i>et al</i>., <span>2009</span>).</p><p>Our main analysis, which included traits measured anywhere, and our analysis focusing only on traits known to have been measured within particular botanical countries, differed substantially in the magnitude, direction, and significance of socioeconomic and biogeographic predictors. We argue that by removing the confounding influence of shared trait data, the georeferenced data provide a better picture of the drivers of trait data availability at a botanical country level. As we expected, we found trait data completeness is positively associated with national wealth and spending, with wealthier regions that spend more on research and education typically having better coverage. Also as expected, we found that trait coverage is negatively associated with endemism, likely driven by endemics having relatively small ranges and occurring in less-accessible areas, making them less likely to be sampled (Steinbauer <i>et al</i>., <span>2016</span>; Enquist <i>et al</i>., <span>2019</span>). Contrary to our expectations, however, we found that larger, less accessible, less secure, and more species-rich regions tended to have higher trait coverage. One potential reason for these unexpected relationships may be that scientists disproportionately choose to work in regions with these characteristics because of the relatively high species richness they harbor.</p><p>While we identify several factors that are associated with trait data completeness, we caution that relationships with socioeconomic factors can often be inconsistent and context-dependent (Rydén <i>et al</i>., <span>2020</span>; Zizka <i>et al</i>., <span>2021</span>). Furthermore, there are many socioeconomic factors beyond these which may be relevant, and different regions may differ in which factors are relevant (Meyer <i>et al</i>., <span>2015</span>; Zizka <i>et al</i>., <span>2021</span>). While much previous work has focused on the correlates of geographic data availability (e.g. Meyer <i>et al</i>., <span>2015</span>; Hughes <i>et al</i>., <span>2021</span>), less emphasis has been placed on the availability of trait data (but see Cornwell <i>et al</i>., <span>2019</span>). We found no significant correlation between trait completeness and geographic completeness, suggesting that the drivers of these two may differ, although some variables will likely be relevant for both (e.g. research funding; Meyer <i>et al</i>., <span>2015</span>).</p><p>The accuracy of our completeness estimates depends on our estimates of country-level species pools, which are hindered by both the Linnean shortfall (number of undescribed species) and the Wallacean shortfall, which are spatially biased (Meyer <i>et al</i>., <span>2015</span>; Freeman &amp; Pennell, <span>2021</span>; Hughes <i>et al</i>., <span>2021</span>). Thus, our analyses will overestimate the coverage for regions with many undescribed or unrecorded species. This provides a potential alternative explanation for our unexpected finding that relatively inaccessible, large, and insecure regions have higher trait coverage: We may be underestimating the species richness in these regions, thereby increasing completeness estimates. We also caution that our estimates of species range sizes are biased by the available data. Species were assumed to occupy the entirety of the regions they occur in, which will lead to overestimates of range sizes, particularly in large regions.</p><p>We note that in our main analysis (Fig. 1), some countries with high levels of wealth and spending appear relatively data poor (e.g. Australia and New Zealand). However, when we combine AusTraits with TRY, the level of trait completeness in Australia falls in line with levels seen elsewhere in the Global North (Fig. 3), suggesting that a lack of data integration may underlie some perceived knowledge gaps. In addition to relatively high rates of funding, countries in the temperate and polar regions of the Northern Hemisphere also have relatively low species diversity and large species ranges, allowing them to share data across boundaries. By contrast, many tropical countries have low rates of funding, high endemism, and high species diversity, factors which work against trait completeness. Although the focus of this study was botanical countries, the bias we observe will extend to other geographic classifications (e.g. biogeographic realms, biomes, and ecosystems; Olson <i>et al</i>., <span>2001</span>) such that our knowledge of those occurring predominantly in the Global South will tend to be relatively poor.</p><p>The analyses presented here focus on the most widely used global plant trait database, TRY (Kattge <i>et al</i>., <span>2011</span>, <span>2020</span>). However, numerous other plant trait databases and datasets exist, some of which will not have been incorporated in TRY yet, and others which may not be able to be incorporated due to licensing issues. However, by integrating TRY with one open access resource (AusTraits), we were able to more than double trait data completeness for all Australian states (Fig. 3). This supports recent work by Feng <i>et al</i>. (<span>2022</span>) showing that database integration can lead to rapid gains in available data. The AusTraits model demonstrates how the creation and integration of regional databases can rapidly expand trait completeness. With a more limited geographic scope, AusTraits could target its data collection, allowing it to achieve similar completeness to the northern hemisphere despite high endemism and a low population density. This includes personally interacting with researchers to create a sense of community, developing a workflow where a database manager leads the input of datasets, reducing contributor effort, and contacting researchers and repositories with known large datasets. In particular, AusTraits makes use of expertise held within the systematics community, mining data embedded in taxonomic descriptions. This approach provides depth of coverage for key traits such as growth, leaf dimensions, and plant height aiding in the filling of regional gaps for modeling and conservation management. Due to socioeconomic limitations, regional efforts in some areas (particularly in the Global South) may only be feasible with South–South and South–North collaborative efforts. Data source integration is currently hindered by variation in data structure, data availability, taxonomy, and even trait names (Gallagher <i>et al</i>., <span>2020b</span>), so new, regional efforts would benefit from adopting existing tools, taxonomies, and data structures (e.g. Boyle <i>et al</i>., <span>2013</span>; Schneider <i>et al</i>., <span>2019</span>; Falster <i>et al</i>., <span>2021</span>).</p><p>In this study, we quantified trait data completeness relative to a subset of traits provided in the TRY database, but there are an infinite number of possible traits: Measurements can be taken at any point, organ, or developmental stage on an individual, at multiple levels of organization, and combined in any way (e.g. dry leaf mass per leaf area and above ground biomass divided by belowground biomass). A lack of standard trait definitions makes the integration of different databases challenging (Garnier <i>et al</i>., <span>2016</span>). Thankfully, plant traits are often strongly correlated (Westoby <i>et al</i>., <span>2002</span>; Wright <i>et al</i>., <span>2004</span>; Díaz <i>et al</i>., <span>2016</span>; Zeballos <i>et al</i>., <span>2017</span>), and even sparse trait coverage may allow us to say something about the overall phenotypes of species (Mouillot <i>et al</i>., <span>2021</span>), particularly when combined with phylogenetic information (Penone <i>et al</i>., <span>2014</span>) or geographic information (Sandel <i>et al</i>., <span>2021</span>). Imputation methods based on trait and phylogenetic correlations also provide estimates of uncertainty which can be used in sampling prioritization. Thus, global collection efforts focused on key traits representing known trait spectra (e.g. Westoby <i>et al</i>., <span>2002</span>; Wright <i>et al</i>., <span>2004</span>; Díaz <i>et al</i>., <span>2016</span>; Zeballos <i>et al</i>., <span>2017</span>), particularly in taxa or regions of high uncertainty, may be a reasonable path forward. For such large-scale efforts, traits that may be especially important to focus on are those that are: broadly relevant and apply to most or all plant species; of importance for many ecological processes; and fast and affordable to measure. For example, plant height, leaf mass per area, and leaf dry matter content are all strong candidates. However, we also acknowledge that the choice of which traits to measure is ultimately driven by the research question, and a diversity of research questions necessitates a diversity of traits.</p><p>While plant traits are critically important across disciplines and are urgently needed to allow us to predict responses to global change, the current state of our knowledge is both incomplete and geographically biased. Given the current state of available data, large scale (e.g. country, biome, and global) analyses of plant traits must be interpreted with caution and should attempt to quantify uncertainty caused by these massive data gaps. Moving forward, researchers can help by publishing their data and metadata openly, including the full set of raw and derived trait data (Keller <i>et al</i>., <span>2023</span>). At larger scales, efforts to mobilize and integrate existing datasets, particularly those focused on particular geographic regions (e.g. Tavşanoğlu &amp; Pausas, <span>2018</span>; Falster <i>et al</i>., <span>2021</span>; Báez <i>et al</i>., <span>2022</span>) or types of traits (e.g. Iversen <i>et al</i>., <span>2017</span>; LeBauer <i>et al</i>., <span>2018</span>; Guerrero-Ramírez <i>et al</i>., <span>2021</span>), hold promise to rapidly advance the state of our knowledge (Feng <i>et al</i>., <span>2022</span>). However, what is ultimately needed is more data collection and research, particularly in the Global South, which requires funding, infrastructure, and capacity building.</p><p>None declared.</p><p>BSM and WLE conceived the initial idea. BSM, WLE, RG, J-CS and MT planned and designed the research. RG, J-CS, MT, EHW and WLE provided data. BSM conducted analyses with feedback from all authors. BSM led the writing with contributions from all authors.</p>\",\"PeriodicalId\":48887,\"journal\":{\"name\":\"New Phytologist\",\"volume\":\"240 4\",\"pages\":\"1345-1354\"},\"PeriodicalIF\":9.4000,\"publicationDate\":\"2023-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/nph.18999\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"New Phytologist\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/nph.18999\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Phytologist","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/nph.18999","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 0

摘要

除了四个叶片形态特征(宽度、长度、边缘类型和脉序类型;图S2)外,植物学国家之间的完整性相关性对大多数焦点性状都是正相关的,并且是显著的。我们发现,我们的四个预测变量的影响有95%的置信区间,排除了零(表S2)。性状完整性与平均物种范围大小(0.41,95%CI[0.37,0.45])和研究支出(0.06,95%CI[0.03,0.09])呈正相关。性状完全性与特有性(-0.13,95%CI[-0.15,-0.10])和维管植物物种丰富度(-0.09,95%CI[-0.12,-0.06])呈负相关。尽管迄今为止整理了大量的性状数据,我们离完全捕捉世界植物国家大多数植物物种的一些最简单特征还有很长的路要走。这里检查的所有53个焦点特征都低于Penone等人指出的40%覆盖率阈值。(2014),因为这是自信地估算缺失特征所必需的,这表明估算在全球范围内还不是一种选择(但可能在某些地区有用)。我们的地图显示,我们所掌握的信息在空间上是有偏见的,全球北方(从社会经济意义上讲,包括澳大利亚)的特征覆盖率更高。完整性在各个性状之间通常是一致的(也见注释S1中的花、木和种子性状),这表明我们在全球南方普遍缺乏性状数据,正如Cornwell等人也指出的那样。(2019),而不是简单地测量一组不同的特征。然而,我们注意到19%的性状与平均焦点性状完整性呈负相关(图S7),但其中只有四个性状高于我们1%的纳入阈值:叶宽、长度、边缘类型和脉序类型。这些例外是由专注于非洲和中国的区域数据聚合工作推动的(Kirkup et al.,2005;Prentice et al.,2011;Dressler et al.,2014)。我们观察到的地理偏差与Rudbeck et al.在系统发育数据中观察到的相似。(2022),我们确定的驱动性状数据完整性的因素(范围大小、地方性、物种丰富度和研究支出)是他们确定的驱动系统发育数据完整性因素的子集,这表明两种数据类型的获取机制相似。不幸的是,系统发育和特征数据完整性之间的这种对应关系(图2,S8)可能意味着特征完整性较低的区域通过系统发育特征插补获得的收益相对较少。由于性状数据的可用性与温度和人类足迹相关的全球变化之间存在正相关性,许多经历最严重变化的地区可能最适合预测这些变化。然而,这种相关性也意味着,特征数据是从受人类遗传学干扰的地区不成比例地收集的,这可能会使推断产生偏差。相反,特征数据可用性与预测的降水变化和不确定性之间的负相关性可能会阻碍我们预测植物对降水变化的反应的能力。我们还注意到,热带地区的特征覆盖率很低,这些地区可能正在接近气候阈值,超过气候阈值,生态系统可能会发生不可逆转的变化(例如,雨林转变为以火灾为主的森林;Malhi等人,2009年),我们的分析只关注已知在特定植物国家测量的特征,在社会经济和生物地理学预测因子的大小、方向和意义上存在很大差异。我们认为,通过消除共享性状数据的混杂影响,地理参考数据可以更好地了解植物国家层面性状数据可用性的驱动因素。正如我们所预期的那样,我们发现特征数据的完整性与国家财富和支出呈正相关,在研究和教育上支出更多的富裕地区通常覆盖率更高。同样正如预期的那样,我们发现性状覆盖率与地方病呈负相关,这可能是由范围相对较小且发生在不太容易到达的地区的地方病引起的,这使得它们不太可能被采样(Steinbauer et al.,2016;Enquist et al.,2019)。然而,与我们的预期相反,我们发现,物种丰富的区域往往具有更高的性状覆盖率。这些意想不到的关系的一个潜在原因可能是,科学家们不成比例地选择在具有这些特征的地区工作,因为这些地区拥有相对较高的物种丰富度。虽然我们确定了几个与特质数据完整性相关的因素,但我们警告说,与社会经济因素的关系往往是不一致的,并且依赖于上下文(Rydén等人,2020;Zizka等人,2021)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A global assessment of the Raunkiæran shortfall in plants: geographic biases in our knowledge of plant traits

The functional traits (measured attributes) of organisms result from interactions with their biotic and abiotic environment. Traits allow us to understand both how individuals and the communities they form will respond to environmental change and how these changes will impact ecosystem services and processes (Lavorel & Garnier, 2002). Plants constitute most of the biomass on Earth (c. 82%; Bar-On et al., 2018), and their traits are the predominant drivers of terrestrial ecosystem functioning (Migliavacca et al., 2021; Fricke et al., 2022). Thus, to a first-order approximation, understanding the traits of plants means understanding terrestrial ecosystems.

There remains a sustained interest in both trait-based ecology (e.g. Lavorel & Garnier, 2002; McGill et al., 2006; Violle et al., 2007; Mouillot et al., 2021) and Open Science (Cheruvelil & Soranno, 2018; Gallagher et al., 2020b; Geange et al., 2021), both of which have contributed to the creation and sharing of large compilations of plant traits constituting millions of observations (e.g. Kattge et al., 2011; Maitner et al., 2017; Sauquet et al., 2017; Weigelt et al., 2020; Falster et al., 2021). However, despite this growing wealth of data, our knowledge of plant traits remains far from complete (the ‘Raunkiæran shortfall’; Hortal et al., 2015).

In addition to trait data being incomplete, recent work by Cornwell et al. (2019) suggests our knowledge of plant traits is also spatially biased, with marked latitudinal variation in coverage. The causes of these biases have not been rigorously tested, but may be driven by: wealthier countries being able to collect and disseminate more data (Meyer et al., 2015); smaller and more-accessible countries being able to sample proportionally more species (Hijmans et al., 2000; Kadmon et al., 2004; Hughes et al., 2021); and countries with few species and low endemism reaching higher completeness more easily. These spatial biases in turn may limit our ability to respond to urgent global changes, particularly if there are discrepancies between where the data are most urgently needed (e.g. where changes are highly uncertain or projected to be severe) and where they are being collected.

Cornwell et al. (2019) examined the coverage of a range of attributes, including traits, in the global flora with a focus on assessing the completeness (fraction of plant species with data available) of information using The Plant List as a taxonomic backbone. Here, we expand on this work by mapping trait completeness (the fraction of species with freely available trait data for a given trait) globally, using the geographic and taxonomic information in the recently completed World Checklist of Vascular Plants (WCVP; Govaerts et al., 2021) and trait information from the widely used TRY database (the most commonly used global plant trait database; Kattge et al., 2011). We test hypothesized drivers of variation in plant trait data availability across the globe, including factors related to: wealth, research expenditure, and educational expenditure; region size and accessibility; and biogeography. We compare spatial patterns of trait data completeness with those of phylogenetic and distributional data completeness (Rudbeck et al., 2022). We test for correlations between trait data completeness and impacts of global change. Finally, we identify solutions for filling significant regional gaps in trait data completeness using Open Science approaches, highlighting the model developed for the AusTraits database (Falster et al., 2021).

The set of public trait data we received from TRY contained 1.5 million unique species × trait combinations across 10.4 million trait observations. Of the 2027 unique traits, the most complete was plant growth form (32.5%; 113 620 species). Most traits had poor coverage (Fig. S1), with a mean global completeness across all traits of 0.21% and a median of 0.0051%. Surprisingly, some traits calculated from multiple measurements had higher coverage than their components (e.g. SLA was available for 14 127 species and leaf dry mass for 6064). After excluding traits with data for < 1% of species (3500), which could not be applied to all species, or which were not properties of individuals, our focal dataset included 5.1 million records across 122 230 species (34.9% of vascular plants) and 53 traits (Table S1). Trait coverage ranged between 1.01% and 32.5% global coverage (mean = 3.48%, median = 1.88%). For a single trait, within a single botanical country, trait completeness varied widely, ranging between 0 and 100% (mean = 19.4%, median = 12.7%), with entirely complete or incomplete data found on Antarctica and small islands. Mean completeness across traits ranged between 2.8% (New Guinea) and 58.7% (Føroyar) across countries (mean = 19.4%, median = 17.3%, Fig. 1). Correlations in completeness across botanical countries were positive and significant for the majority of focal traits, with the exception of four leaf morphological traits (width, length, margin type, and venation type; Fig. S2).

We found that the effects of four of our predictor variables had 95% confidence intervals that excluded zero (Table S2). Trait completeness was positively associated with mean species range size (0.41, 95% CI [0.37, 0.45]) and research expenditure (0.06, 95% CI [0.03, 0.09]). Trait completeness was negatively associated with endemism (−0.13, 95% CI [−0.15, −0.10]) and vascular plant species richness (−0.09, 95% CI [−0.12, −0.06]).

Despite the massive amount of trait data collated to date, we are a long way from fully capturing some of the simplest traits for the majority of plant species across the world's botanical countries. All 53 focal traits examined here were below the 40% coverage threshold noted by Penone et al. (2014) as being needed to impute missing traits with confidence, suggesting that imputation is not yet an option at a global level (but may be of use within certain regions). Our mapping shows that what information we do have is spatially biased, with trait coverage being higher in the Global North (in the socioeconomic sense, which includes Australia). Completeness is generally consistent across traits (see also Notes S1 for flower, wood, and seed traits), showing that we have a general lack of trait data in the Global South, as also noted by Cornwell et al. (2019), rather than simply a different set of traits being measured. However, we note that 19% of traits show a negative correlation with mean focal trait completeness (Fig. S7), but only four of these were above our 1% threshold for inclusion: leaf width, length, margin type, and venation type. These exceptions were driven by regional data aggregation efforts focused on Africa and China (Kirkup et al., 2005; Prentice et al., 2011; Dressler et al., 2014). The geographic biases we observe are similar to those observed for phylogenetic data by Rudbeck et al. (2022), with the factors we identified as driving trait data completeness (range size, endemism, species richness, and research expenditure) being a subset of the factors they identified as driving phylogenetic data completeness, suggesting similar mechanisms underlie the acquisition of both data types. Unfortunately, this correspondence between phylogenetic and trait data completeness (Figs 2, S8) likely means that regions with low trait completeness have relatively less to gain via phylogenetic trait imputation.

Due to the positive correlations between trait data availability and global changes related to temperature and human footprint, many of the regions undergoing the most severe changes may be best positioned to predict those changes. However, this correlation also means that trait data have been disproportionately collected from anthropogenically disturbed regions, which may bias inferences. Conversely, the negative correlation between trait data availability and predicted changes and uncertainty in precipitation may hinder our ability to predict plant responses to altered precipitation. We also note that trait coverage is low in tropical regions which may be approaching climatic thresholds, beyond which irreversible changes in ecosystems may occur (e.g. conversion of rainforest to fire-dominated forests; Malhi et al., 2009).

Our main analysis, which included traits measured anywhere, and our analysis focusing only on traits known to have been measured within particular botanical countries, differed substantially in the magnitude, direction, and significance of socioeconomic and biogeographic predictors. We argue that by removing the confounding influence of shared trait data, the georeferenced data provide a better picture of the drivers of trait data availability at a botanical country level. As we expected, we found trait data completeness is positively associated with national wealth and spending, with wealthier regions that spend more on research and education typically having better coverage. Also as expected, we found that trait coverage is negatively associated with endemism, likely driven by endemics having relatively small ranges and occurring in less-accessible areas, making them less likely to be sampled (Steinbauer et al., 2016; Enquist et al., 2019). Contrary to our expectations, however, we found that larger, less accessible, less secure, and more species-rich regions tended to have higher trait coverage. One potential reason for these unexpected relationships may be that scientists disproportionately choose to work in regions with these characteristics because of the relatively high species richness they harbor.

While we identify several factors that are associated with trait data completeness, we caution that relationships with socioeconomic factors can often be inconsistent and context-dependent (Rydén et al., 2020; Zizka et al., 2021). Furthermore, there are many socioeconomic factors beyond these which may be relevant, and different regions may differ in which factors are relevant (Meyer et al., 2015; Zizka et al., 2021). While much previous work has focused on the correlates of geographic data availability (e.g. Meyer et al., 2015; Hughes et al., 2021), less emphasis has been placed on the availability of trait data (but see Cornwell et al., 2019). We found no significant correlation between trait completeness and geographic completeness, suggesting that the drivers of these two may differ, although some variables will likely be relevant for both (e.g. research funding; Meyer et al., 2015).

The accuracy of our completeness estimates depends on our estimates of country-level species pools, which are hindered by both the Linnean shortfall (number of undescribed species) and the Wallacean shortfall, which are spatially biased (Meyer et al., 2015; Freeman & Pennell, 2021; Hughes et al., 2021). Thus, our analyses will overestimate the coverage for regions with many undescribed or unrecorded species. This provides a potential alternative explanation for our unexpected finding that relatively inaccessible, large, and insecure regions have higher trait coverage: We may be underestimating the species richness in these regions, thereby increasing completeness estimates. We also caution that our estimates of species range sizes are biased by the available data. Species were assumed to occupy the entirety of the regions they occur in, which will lead to overestimates of range sizes, particularly in large regions.

We note that in our main analysis (Fig. 1), some countries with high levels of wealth and spending appear relatively data poor (e.g. Australia and New Zealand). However, when we combine AusTraits with TRY, the level of trait completeness in Australia falls in line with levels seen elsewhere in the Global North (Fig. 3), suggesting that a lack of data integration may underlie some perceived knowledge gaps. In addition to relatively high rates of funding, countries in the temperate and polar regions of the Northern Hemisphere also have relatively low species diversity and large species ranges, allowing them to share data across boundaries. By contrast, many tropical countries have low rates of funding, high endemism, and high species diversity, factors which work against trait completeness. Although the focus of this study was botanical countries, the bias we observe will extend to other geographic classifications (e.g. biogeographic realms, biomes, and ecosystems; Olson et al., 2001) such that our knowledge of those occurring predominantly in the Global South will tend to be relatively poor.

The analyses presented here focus on the most widely used global plant trait database, TRY (Kattge et al., 2011, 2020). However, numerous other plant trait databases and datasets exist, some of which will not have been incorporated in TRY yet, and others which may not be able to be incorporated due to licensing issues. However, by integrating TRY with one open access resource (AusTraits), we were able to more than double trait data completeness for all Australian states (Fig. 3). This supports recent work by Feng et al. (2022) showing that database integration can lead to rapid gains in available data. The AusTraits model demonstrates how the creation and integration of regional databases can rapidly expand trait completeness. With a more limited geographic scope, AusTraits could target its data collection, allowing it to achieve similar completeness to the northern hemisphere despite high endemism and a low population density. This includes personally interacting with researchers to create a sense of community, developing a workflow where a database manager leads the input of datasets, reducing contributor effort, and contacting researchers and repositories with known large datasets. In particular, AusTraits makes use of expertise held within the systematics community, mining data embedded in taxonomic descriptions. This approach provides depth of coverage for key traits such as growth, leaf dimensions, and plant height aiding in the filling of regional gaps for modeling and conservation management. Due to socioeconomic limitations, regional efforts in some areas (particularly in the Global South) may only be feasible with South–South and South–North collaborative efforts. Data source integration is currently hindered by variation in data structure, data availability, taxonomy, and even trait names (Gallagher et al., 2020b), so new, regional efforts would benefit from adopting existing tools, taxonomies, and data structures (e.g. Boyle et al., 2013; Schneider et al., 2019; Falster et al., 2021).

In this study, we quantified trait data completeness relative to a subset of traits provided in the TRY database, but there are an infinite number of possible traits: Measurements can be taken at any point, organ, or developmental stage on an individual, at multiple levels of organization, and combined in any way (e.g. dry leaf mass per leaf area and above ground biomass divided by belowground biomass). A lack of standard trait definitions makes the integration of different databases challenging (Garnier et al., 2016). Thankfully, plant traits are often strongly correlated (Westoby et al., 2002; Wright et al., 2004; Díaz et al., 2016; Zeballos et al., 2017), and even sparse trait coverage may allow us to say something about the overall phenotypes of species (Mouillot et al., 2021), particularly when combined with phylogenetic information (Penone et al., 2014) or geographic information (Sandel et al., 2021). Imputation methods based on trait and phylogenetic correlations also provide estimates of uncertainty which can be used in sampling prioritization. Thus, global collection efforts focused on key traits representing known trait spectra (e.g. Westoby et al., 2002; Wright et al., 2004; Díaz et al., 2016; Zeballos et al., 2017), particularly in taxa or regions of high uncertainty, may be a reasonable path forward. For such large-scale efforts, traits that may be especially important to focus on are those that are: broadly relevant and apply to most or all plant species; of importance for many ecological processes; and fast and affordable to measure. For example, plant height, leaf mass per area, and leaf dry matter content are all strong candidates. However, we also acknowledge that the choice of which traits to measure is ultimately driven by the research question, and a diversity of research questions necessitates a diversity of traits.

While plant traits are critically important across disciplines and are urgently needed to allow us to predict responses to global change, the current state of our knowledge is both incomplete and geographically biased. Given the current state of available data, large scale (e.g. country, biome, and global) analyses of plant traits must be interpreted with caution and should attempt to quantify uncertainty caused by these massive data gaps. Moving forward, researchers can help by publishing their data and metadata openly, including the full set of raw and derived trait data (Keller et al., 2023). At larger scales, efforts to mobilize and integrate existing datasets, particularly those focused on particular geographic regions (e.g. Tavşanoğlu & Pausas, 2018; Falster et al., 2021; Báez et al., 2022) or types of traits (e.g. Iversen et al., 2017; LeBauer et al., 2018; Guerrero-Ramírez et al., 2021), hold promise to rapidly advance the state of our knowledge (Feng et al., 2022). However, what is ultimately needed is more data collection and research, particularly in the Global South, which requires funding, infrastructure, and capacity building.

None declared.

BSM and WLE conceived the initial idea. BSM, WLE, RG, J-CS and MT planned and designed the research. RG, J-CS, MT, EHW and WLE provided data. BSM conducted analyses with feedback from all authors. BSM led the writing with contributions from all authors.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
New Phytologist
New Phytologist PLANT SCIENCES-
CiteScore
17.60
自引率
5.30%
发文量
728
审稿时长
1 months
期刊介绍: New Phytologist is a leading publication that showcases exceptional and groundbreaking research in plant science and its practical applications. With a focus on five distinct sections - Physiology & Development, Environment, Interaction, Evolution, and Transformative Plant Biotechnology - the journal covers a wide array of topics ranging from cellular processes to the impact of global environmental changes. We encourage the use of interdisciplinary approaches, and our content is structured to reflect this. Our journal acknowledges the diverse techniques employed in plant science, including molecular and cell biology, functional genomics, modeling, and system-based approaches, across various subfields.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信