Siwar Saadaoui, David Makowski, Benoît Gabrielle, Thierry Brunelle
{"title":"用数据驱动的全球模型绘制能源作物的产量,包括气候和土壤预测","authors":"Siwar Saadaoui, David Makowski, Benoît Gabrielle, Thierry Brunelle","doi":"10.1111/gcbb.70078","DOIUrl":null,"url":null,"abstract":"<p>Lignocellulosic crops such as Miscanthus, Eucalyptus, Poplar, Willow, and Switchgrass are gaining attention as promising feedstocks for renewable energy and carbon-mitigation strategies, especially on marginal lands. Assessing their global yield potentials requires models that go beyond climate drivers alone. Using a global dataset of 3963 yield observations for five species, we developed a high-resolution (5-arc-minute) modeling framework that augments climate with detailed soil and topographic predictors. Among seven machine learning algorithms, Random Forest, Extra Trees, and Gradient Boosting (GB) emerged as top performers. On an independent test set, the best model achieved a root mean square error (RMSE) of 4.8 t DM ha<sup>−1</sup> year<sup>−1</sup> (across algorithms: 4.7–5.0 t DM ha<sup>−1</sup> year<sup>−1</sup>) and an <span></span><math>\n <semantics>\n <mrow>\n <msup>\n <mi>R</mi>\n <mn>2</mn>\n </msup>\n </mrow>\n <annotation>$$ {R}^2 $$</annotation>\n </semantics></math> of 0.67, a moderate error relative to the broad 4–19 t DM ha<sup>−1</sup> year<sup>−1</sup> spatial yield range. After outlier handling via a two-phase cross-validation procedure, each model was applied globally under current climate and three future scenarios (SSP1-2.6, SSP2-4.5, and SSP5-8.5). Across scenarios (relative to the 1980–2000 baseline), median absolute yield changes over suitable land are modest (ca. 1–2 t DM ha<sup>−1</sup> year<sup>−1</sup>), yet localized hotspots show gains or losses up to 8 t DM ha<sup>−1</sup> year<sup>−1</sup>. Yields most often increase in presently cool, high-latitude areas and decrease in warmer/drier or edaphically constrained low-latitude regions. We additionally provide a “best-crop” map identifying where each species may offer the most favorable balance between yield and production cost, revealing pronounced geographic variation in suitability. Compared with alternative models based on coarser-resolution datasets, our approach generally yields more conservative estimates, likely reflecting the added constraint from soil and topographic predictors. These results underscore the importance of representing local environmental heterogeneity when predicting energy-crop productivity under climate change.</p>","PeriodicalId":55126,"journal":{"name":"Global Change Biology Bioenergy","volume":"17 10","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gcbb.70078","citationCount":"0","resultStr":"{\"title\":\"Mapping Out the Yields of Energy Crops With Data-Driven Global Models, Including Climate and Soil Predictors\",\"authors\":\"Siwar Saadaoui, David Makowski, Benoît Gabrielle, Thierry Brunelle\",\"doi\":\"10.1111/gcbb.70078\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Lignocellulosic crops such as Miscanthus, Eucalyptus, Poplar, Willow, and Switchgrass are gaining attention as promising feedstocks for renewable energy and carbon-mitigation strategies, especially on marginal lands. Assessing their global yield potentials requires models that go beyond climate drivers alone. Using a global dataset of 3963 yield observations for five species, we developed a high-resolution (5-arc-minute) modeling framework that augments climate with detailed soil and topographic predictors. Among seven machine learning algorithms, Random Forest, Extra Trees, and Gradient Boosting (GB) emerged as top performers. On an independent test set, the best model achieved a root mean square error (RMSE) of 4.8 t DM ha<sup>−1</sup> year<sup>−1</sup> (across algorithms: 4.7–5.0 t DM ha<sup>−1</sup> year<sup>−1</sup>) and an <span></span><math>\\n <semantics>\\n <mrow>\\n <msup>\\n <mi>R</mi>\\n <mn>2</mn>\\n </msup>\\n </mrow>\\n <annotation>$$ {R}^2 $$</annotation>\\n </semantics></math> of 0.67, a moderate error relative to the broad 4–19 t DM ha<sup>−1</sup> year<sup>−1</sup> spatial yield range. After outlier handling via a two-phase cross-validation procedure, each model was applied globally under current climate and three future scenarios (SSP1-2.6, SSP2-4.5, and SSP5-8.5). Across scenarios (relative to the 1980–2000 baseline), median absolute yield changes over suitable land are modest (ca. 1–2 t DM ha<sup>−1</sup> year<sup>−1</sup>), yet localized hotspots show gains or losses up to 8 t DM ha<sup>−1</sup> year<sup>−1</sup>. Yields most often increase in presently cool, high-latitude areas and decrease in warmer/drier or edaphically constrained low-latitude regions. We additionally provide a “best-crop” map identifying where each species may offer the most favorable balance between yield and production cost, revealing pronounced geographic variation in suitability. Compared with alternative models based on coarser-resolution datasets, our approach generally yields more conservative estimates, likely reflecting the added constraint from soil and topographic predictors. These results underscore the importance of representing local environmental heterogeneity when predicting energy-crop productivity under climate change.</p>\",\"PeriodicalId\":55126,\"journal\":{\"name\":\"Global Change Biology Bioenergy\",\"volume\":\"17 10\",\"pages\":\"\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gcbb.70078\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Global Change Biology Bioenergy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/gcbb.70078\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Change Biology Bioenergy","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/gcbb.70078","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
摘要
芒草、桉树、杨树、柳树和柳枝稷等木质纤维素作物作为可再生能源和碳减排战略的有前途的原料正受到关注,特别是在边缘土地上。评估它们的全球产量潜力需要超越气候驱动因素的模型。利用全球5个物种的3963个产量观测数据集,我们开发了一个高分辨率(5弧分)建模框架,通过详细的土壤和地形预测来增强气候。在7种机器学习算法中,随机森林、额外树和梯度增强(GB)表现最好。在独立测试集上,最佳模型的均方根误差(RMSE)为4.8 t DM ha−1年−1(跨算法:4.7-5.0 t DM ha−1年−1),r2 $$ {R}^2 $$为0.67,相对于4-19 t DM ha−1年−1的空间产量范围而言,这是一个中等误差。在通过两阶段交叉验证程序处理异常值后,每个模型在当前气候和三种未来情景(SSP1-2.6、SSP2-4.5和SSP5-8.5)下进行了全球应用。在所有情景中(相对于1980-2000年基线),适宜土地的绝对产量变化中位数不大(约为1 - 2 t DM / h−1年−1),但局部热点地区的收益或损失高达8 t DM / h−1年−1。在目前寒冷的高纬度地区,产量通常会增加,而在温暖/干燥或土壤受限的低纬度地区,产量往往会减少。我们还提供了一个“最佳作物”地图,确定每个物种在产量和生产成本之间最有利的平衡,揭示了适合性的显著地理差异。与基于粗分辨率数据集的替代模型相比,我们的方法通常产生更保守的估计,可能反映了土壤和地形预测器的附加约束。这些结果强调了在预测气候变化下能源作物生产力时代表当地环境异质性的重要性。
Mapping Out the Yields of Energy Crops With Data-Driven Global Models, Including Climate and Soil Predictors
Lignocellulosic crops such as Miscanthus, Eucalyptus, Poplar, Willow, and Switchgrass are gaining attention as promising feedstocks for renewable energy and carbon-mitigation strategies, especially on marginal lands. Assessing their global yield potentials requires models that go beyond climate drivers alone. Using a global dataset of 3963 yield observations for five species, we developed a high-resolution (5-arc-minute) modeling framework that augments climate with detailed soil and topographic predictors. Among seven machine learning algorithms, Random Forest, Extra Trees, and Gradient Boosting (GB) emerged as top performers. On an independent test set, the best model achieved a root mean square error (RMSE) of 4.8 t DM ha−1 year−1 (across algorithms: 4.7–5.0 t DM ha−1 year−1) and an of 0.67, a moderate error relative to the broad 4–19 t DM ha−1 year−1 spatial yield range. After outlier handling via a two-phase cross-validation procedure, each model was applied globally under current climate and three future scenarios (SSP1-2.6, SSP2-4.5, and SSP5-8.5). Across scenarios (relative to the 1980–2000 baseline), median absolute yield changes over suitable land are modest (ca. 1–2 t DM ha−1 year−1), yet localized hotspots show gains or losses up to 8 t DM ha−1 year−1. Yields most often increase in presently cool, high-latitude areas and decrease in warmer/drier or edaphically constrained low-latitude regions. We additionally provide a “best-crop” map identifying where each species may offer the most favorable balance between yield and production cost, revealing pronounced geographic variation in suitability. Compared with alternative models based on coarser-resolution datasets, our approach generally yields more conservative estimates, likely reflecting the added constraint from soil and topographic predictors. These results underscore the importance of representing local environmental heterogeneity when predicting energy-crop productivity under climate change.
期刊介绍:
GCB Bioenergy is an international journal publishing original research papers, review articles and commentaries that promote understanding of the interface between biological and environmental sciences and the production of fuels directly from plants, algae and waste. The scope of the journal extends to areas outside of biology to policy forum, socioeconomic analyses, technoeconomic analyses and systems analysis. Papers do not need a global change component for consideration for publication, it is viewed as implicit that most bioenergy will be beneficial in avoiding at least a part of the fossil fuel energy that would otherwise be used.
Key areas covered by the journal:
Bioenergy feedstock and bio-oil production: energy crops and algae their management,, genomics, genetic improvements, planting, harvesting, storage, transportation, integrated logistics, production modeling, composition and its modification, pests, diseases and weeds of feedstocks. Manuscripts concerning alternative energy based on biological mimicry are also encouraged (e.g. artificial photosynthesis).
Biological Residues/Co-products: from agricultural production, forestry and plantations (stover, sugar, bio-plastics, etc.), algae processing industries, and municipal sources (MSW).
Bioenergy and the Environment: ecosystem services, carbon mitigation, land use change, life cycle assessment, energy and greenhouse gas balances, water use, water quality, assessment of sustainability, and biodiversity issues.
Bioenergy Socioeconomics: examining the economic viability or social acceptability of crops, crops systems and their processing, including genetically modified organisms [GMOs], health impacts of bioenergy systems.
Bioenergy Policy: legislative developments affecting biofuels and bioenergy.
Bioenergy Systems Analysis: examining biological developments in a whole systems context.