Predictive modeling of canola seed composition in western Canada: Integrating geospatial patterns with early-season insights

IF 6.4 1区 农林科学 Q1 AGRONOMY
Jose L. Rotundo , Dave Charne , Chad Koscielny , Robert H. Gulden , Véronique J. Barthet
{"title":"Predictive modeling of canola seed composition in western Canada: Integrating geospatial patterns with early-season insights","authors":"Jose L. Rotundo ,&nbsp;Dave Charne ,&nbsp;Chad Koscielny ,&nbsp;Robert H. Gulden ,&nbsp;Véronique J. Barthet","doi":"10.1016/j.fcr.2025.110183","DOIUrl":null,"url":null,"abstract":"<div><h3>Context</h3><div>Western Canada is the largest global producer and exporter of canola (Brassica napus L.), a versatile crop used for its nutritious oil and high-protein meal. The industry faces challenges due to year-to-year variability in seed oil and protein concentrations, affecting processing logistics and financials.</div></div><div><h3>Objective</h3><div>This study aimed to identify key temporal and geographical patterns in canola seed composition in Western Canada and to develop a statistical model using weather variables associated to different phenological stages to predict seed composition.</div></div><div><h3>Methods</h3><div>Spring canola samples collected by the Canadian Grain Commission from 2009 to 2022 were analyzed for seed oil and protein concentrations. Weather data from NASA POWER assessed environmental conditions during different crop developmental stages. Temporal trends were modeled using mixed model analysis, and geospatial analysis characterized spatial patterns. Predictive modeling employed Random Forest algorithms using location, soil, and weather variables.</div></div><div><h3>Results</h3><div>From 2009–2022, seed oil concentration decreased by 0.17 % per year, while seed protein and meal protein concentrations increased by 0.19 % per year. Spatial analysis showed significant regional variations. Predictive modeling using Random Forest algorithms demonstrated improved accuracy at higher granularity levels, particularly with early reproductive stage weather data. Not improvement was observed by including weather information from later stages. The model's mean absolute error (MAE) for seed oil prediction at the Western Canada level was 0.41 %, compared to the benchmark of a null naïve model's MAE of 0.92 %. Similar improvements were observed for seed protein and meal protein predictions. Temperature, humidity, and rainfall were key factors influencing seed composition.</div></div><div><h3>Conclusions</h3><div>The study revealed significant spatial and temporal variation in canola seed composition across Western Canada, with early reproductive weather conditions playing a crucial role in determining seed oil and protein concentrations. The integration of geospatial patterns with early-season weather data enabled the development of a predictive model with a mean absolute error of 0.4 %, providing a valuable tool for forecasting seed composition.</div></div><div><h3>Implications</h3><div>The predictive model can aid industry stakeholders in logistical planning by offering early predictions of seed composition. This tool will allow processors and exporters to anticipate variations in seed oil and protein concentrations well before harvest, enabling informed decisions about sourcing, storage, and processing strategies. By forecasting seed composition, stakeholders can optimize operations to meet market demands and quality standards, potentially reducing costs and improving efficiency.</div></div>","PeriodicalId":12143,"journal":{"name":"Field Crops Research","volume":"335 ","pages":"Article 110183"},"PeriodicalIF":6.4000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Field Crops Research","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378429025004484","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

Abstract

Context

Western Canada is the largest global producer and exporter of canola (Brassica napus L.), a versatile crop used for its nutritious oil and high-protein meal. The industry faces challenges due to year-to-year variability in seed oil and protein concentrations, affecting processing logistics and financials.

Objective

This study aimed to identify key temporal and geographical patterns in canola seed composition in Western Canada and to develop a statistical model using weather variables associated to different phenological stages to predict seed composition.

Methods

Spring canola samples collected by the Canadian Grain Commission from 2009 to 2022 were analyzed for seed oil and protein concentrations. Weather data from NASA POWER assessed environmental conditions during different crop developmental stages. Temporal trends were modeled using mixed model analysis, and geospatial analysis characterized spatial patterns. Predictive modeling employed Random Forest algorithms using location, soil, and weather variables.

Results

From 2009–2022, seed oil concentration decreased by 0.17 % per year, while seed protein and meal protein concentrations increased by 0.19 % per year. Spatial analysis showed significant regional variations. Predictive modeling using Random Forest algorithms demonstrated improved accuracy at higher granularity levels, particularly with early reproductive stage weather data. Not improvement was observed by including weather information from later stages. The model's mean absolute error (MAE) for seed oil prediction at the Western Canada level was 0.41 %, compared to the benchmark of a null naïve model's MAE of 0.92 %. Similar improvements were observed for seed protein and meal protein predictions. Temperature, humidity, and rainfall were key factors influencing seed composition.

Conclusions

The study revealed significant spatial and temporal variation in canola seed composition across Western Canada, with early reproductive weather conditions playing a crucial role in determining seed oil and protein concentrations. The integration of geospatial patterns with early-season weather data enabled the development of a predictive model with a mean absolute error of 0.4 %, providing a valuable tool for forecasting seed composition.

Implications

The predictive model can aid industry stakeholders in logistical planning by offering early predictions of seed composition. This tool will allow processors and exporters to anticipate variations in seed oil and protein concentrations well before harvest, enabling informed decisions about sourcing, storage, and processing strategies. By forecasting seed composition, stakeholders can optimize operations to meet market demands and quality standards, potentially reducing costs and improving efficiency.
加拿大西部油菜籽成分的预测建模:整合地理空间模式与早期季节的见解
加拿大西部是全球最大的油菜籽(Brassica napus L.)生产国和出口国,油菜籽是一种多用途作物,用于其营养丰富的油和高蛋白粉。由于种子油和蛋白质浓度的逐年变化,该行业面临着挑战,影响了加工物流和财务。目的研究加拿大西部油菜籽组成的主要时间和地理格局,并利用与不同物候阶段相关的天气变量建立统计模型来预测油菜籽组成。方法对2009 ~ 2022年加拿大粮食委员会采集的春季油菜籽样品进行籽油和蛋白质含量分析。来自NASA POWER的天气数据评估了作物不同发育阶段的环境条件。采用混合模型分析模拟时间趋势,地理空间分析表征空间格局。预测模型采用随机森林算法,使用地点、土壤和天气变量。结果2009-2022年,种子油浓度以每年0.17 %的速度下降,而种子蛋白和粕蛋白浓度以每年0.19 %的速度上升。空间分析显示区域差异显著。使用随机森林算法的预测建模在更高粒度级别上显示出更高的准确性,特别是在生殖早期阶段的天气数据上。即使包括后期的天气信息,也没有观察到任何改善。该模型在加拿大西部水平预测种子油的平均绝对误差(MAE)为0.41 %,而无效naïve模型的基准MAE为0.92 %。对种子蛋白和粗粉蛋白的预测也有类似的改进。温度、湿度和降雨量是影响种子组成的关键因素。结论加拿大西部地区油菜籽成分存在显著的时空差异,早期生殖天气条件对油菜籽油和蛋白质含量起着至关重要的作用。将地理空间模式与早季天气数据相结合,建立了平均绝对误差为0.4 %的预测模型,为预测种子成分提供了一个有价值的工具。该预测模型可以通过提供种子成分的早期预测来帮助行业利益相关者进行物流规划。该工具将使加工商和出口商能够在收获前预测种子油和蛋白质浓度的变化,从而在采购、储存和加工策略方面做出明智的决策。通过预测种子成分,利益相关者可以优化操作以满足市场需求和质量标准,从而有可能降低成本并提高效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Field Crops Research
Field Crops Research 农林科学-农艺学
CiteScore
9.60
自引率
12.10%
发文量
307
审稿时长
46 days
期刊介绍: Field Crops Research is an international journal publishing scientific articles on: √ experimental and modelling research at field, farm and landscape levels on temperate and tropical crops and cropping systems, with a focus on crop ecology and physiology, agronomy, and plant genetics and breeding.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信