Pre-Harvest Corn Grain Moisture Estimation Using Aerial Multispectral Imagery and Machine Learning Techniques

IF 3.2 2区 环境科学与生态学 Q2 ENVIRONMENTAL STUDIES
Land Pub Date : 2023-12-18 DOI:10.3390/land12122188
Pius Jjagwe, A. Chandel, David Langston
{"title":"Pre-Harvest Corn Grain Moisture Estimation Using Aerial Multispectral Imagery and Machine Learning Techniques","authors":"Pius Jjagwe, A. Chandel, David Langston","doi":"10.3390/land12122188","DOIUrl":null,"url":null,"abstract":"Corn grain moisture (CGM) is critical to estimate grain maturity status and schedule harvest. Traditional methods for determining CGM range from manual scouting, destructive laboratory analyses, and weather-based dry down estimates. Such methods are either time consuming, expensive, spatially inaccurate, or subjective, therefore they are prone to errors or limitations. Realizing that precision harvest management could be critical for extracting the maximum crop value, this study evaluates the estimation of CGM at a pre-harvest stage using high-resolution (1.3 cm/pixel) multispectral imagery and machine learning techniques. Aerial imagery data were collected in the 2022 cropping season over 116 experimental corn planted plots. A total of 24 vegetation indices (VIs) were derived from imagery data along with reflectance (REF) information in the blue, green, red, red-edge, and near-infrared imaging spectrum that was initially evaluated for inter-correlations as well as subject to principal component analysis (PCA). VIs including the Green Normalized Difference Index (GNDVI), Green Chlorophyll Index (GCI), Infrared Percentage Vegetation Index (IPVI), Simple Ratio Index (SR), Normalized Difference Red-Edge Index (NDRE), and Visible Atmospherically Resistant Index (VARI) had the highest correlations with CGM (r: 0.68–0.80). Next, two state-of-the-art statistical and four machine learning (ML) models (Stepwise Linear Regression (SLR), Partial Least Squares Regression (PLSR), Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF), and K-nearest neighbor (KNN)), and their 120 derivates (six ML models × two input groups (REFs and REFs+VIs) × 10 train–test data split ratios (starting 50:50)) were formulated and evaluated for CGM estimation. The CGM estimation accuracy was impacted by the ML model and train-test data split ratio. However, the impact was not significant for the input groups. For validation over the train and entire dataset, RF performed the best at a 95:5 split ratio, and REFs+VIs as the input variables (rtrain: 0.97, rRMSEtrain: 1.17%, rentire: 0.95, rRMSEentire: 1.37%). However, when validated for the test dataset, an increase in the train–test split ratio decreased the performances of the other ML models where SVM performed the best at a 50:50 split ratio (r = 0.70, rRMSE = 2.58%) and with REFs+VIs as the input variables. The 95:5 train–test ratio showed the best performance across all the models, which may be a suitable ratio for relatively smaller or medium-sized datasets. RF was identified to be the most stable and consistent ML model (r: 0.95, rRMSE: 1.37%). Findings in the study indicate that the integration of aerial remote sensing and ML-based data-run techniques could be useful for reliably predicting CGM at the pre-harvest stage, and developing precision corn harvest scheduling and management strategies for the growers.","PeriodicalId":37702,"journal":{"name":"Land","volume":" 33","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Land","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.3390/land12122188","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
引用次数: 0

Abstract

Corn grain moisture (CGM) is critical to estimate grain maturity status and schedule harvest. Traditional methods for determining CGM range from manual scouting, destructive laboratory analyses, and weather-based dry down estimates. Such methods are either time consuming, expensive, spatially inaccurate, or subjective, therefore they are prone to errors or limitations. Realizing that precision harvest management could be critical for extracting the maximum crop value, this study evaluates the estimation of CGM at a pre-harvest stage using high-resolution (1.3 cm/pixel) multispectral imagery and machine learning techniques. Aerial imagery data were collected in the 2022 cropping season over 116 experimental corn planted plots. A total of 24 vegetation indices (VIs) were derived from imagery data along with reflectance (REF) information in the blue, green, red, red-edge, and near-infrared imaging spectrum that was initially evaluated for inter-correlations as well as subject to principal component analysis (PCA). VIs including the Green Normalized Difference Index (GNDVI), Green Chlorophyll Index (GCI), Infrared Percentage Vegetation Index (IPVI), Simple Ratio Index (SR), Normalized Difference Red-Edge Index (NDRE), and Visible Atmospherically Resistant Index (VARI) had the highest correlations with CGM (r: 0.68–0.80). Next, two state-of-the-art statistical and four machine learning (ML) models (Stepwise Linear Regression (SLR), Partial Least Squares Regression (PLSR), Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF), and K-nearest neighbor (KNN)), and their 120 derivates (six ML models × two input groups (REFs and REFs+VIs) × 10 train–test data split ratios (starting 50:50)) were formulated and evaluated for CGM estimation. The CGM estimation accuracy was impacted by the ML model and train-test data split ratio. However, the impact was not significant for the input groups. For validation over the train and entire dataset, RF performed the best at a 95:5 split ratio, and REFs+VIs as the input variables (rtrain: 0.97, rRMSEtrain: 1.17%, rentire: 0.95, rRMSEentire: 1.37%). However, when validated for the test dataset, an increase in the train–test split ratio decreased the performances of the other ML models where SVM performed the best at a 50:50 split ratio (r = 0.70, rRMSE = 2.58%) and with REFs+VIs as the input variables. The 95:5 train–test ratio showed the best performance across all the models, which may be a suitable ratio for relatively smaller or medium-sized datasets. RF was identified to be the most stable and consistent ML model (r: 0.95, rRMSE: 1.37%). Findings in the study indicate that the integration of aerial remote sensing and ML-based data-run techniques could be useful for reliably predicting CGM at the pre-harvest stage, and developing precision corn harvest scheduling and management strategies for the growers.
利用航空多光谱成像和机器学习技术进行收获前玉米粒水分估算
玉米籽粒水分(CGM)对于估计籽粒成熟度和安排收割至关重要。确定 CGM 的传统方法包括人工侦察、破坏性实验室分析和基于天气的干度估计。这些方法要么耗时、昂贵,要么在空间上不准确,要么主观臆断,因此很容易出错或受到限制。由于认识到精确的收获管理对于获得作物的最大价值至关重要,本研究利用高分辨率(1.3 厘米/像素)多光谱图像和机器学习技术,对收获前阶段的作物总产量估算进行了评估。该研究在 2022 年种植季收集了 116 块玉米种植实验田的航空图像数据。根据图像数据以及蓝、绿、红、红边和近红外成像光谱中的反射率(REF)信息,共得出 24 种植被指数(VIs),并对这些指数进行了初步的相互关联性评估和主成分分析(PCA)。包括绿色归一化差异指数 (GNDVI)、绿色叶绿素指数 (GCI)、红外百分比植被指数 (IPVI)、简单比率指数 (SR)、归一化差异红边指数 (NDRE) 和可见光抗大气指数 (VARI) 在内的各种指数与 CGM 的相关性最高(r:0.68-0.80)。接下来,研究人员制定了两种最先进的统计模型和四种机器学习(ML)模型(逐步线性回归(SLR)、偏最小二乘法回归(PLSR)、人工神经网络(ANN)、支持向量机(SVM)、随机森林(RF)和 K-近邻(KNN))及其 120 种衍生模型(六种 ML 模型 × 两组输入数据(REFs 和 REFs+VIs ) × 10 种训练-测试数据分割比例(起始比例为 50:50)),并对其进行了评估。CGM 估计精度受 ML 模型和训练-测试数据分割比例的影响。不过,对输入组的影响不大。在对训练数据集和整个数据集进行验证时,RF 在 95:5 的分割比以及 REFs+VIs 作为输入变量时表现最佳(rtrain:0.97,rRMSEtrain:1.17%,rentire:0.95,rRMSEentire:1.37%)。然而,在对测试数据集进行验证时,训练-测试分割比的增加降低了其他 ML 模型的性能,其中 SVM 在 50:50 分割比(r = 0.70,rRMSE = 2.58%)和以 REFs+VIs 作为输入变量时表现最佳。在所有模型中,95:5 的训练-测试比例表现最佳,这可能是相对较小或中等规模数据集的合适比例。RF 被认为是最稳定、最一致的 ML 模型(r:0.95,rRMSE:1.37%)。研究结果表明,将航空遥感和基于 ML 的数据运行技术相结合,有助于在收获前阶段可靠地预测 CGM,并为种植者制定精确的玉米收获调度和管理策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Land
Land ENVIRONMENTAL STUDIES-Nature and Landscape Conservation
CiteScore
4.90
自引率
23.10%
发文量
1927
期刊介绍: Land is an international and cross-disciplinary, peer-reviewed, open access journal of land system science, landscape, soil–sediment–water systems, urban study, land–climate interactions, water–energy–land–food (WELF) nexus, biodiversity research and health nexus, land modelling and data processing, ecosystem services, and multifunctionality and sustainability etc., published monthly online by MDPI. The International Association for Landscape Ecology (IALE), European Land-use Institute (ELI), and Landscape Institute (LI) are affiliated with Land, and their members receive a discount on the article processing charge.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信