基于机器学习的多光谱图像地块级甘蔗产量估算——以澳大利亚Bundaberg为例

IF 7.7 Q1 AGRICULTURE, MULTIDISCIPLINARY
Sharareh Akbarian , Mostafa Rahimi Jamnani , Chengyuan Xu , Weijin Wang , Samsung Lim
{"title":"基于机器学习的多光谱图像地块级甘蔗产量估算——以澳大利亚Bundaberg为例","authors":"Sharareh Akbarian ,&nbsp;Mostafa Rahimi Jamnani ,&nbsp;Chengyuan Xu ,&nbsp;Weijin Wang ,&nbsp;Samsung Lim","doi":"10.1016/j.inpa.2023.06.004","DOIUrl":null,"url":null,"abstract":"<div><div>Early crop yield prediction provides critical information for Precision Agriculture (PA) procedures, policymaking, and food security. The availability of Remote Sensing (RS) datasets and Machine Learning (ML) approaches improved the prediction of sugarcane crop yield on the local and global scales, but an additional effort on the plot scale prediction is required. Challenges for plot-level prediction include a high ratooning capacity of the sugarcane crop, the lack of high spatial resolution data during the critical growth stages, and the non-linear complexation of yield data. The principal objective of the study is to analyse the potential of a time series of high-resolution multispectral Unmanned Aerial Vehicle (UAV) imagery along with three advanced ML techniques, namely Random Forest Regression (RFR), Support Vector Regression (SVR), and Nonlinear Autoregressive Exogenous Artificial Neural Network (NARX ANN) as a solution to the plot-level sugarcane yield prediction. An experimental sugarcane field containing 48 plots was selected, and UAV imagery was collected during the three consecutive cropping seasons' early and middle crop growth stages. Each dataset per growth stage was analyzed separately to predict the sugarcane crop yield in an attempt to discover how early the prediction of pre-harvest yield can be achieved. The datasets of the first two cropping seasons were trained and tested using the three ML techniques, utilizing 10-fold cross-validation to avoid overfitting. The third cropping season dataset was then used to evaluate the reliability of the developed prediction models. The results show that the correlation of Vegetation Indices (VIs) with crop yield in the middle stage outperforms the early stage in all three ML models. Moreover, comparing these models indicates that the NARX ANN method outperformed the others in the middle stage with the highest correlation coefficient (R<sup>2</sup>) of 0.96 and the lowest Root Mean Square Error (RMSE) of 4.92 t/ha. It was followed by the SVR (R<sup>2</sup> = 0.52, RMSE of 14.85 t/ha), which performed similarly to the RFR method (R<sup>2</sup> = 0.48, RMSE = 11.20 t/ha). In conclusion, the best-suited model for predicting sugarcane yields during the middle growth stage is a NARX ANN model employing the Normalized Difference RedEdge (NDRE), which demonstrates the feasibility of the ML approaches to predict the plot level sugarcane yield at a specific period of growth as they are less sensitive to the inconsistency of data collection times.</div></div>","PeriodicalId":53443,"journal":{"name":"Information Processing in Agriculture","volume":"11 4","pages":"Pages 476-487"},"PeriodicalIF":7.7000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Plot level sugarcane yield estimation by machine learning on multispectral images: A case study of Bundaberg, Australia\",\"authors\":\"Sharareh Akbarian ,&nbsp;Mostafa Rahimi Jamnani ,&nbsp;Chengyuan Xu ,&nbsp;Weijin Wang ,&nbsp;Samsung Lim\",\"doi\":\"10.1016/j.inpa.2023.06.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Early crop yield prediction provides critical information for Precision Agriculture (PA) procedures, policymaking, and food security. The availability of Remote Sensing (RS) datasets and Machine Learning (ML) approaches improved the prediction of sugarcane crop yield on the local and global scales, but an additional effort on the plot scale prediction is required. Challenges for plot-level prediction include a high ratooning capacity of the sugarcane crop, the lack of high spatial resolution data during the critical growth stages, and the non-linear complexation of yield data. The principal objective of the study is to analyse the potential of a time series of high-resolution multispectral Unmanned Aerial Vehicle (UAV) imagery along with three advanced ML techniques, namely Random Forest Regression (RFR), Support Vector Regression (SVR), and Nonlinear Autoregressive Exogenous Artificial Neural Network (NARX ANN) as a solution to the plot-level sugarcane yield prediction. An experimental sugarcane field containing 48 plots was selected, and UAV imagery was collected during the three consecutive cropping seasons' early and middle crop growth stages. Each dataset per growth stage was analyzed separately to predict the sugarcane crop yield in an attempt to discover how early the prediction of pre-harvest yield can be achieved. The datasets of the first two cropping seasons were trained and tested using the three ML techniques, utilizing 10-fold cross-validation to avoid overfitting. The third cropping season dataset was then used to evaluate the reliability of the developed prediction models. The results show that the correlation of Vegetation Indices (VIs) with crop yield in the middle stage outperforms the early stage in all three ML models. Moreover, comparing these models indicates that the NARX ANN method outperformed the others in the middle stage with the highest correlation coefficient (R<sup>2</sup>) of 0.96 and the lowest Root Mean Square Error (RMSE) of 4.92 t/ha. It was followed by the SVR (R<sup>2</sup> = 0.52, RMSE of 14.85 t/ha), which performed similarly to the RFR method (R<sup>2</sup> = 0.48, RMSE = 11.20 t/ha). In conclusion, the best-suited model for predicting sugarcane yields during the middle growth stage is a NARX ANN model employing the Normalized Difference RedEdge (NDRE), which demonstrates the feasibility of the ML approaches to predict the plot level sugarcane yield at a specific period of growth as they are less sensitive to the inconsistency of data collection times.</div></div>\",\"PeriodicalId\":53443,\"journal\":{\"name\":\"Information Processing in Agriculture\",\"volume\":\"11 4\",\"pages\":\"Pages 476-487\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing in Agriculture\",\"FirstCategoryId\":\"1091\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2214317323000574\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing in Agriculture","FirstCategoryId":"1091","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214317323000574","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

作物早期产量预测为精准农业(PA)程序、政策制定和粮食安全提供关键信息。遥感(RS)数据集和机器学习(ML)方法的可用性改善了甘蔗作物在局部和全球尺度上的产量预测,但需要在地块尺度上进行额外的预测。甘蔗田级预测面临的挑战包括甘蔗作物的高再生能力、关键生育期缺乏高空间分辨率数据以及产量数据的非线性复杂性。该研究的主要目的是分析高分辨率多光谱无人机(UAV)时间序列图像的潜力,以及三种先进的机器学习技术,即随机森林回归(RFR),支持向量回归(SVR)和非线性自回归外源性人工神经网络(NARX ANN)作为地块水平甘蔗产量预测的解决方案。选取48块甘蔗试验田,在连续3个种植季作物生长早中期采集无人机影像。对每个生长阶段的每个数据集进行单独分析,以预测甘蔗作物产量,试图发现收获前产量的预测可以多早实现。前两个种植季节的数据集使用三种ML技术进行训练和测试,使用10倍交叉验证以避免过拟合。然后利用第三个种植季数据来评估所建立的预测模型的可靠性。结果表明,三种ML模型的中期植被指数(VIs)与作物产量的相关性均优于早期。结果表明,NARX神经网络方法在中期表现优于其他方法,相关系数(R2)最高,为0.96,均方根误差(RMSE)最低,为4.92 t/ha。其次是SVR (R2 = 0.52, RMSE = 14.85 t/ha),其结果与RFR方法相似(R2 = 0.48, RMSE = 11.20 t/ha)。综上所述,最适合预测甘蔗生长中期产量的模型是采用归一化差分差分(NDRE)的NARX神经网络模型,这表明ML方法对数据采集时间不一致的敏感性较低,可以预测特定生长时期地块水平的甘蔗产量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Plot level sugarcane yield estimation by machine learning on multispectral images: A case study of Bundaberg, Australia
Early crop yield prediction provides critical information for Precision Agriculture (PA) procedures, policymaking, and food security. The availability of Remote Sensing (RS) datasets and Machine Learning (ML) approaches improved the prediction of sugarcane crop yield on the local and global scales, but an additional effort on the plot scale prediction is required. Challenges for plot-level prediction include a high ratooning capacity of the sugarcane crop, the lack of high spatial resolution data during the critical growth stages, and the non-linear complexation of yield data. The principal objective of the study is to analyse the potential of a time series of high-resolution multispectral Unmanned Aerial Vehicle (UAV) imagery along with three advanced ML techniques, namely Random Forest Regression (RFR), Support Vector Regression (SVR), and Nonlinear Autoregressive Exogenous Artificial Neural Network (NARX ANN) as a solution to the plot-level sugarcane yield prediction. An experimental sugarcane field containing 48 plots was selected, and UAV imagery was collected during the three consecutive cropping seasons' early and middle crop growth stages. Each dataset per growth stage was analyzed separately to predict the sugarcane crop yield in an attempt to discover how early the prediction of pre-harvest yield can be achieved. The datasets of the first two cropping seasons were trained and tested using the three ML techniques, utilizing 10-fold cross-validation to avoid overfitting. The third cropping season dataset was then used to evaluate the reliability of the developed prediction models. The results show that the correlation of Vegetation Indices (VIs) with crop yield in the middle stage outperforms the early stage in all three ML models. Moreover, comparing these models indicates that the NARX ANN method outperformed the others in the middle stage with the highest correlation coefficient (R2) of 0.96 and the lowest Root Mean Square Error (RMSE) of 4.92 t/ha. It was followed by the SVR (R2 = 0.52, RMSE of 14.85 t/ha), which performed similarly to the RFR method (R2 = 0.48, RMSE = 11.20 t/ha). In conclusion, the best-suited model for predicting sugarcane yields during the middle growth stage is a NARX ANN model employing the Normalized Difference RedEdge (NDRE), which demonstrates the feasibility of the ML approaches to predict the plot level sugarcane yield at a specific period of growth as they are less sensitive to the inconsistency of data collection times.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Processing in Agriculture
Information Processing in Agriculture Agricultural and Biological Sciences-Animal Science and Zoology
CiteScore
21.10
自引率
0.00%
发文量
80
期刊介绍: Information Processing in Agriculture (IPA) was established in 2013 and it encourages the development towards a science and technology of information processing in agriculture, through the following aims: • Promote the use of knowledge and methods from the information processing technologies in the agriculture; • Illustrate the experiences and publications of the institutes, universities and government, and also the profitable technologies on agriculture; • Provide opportunities and platform for exchanging knowledge, strategies and experiences among the researchers in information processing worldwide; • Promote and encourage interactions among agriculture Scientists, Meteorologists, Biologists (Pathologists/Entomologists) with IT Professionals and other stakeholders to develop and implement methods, techniques, tools, and issues related to information processing technology in agriculture; • Create and promote expert groups for development of agro-meteorological databases, crop and livestock modelling and applications for development of crop performance based decision support system. Topics of interest include, but are not limited to: • Smart Sensor and Wireless Sensor Network • Remote Sensing • Simulation, Optimization, Modeling and Automatic Control • Decision Support Systems, Intelligent Systems and Artificial Intelligence • Computer Vision and Image Processing • Inspection and Traceability for Food Quality • Precision Agriculture and Intelligent Instrument • The Internet of Things and Cloud Computing • Big Data and Data Mining
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信