k -最近邻回归在玉米产量预测中的应用

Miriam Sitienei, A. Otieno, A. Anapapa
{"title":"k -最近邻回归在玉米产量预测中的应用","authors":"Miriam Sitienei, A. Otieno, A. Anapapa","doi":"10.9734/ajpas/2023/v24i4529","DOIUrl":null,"url":null,"abstract":"Predictive analytics utilizes historical data and knowledge to predict future outcomes and provides a method for evaluating the accuracy and reliability of these forecasts. Artificial Intelligence is a tool of predictive analytics.  AI trains computers to learn human behaviors like learning, judgment, and decision-making while simulating intelligent human behavior using computers and has received a lot of attention in almost all areas of research. Machine learning is a branch of Artificial Intelligence that has been used to solve classification and regression problems. Machine learning advancements have aided in boosting agricultural gains. Yield prediction is one of the agricultural sectors that has embraced machine learning. K Nearest Neighbor (KNN) Regression is a regression algorithm used in machine learning for prediction tasks. KNN Regression is like KNN Classification, except that KNN Regression predicts a constant output value for a given input instead of predicting a class label. The basic idea behind KNN Regression is to find the K nearest neighbors to a given input data point based on a distance metric and then use the average (or weighted average) of the output values of these K neighbors as the predicted output for the input data point. The distance metric used in KNN Regression can vary depending on the data type being analyzed, but common distance metrics include Euclidean distance, Manhattan distance, and Minkowski distance. This paper presents the application of KNN regression in maize yield prediction in Uasin Gishu county, in north rift region of Kenya. Questionnaires were distributed to 900 randomly selected maize farmers across the thirty wards to obtain primary data. With a Train Test split ration of 80:20, KNN regression algorithm was able to predict maize yield and its prediction performance was evaluated using Root Mean Squared error-RMSE=0.4948, Mean Squared Error-MSE =0.2803, Mean Absolute Error-MAE = 0.4591 and Mean Absolute Percentage Error-MAPE = 36.17. According to the study findings, the algorithm was able to predict maize yield in the maize producing county.","PeriodicalId":8532,"journal":{"name":"Asian Journal of Probability and Statistics","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Application of K-Nearest-Neighbor Regression in Maize Yield Prediction\",\"authors\":\"Miriam Sitienei, A. Otieno, A. Anapapa\",\"doi\":\"10.9734/ajpas/2023/v24i4529\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predictive analytics utilizes historical data and knowledge to predict future outcomes and provides a method for evaluating the accuracy and reliability of these forecasts. Artificial Intelligence is a tool of predictive analytics.  AI trains computers to learn human behaviors like learning, judgment, and decision-making while simulating intelligent human behavior using computers and has received a lot of attention in almost all areas of research. Machine learning is a branch of Artificial Intelligence that has been used to solve classification and regression problems. Machine learning advancements have aided in boosting agricultural gains. Yield prediction is one of the agricultural sectors that has embraced machine learning. K Nearest Neighbor (KNN) Regression is a regression algorithm used in machine learning for prediction tasks. KNN Regression is like KNN Classification, except that KNN Regression predicts a constant output value for a given input instead of predicting a class label. The basic idea behind KNN Regression is to find the K nearest neighbors to a given input data point based on a distance metric and then use the average (or weighted average) of the output values of these K neighbors as the predicted output for the input data point. The distance metric used in KNN Regression can vary depending on the data type being analyzed, but common distance metrics include Euclidean distance, Manhattan distance, and Minkowski distance. This paper presents the application of KNN regression in maize yield prediction in Uasin Gishu county, in north rift region of Kenya. Questionnaires were distributed to 900 randomly selected maize farmers across the thirty wards to obtain primary data. With a Train Test split ration of 80:20, KNN regression algorithm was able to predict maize yield and its prediction performance was evaluated using Root Mean Squared error-RMSE=0.4948, Mean Squared Error-MSE =0.2803, Mean Absolute Error-MAE = 0.4591 and Mean Absolute Percentage Error-MAPE = 36.17. According to the study findings, the algorithm was able to predict maize yield in the maize producing county.\",\"PeriodicalId\":8532,\"journal\":{\"name\":\"Asian Journal of Probability and Statistics\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Probability and Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.9734/ajpas/2023/v24i4529\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Probability and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9734/ajpas/2023/v24i4529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

预测分析利用历史数据和知识来预测未来的结果,并提供一种评估这些预测的准确性和可靠性的方法。人工智能是一种预测分析工具。人工智能训练计算机学习人类的行为,如学习、判断和决策,同时用计算机模拟人类的智能行为,几乎在所有研究领域都受到了广泛关注。机器学习是人工智能的一个分支,已被用于解决分类和回归问题。机器学习的进步有助于提高农业收益。产量预测是采用机器学习的农业领域之一。K最近邻(KNN)回归是一种用于机器学习预测任务的回归算法。KNN回归类似于KNN分类,除了KNN回归预测给定输入的恒定输出值,而不是预测类标签。KNN回归背后的基本思想是根据距离度量找到给定输入数据点最近的K个邻居,然后使用这K个邻居的输出值的平均值(或加权平均值)作为输入数据点的预测输出。KNN回归中使用的距离度量可以根据所分析的数据类型而变化,但常见的距离度量包括欧几里得距离、曼哈顿距离和闵可夫斯基距离。本文介绍了KNN回归在肯尼亚北部裂谷地区瓦辛吉舒县玉米产量预测中的应用。在30个区随机抽取900名玉米农户进行问卷调查,获取原始数据。在列车检验分割比为80:20的情况下,KNN回归算法能够预测玉米产量,采用均方根误差- rmse =0.4948,均方误差- mse =0.2803,平均绝对误差- mae = 0.4591,平均绝对百分比误差- mape = 36.17对其预测性能进行评价。根据研究结果,该算法能够预测玉米主产区的玉米产量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Application of K-Nearest-Neighbor Regression in Maize Yield Prediction
Predictive analytics utilizes historical data and knowledge to predict future outcomes and provides a method for evaluating the accuracy and reliability of these forecasts. Artificial Intelligence is a tool of predictive analytics.  AI trains computers to learn human behaviors like learning, judgment, and decision-making while simulating intelligent human behavior using computers and has received a lot of attention in almost all areas of research. Machine learning is a branch of Artificial Intelligence that has been used to solve classification and regression problems. Machine learning advancements have aided in boosting agricultural gains. Yield prediction is one of the agricultural sectors that has embraced machine learning. K Nearest Neighbor (KNN) Regression is a regression algorithm used in machine learning for prediction tasks. KNN Regression is like KNN Classification, except that KNN Regression predicts a constant output value for a given input instead of predicting a class label. The basic idea behind KNN Regression is to find the K nearest neighbors to a given input data point based on a distance metric and then use the average (or weighted average) of the output values of these K neighbors as the predicted output for the input data point. The distance metric used in KNN Regression can vary depending on the data type being analyzed, but common distance metrics include Euclidean distance, Manhattan distance, and Minkowski distance. This paper presents the application of KNN regression in maize yield prediction in Uasin Gishu county, in north rift region of Kenya. Questionnaires were distributed to 900 randomly selected maize farmers across the thirty wards to obtain primary data. With a Train Test split ration of 80:20, KNN regression algorithm was able to predict maize yield and its prediction performance was evaluated using Root Mean Squared error-RMSE=0.4948, Mean Squared Error-MSE =0.2803, Mean Absolute Error-MAE = 0.4591 and Mean Absolute Percentage Error-MAPE = 36.17. According to the study findings, the algorithm was able to predict maize yield in the maize producing county.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信