Henry Lamos-Díaz, David Esteban Puentes-Garzón, Diego Alejandro Zárate-Caicedo
{"title":"哥伦比亚桑坦德可可作物产量预测的机器学习模型比较","authors":"Henry Lamos-Díaz, David Esteban Puentes-Garzón, Diego Alejandro Zárate-Caicedo","doi":"10.19053/01211129.v29.n54.2020.10853","DOIUrl":null,"url":null,"abstract":"The identification of influencing factors in crop yield (kg·ha) provides essential information for decision-making processes related to the prediction and improvement of productivity, which gives farmers the opportunity to increase their income. The current study investigates the application of multiple machine learning algorithms for cocoa yield prediction and influencing factors identification. The Support Vector Machines (SVM) and Ensemble Learning Models (Random Forests, Gradient Boosting) are compared with Least Absolute Shrinkage and Selection Operator (LASSO) regression models. The considered predictors were climate conditions, cocoa variety, fertilization level and sun exposition in an experimental crop located in Rionegro, Santander. Results showed that Gradient Boosting is the best prediction alternative with Coefficient of determination (R) = 68%, Mean Absolute Error (MAE) = 13.32, and Root Mean Square Error (RMSE) = 20.41. The crop yield variability is explained mainly by the radiation one month before harvest, the accumulated rainfall on the harvest month, and the temperature one month before harvest. Likewise, the crop yields are evaluated based on the kind of sun exposure, and it was found that radiation one month before harvest is the most 1 Ph. D. Universidad Industrial de Santander (Bucaramanga-Santander, Colombia). hlamos@uis.edu.co. ORCID: 0000-0003-1778-9768 2 M. Sc. Universidad Industrial de Santander (Bucaramanga-Santander, Colombia). david.puentes1@correo.uis.edu.co. ORCID: 0000-0001-8178-2339 3 Ph. D. Corporación Colombiana de Investigación Agropecuaria (Rionegro-Santander, Colombia). dzarate@corpoica.org.co. ORCID: 0000-0001-9630-3927 Comparison Between Machine Learning Models for Yield Forecast in Cocoa Crops in Santander, Colombia Revista Facultad de Ingeniería (Rev. Fac. Ing.) Vol. 29 (54), e10477. 2020. Tunja-Boyacá, Colombia. L-ISSN: 0121-1129, e-ISSN: 2357-5328, DOI: https://doi.org/10.19053/01211129.v29.n54.2020.10853 influential factor in shade-grown plants. On the other hand, rainfall and soil moisture are determining variables in sun-grown plants, which is associated with the water requirements. These results suggest a differentiated management for crops depending on the kind of sun exposure to avoid compromising productivity, since there is no significant difference in the yield of both agricultural managements.","PeriodicalId":21428,"journal":{"name":"Revista Facultad De Ingenieria-universidad De Antioquia","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2020-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Comparison Between Machine Learning Models for Yield Forecast in Cocoa Crops in Santander, Colombia\",\"authors\":\"Henry Lamos-Díaz, David Esteban Puentes-Garzón, Diego Alejandro Zárate-Caicedo\",\"doi\":\"10.19053/01211129.v29.n54.2020.10853\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The identification of influencing factors in crop yield (kg·ha) provides essential information for decision-making processes related to the prediction and improvement of productivity, which gives farmers the opportunity to increase their income. The current study investigates the application of multiple machine learning algorithms for cocoa yield prediction and influencing factors identification. The Support Vector Machines (SVM) and Ensemble Learning Models (Random Forests, Gradient Boosting) are compared with Least Absolute Shrinkage and Selection Operator (LASSO) regression models. The considered predictors were climate conditions, cocoa variety, fertilization level and sun exposition in an experimental crop located in Rionegro, Santander. Results showed that Gradient Boosting is the best prediction alternative with Coefficient of determination (R) = 68%, Mean Absolute Error (MAE) = 13.32, and Root Mean Square Error (RMSE) = 20.41. The crop yield variability is explained mainly by the radiation one month before harvest, the accumulated rainfall on the harvest month, and the temperature one month before harvest. Likewise, the crop yields are evaluated based on the kind of sun exposure, and it was found that radiation one month before harvest is the most 1 Ph. D. Universidad Industrial de Santander (Bucaramanga-Santander, Colombia). hlamos@uis.edu.co. ORCID: 0000-0003-1778-9768 2 M. Sc. Universidad Industrial de Santander (Bucaramanga-Santander, Colombia). david.puentes1@correo.uis.edu.co. ORCID: 0000-0001-8178-2339 3 Ph. D. Corporación Colombiana de Investigación Agropecuaria (Rionegro-Santander, Colombia). dzarate@corpoica.org.co. ORCID: 0000-0001-9630-3927 Comparison Between Machine Learning Models for Yield Forecast in Cocoa Crops in Santander, Colombia Revista Facultad de Ingeniería (Rev. Fac. Ing.) Vol. 29 (54), e10477. 2020. Tunja-Boyacá, Colombia. L-ISSN: 0121-1129, e-ISSN: 2357-5328, DOI: https://doi.org/10.19053/01211129.v29.n54.2020.10853 influential factor in shade-grown plants. On the other hand, rainfall and soil moisture are determining variables in sun-grown plants, which is associated with the water requirements. These results suggest a differentiated management for crops depending on the kind of sun exposure to avoid compromising productivity, since there is no significant difference in the yield of both agricultural managements.\",\"PeriodicalId\":21428,\"journal\":{\"name\":\"Revista Facultad De Ingenieria-universidad De Antioquia\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2020-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Revista Facultad De Ingenieria-universidad De Antioquia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.19053/01211129.v29.n54.2020.10853\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista Facultad De Ingenieria-universidad De Antioquia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.19053/01211129.v29.n54.2020.10853","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Comparison Between Machine Learning Models for Yield Forecast in Cocoa Crops in Santander, Colombia
The identification of influencing factors in crop yield (kg·ha) provides essential information for decision-making processes related to the prediction and improvement of productivity, which gives farmers the opportunity to increase their income. The current study investigates the application of multiple machine learning algorithms for cocoa yield prediction and influencing factors identification. The Support Vector Machines (SVM) and Ensemble Learning Models (Random Forests, Gradient Boosting) are compared with Least Absolute Shrinkage and Selection Operator (LASSO) regression models. The considered predictors were climate conditions, cocoa variety, fertilization level and sun exposition in an experimental crop located in Rionegro, Santander. Results showed that Gradient Boosting is the best prediction alternative with Coefficient of determination (R) = 68%, Mean Absolute Error (MAE) = 13.32, and Root Mean Square Error (RMSE) = 20.41. The crop yield variability is explained mainly by the radiation one month before harvest, the accumulated rainfall on the harvest month, and the temperature one month before harvest. Likewise, the crop yields are evaluated based on the kind of sun exposure, and it was found that radiation one month before harvest is the most 1 Ph. D. Universidad Industrial de Santander (Bucaramanga-Santander, Colombia). hlamos@uis.edu.co. ORCID: 0000-0003-1778-9768 2 M. Sc. Universidad Industrial de Santander (Bucaramanga-Santander, Colombia). david.puentes1@correo.uis.edu.co. ORCID: 0000-0001-8178-2339 3 Ph. D. Corporación Colombiana de Investigación Agropecuaria (Rionegro-Santander, Colombia). dzarate@corpoica.org.co. ORCID: 0000-0001-9630-3927 Comparison Between Machine Learning Models for Yield Forecast in Cocoa Crops in Santander, Colombia Revista Facultad de Ingeniería (Rev. Fac. Ing.) Vol. 29 (54), e10477. 2020. Tunja-Boyacá, Colombia. L-ISSN: 0121-1129, e-ISSN: 2357-5328, DOI: https://doi.org/10.19053/01211129.v29.n54.2020.10853 influential factor in shade-grown plants. On the other hand, rainfall and soil moisture are determining variables in sun-grown plants, which is associated with the water requirements. These results suggest a differentiated management for crops depending on the kind of sun exposure to avoid compromising productivity, since there is no significant difference in the yield of both agricultural managements.
期刊介绍:
Revista Facultad de Ingenieria started in 1984 and is a publication of the School of Engineering at the University of Antioquia.
The main objective of the journal is to promote and stimulate the publishing of national and international scientific research results. The journal publishes original articles, resulting from scientific research, experimental and or simulation studies in engineering sciences, technology, and similar disciplines (Electronics, Telecommunications, Bioengineering, Biotechnology, Electrical, Computer Science, Mechanical, Chemical, Environmental, Materials, Sanitary, Civil and Industrial Engineering).
In exceptional cases, the journal will publish insightful articles related to current important subjects, or revision articles representing a significant contribution to the contextualization of the state of the art in a known relevant topic. Case reports will only be published when those cases are related to studies in which the validity of a methodology is being proven for the first time, or when a significant contribution to the knowledge of an unexplored system can be proven.
All published articles have undergone a peer review process, carried out by experts recognized for their knowledge and contributions to the relevant field.
To adapt the Journal to international standards and to promote the visibility of the published articles; and therefore, to have a greater impact in the global academic community, after November 1st 2013, the journal will accept only manuscripts written in English for reviewing and publication.
Revista Facultad de Ingeniería –redin is entirely financed by University of Antioquia
Since 2015, every article accepted for publication in the journal is assigned a DOI number.