{"title":"基于机器学习算法的农业用地价格预测","authors":"Maksym Butenko, Volodymyr Pavlenko","doi":"10.36994/2788-5518-2023-01-05-19","DOIUrl":null,"url":null,"abstract":"The paper considers the prediction of land value using LightGBM, Fast Tree, and Fast Forest machine learning methods. The training dataset is collected from the Internet and contains of 800 rows (in some cases there are data gaps for certain attributes). An overview of the problems of collecting such data is made. From the full dataset (D1), three additional ones were created: without data on land rent price (D2), without data on normative monetary assessment (NMA) (D3) and without data on rent price and NMA (D4). Using the LightGBM method has the best prediction results, but in some cases Fast Tree’s prediction quality is similar to LightGBM. Removing NMA data from the dataset improves the quality of prediction, because after calculations it turned out that there is no correlation between market value and NMA. Also, unlike real estate value prediction, gradient boosting methods provides better results. A correct prediction is defined as a prediction with no more than 10% error, which follow similar approaches in other researches. Depending on the dataset and the chosen method of machine learning, the quality of prediction ranges from 35% to 92% of correct predictions. The paper describes research’s limitations and possible ways to improve the quality of land price predictions for reviewed machine learning methods.","PeriodicalId":165726,"journal":{"name":"Інфокомунікаційні та комп’ютерні технології","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AGRICULTURAL LAND PRICE PREDICTION USING MACHINE LEARNING ALGORITHMS\",\"authors\":\"Maksym Butenko, Volodymyr Pavlenko\",\"doi\":\"10.36994/2788-5518-2023-01-05-19\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper considers the prediction of land value using LightGBM, Fast Tree, and Fast Forest machine learning methods. The training dataset is collected from the Internet and contains of 800 rows (in some cases there are data gaps for certain attributes). An overview of the problems of collecting such data is made. From the full dataset (D1), three additional ones were created: without data on land rent price (D2), without data on normative monetary assessment (NMA) (D3) and without data on rent price and NMA (D4). Using the LightGBM method has the best prediction results, but in some cases Fast Tree’s prediction quality is similar to LightGBM. Removing NMA data from the dataset improves the quality of prediction, because after calculations it turned out that there is no correlation between market value and NMA. Also, unlike real estate value prediction, gradient boosting methods provides better results. A correct prediction is defined as a prediction with no more than 10% error, which follow similar approaches in other researches. Depending on the dataset and the chosen method of machine learning, the quality of prediction ranges from 35% to 92% of correct predictions. The paper describes research’s limitations and possible ways to improve the quality of land price predictions for reviewed machine learning methods.\",\"PeriodicalId\":165726,\"journal\":{\"name\":\"Інфокомунікаційні та комп’ютерні технології\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Інфокомунікаційні та комп’ютерні технології\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36994/2788-5518-2023-01-05-19\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Інфокомунікаційні та комп’ютерні технології","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36994/2788-5518-2023-01-05-19","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
AGRICULTURAL LAND PRICE PREDICTION USING MACHINE LEARNING ALGORITHMS
The paper considers the prediction of land value using LightGBM, Fast Tree, and Fast Forest machine learning methods. The training dataset is collected from the Internet and contains of 800 rows (in some cases there are data gaps for certain attributes). An overview of the problems of collecting such data is made. From the full dataset (D1), three additional ones were created: without data on land rent price (D2), without data on normative monetary assessment (NMA) (D3) and without data on rent price and NMA (D4). Using the LightGBM method has the best prediction results, but in some cases Fast Tree’s prediction quality is similar to LightGBM. Removing NMA data from the dataset improves the quality of prediction, because after calculations it turned out that there is no correlation between market value and NMA. Also, unlike real estate value prediction, gradient boosting methods provides better results. A correct prediction is defined as a prediction with no more than 10% error, which follow similar approaches in other researches. Depending on the dataset and the chosen method of machine learning, the quality of prediction ranges from 35% to 92% of correct predictions. The paper describes research’s limitations and possible ways to improve the quality of land price predictions for reviewed machine learning methods.