Price Prediction and Determination of the Affecting Variables of the Real Estate by Using X-Means Clustering and CART Decision Trees

Sait Can Yucebas, S. Yalpir, Levent Genc, Melike Dogan
{"title":"Price Prediction and Determination of the Affecting Variables of the Real Estate by Using X-Means Clustering and CART Decision Trees","authors":"Sait Can Yucebas, S. Yalpir, Levent Genc, Melike Dogan","doi":"10.3897/jucs.98733","DOIUrl":null,"url":null,"abstract":"The use of machine learning in real estate is quite new. When the working area is large, the factors affecting the price may vary according to the geographical regions and socioeconomic factors. It is thought that the price prediction performance of a model that will reflect these differences will be more successful than a general model. Unsupervised learning methods can be used both to increase performance and to show the variation of different factors affecting the price according to regions. With this aim, a hybrid model of X-Means clustering and CART decision trees was established in this study.  This model successfully learned the geographical and physical variables that affect the price. The prediction performance of the model was compared with the direct capitalization method, which is the gold standard in the domain. The hybrid model has a superior performance over direct capitalization in terms of mean square error, root mean square error and adjusted R-Squared metrics. The scores were 72.86, 0.0057 and 0.978, respectively. The effect of clustering was also examined. Clustering increased the prediction performance by 36%. ","PeriodicalId":124602,"journal":{"name":"JUCS - Journal of Universal Computer Science","volume":"53 7","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JUCS - Journal of Universal Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3897/jucs.98733","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The use of machine learning in real estate is quite new. When the working area is large, the factors affecting the price may vary according to the geographical regions and socioeconomic factors. It is thought that the price prediction performance of a model that will reflect these differences will be more successful than a general model. Unsupervised learning methods can be used both to increase performance and to show the variation of different factors affecting the price according to regions. With this aim, a hybrid model of X-Means clustering and CART decision trees was established in this study.  This model successfully learned the geographical and physical variables that affect the price. The prediction performance of the model was compared with the direct capitalization method, which is the gold standard in the domain. The hybrid model has a superior performance over direct capitalization in terms of mean square error, root mean square error and adjusted R-Squared metrics. The scores were 72.86, 0.0057 and 0.978, respectively. The effect of clustering was also examined. Clustering increased the prediction performance by 36%. 
利用 X-Means 聚类和 CART 决策树预测价格并确定房地产的影响变量
机器学习在房地产领域的应用还是一个新生事物。当工作区域较大时,影响价格的因素可能会因地理区域和社会经济因素而有所不同。人们认为,能反映这些差异的模型的价格预测性能将比一般模型更成功。无监督学习方法既可用于提高性能,也可用于显示影响价格的不同因素在不同地区的差异。为此,本研究建立了 X-Means 聚类和 CART 决策树的混合模型。 该模型成功地学习了影响价格的地理和物理变量。该模型的预测性能与该领域的黄金标准--直接资本化方法进行了比较。就均方误差、均方根误差和调整后 R 平方指标而言,混合模型的性能优于直接资本化方法。得分分别为 72.86、0.0057 和 0.978。此外,还考察了聚类的效果。聚类使预测性能提高了 36%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信