利用聚类分析和堆叠集合进行房地产价格预测的混合机器学习模型架构

IF 16.4 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Cihan Çılgın, Hadi Gökçen
{"title":"利用聚类分析和堆叠集合进行房地产价格预测的混合机器学习模型架构","authors":"Cihan Çılgın, Hadi Gökçen","doi":"10.1007/s10614-024-10703-4","DOIUrl":null,"url":null,"abstract":"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction\",\"authors\":\"Cihan Çılgın, Hadi Gökçen\",\"doi\":\"10.1007/s10614-024-10703-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2024-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1007/s10614-024-10703-4\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1007/s10614-024-10703-4","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

人口的增长、科技的飞速发展、生活水平的提高、家庭结构和社会经济结构的变化、城市化进程的高速发展以及租房或购房需求的增加,都使房地产市场不断扩大,也使房地产市场更加活跃。房地产市场的这种激烈活动也成正比地加速了房地产价格预测研究。本研究的目的是提出一种模型架构,通过使用混合方法,通过聚类模型作为初步方法,实现对房地产当前市场价值的高精度预测,从而通过使用多种机器学习方法的堆叠集合实现更高的同质性。为了获得同质性更高的子市场,首先将收集到的数据集按照房间数量进行分组,然后通过聚类分析将每个组划分为若干个聚类。通过这种方法,可以获得更多同质的子市场,并提高预测的准确性。然后,针对每个确定的子市场,使用五倍交叉验证对 13 个不同的弱学习器进行训练。对每个弱学习器分别进行了特征选择和参数优化。然后,根据结果最佳的特征和参数集获得的预测结果被用于训练元学习器。整个过程的结果是,最终预测结果由错误率最低的元学习器生成。研究结果表明,即使在许多不同的房地产子市场价格波动较大的时期,也能显示出符合国际标准的高预测性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction

A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction

Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Accounts of Chemical Research
Accounts of Chemical Research 化学-化学综合
CiteScore
31.40
自引率
1.10%
发文量
312
审稿时长
2 months
期刊介绍: Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance. Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信