Applying C atboost Regression Model for Prediction of House Prices

Rafea. M. Almejrb, O. Sallabi, A. Mohamed
{"title":"Applying C atboost Regression Model for Prediction of House Prices","authors":"Rafea. M. Almejrb, O. Sallabi, A. Mohamed","doi":"10.1109/ICEMIS56295.2022.9914345","DOIUrl":null,"url":null,"abstract":"Every year, the cost of houses rises, demanding the development of a method to forecast home prices in the future. This paper aims to apply the Catboost regression model by changing the iteration and learning rate. A feature and house value dataset for King County, Washington, is used. During the pre-processing of the data, extreme values are winsorized, and features with a high degree of correlation are removed. There are twelve possible models, including Catboost, RandomForestRegressor, and KNeighborsRegressor. Several metrics are used to assess them, such as Mean Squared Error (MSE), R-squared score, Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE) (MSE). The model with a low RMSE of 0.013166 performs well compared to other research, especially in the test set where R2 is 0.915256. In this study, Catboost is the model that performs the best overall and can be used to estimate home prices. The most significant factors affecting property prices are location, living area, and house condition. It is confirmed that the conclusions in this research are consistent with real-world experience after comparing and contrasting with other works.","PeriodicalId":191284,"journal":{"name":"2022 International Conference on Engineering & MIS (ICEMIS)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Engineering & MIS (ICEMIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEMIS56295.2022.9914345","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Every year, the cost of houses rises, demanding the development of a method to forecast home prices in the future. This paper aims to apply the Catboost regression model by changing the iteration and learning rate. A feature and house value dataset for King County, Washington, is used. During the pre-processing of the data, extreme values are winsorized, and features with a high degree of correlation are removed. There are twelve possible models, including Catboost, RandomForestRegressor, and KNeighborsRegressor. Several metrics are used to assess them, such as Mean Squared Error (MSE), R-squared score, Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE) (MSE). The model with a low RMSE of 0.013166 performs well compared to other research, especially in the test set where R2 is 0.915256. In this study, Catboost is the model that performs the best overall and can be used to estimate home prices. The most significant factors affecting property prices are location, living area, and house condition. It is confirmed that the conclusions in this research are consistent with real-world experience after comparing and contrasting with other works.
应用C - boost回归模型预测房价
房价每年都在上涨,这就要求开发一种预测未来房价的方法。本文旨在通过改变迭代和学习率来应用Catboost回归模型。本文使用了华盛顿州金县的特征和房屋价值数据集。在数据预处理过程中,对极值进行去噪处理,去除相关度较高的特征。有12种可能的模型,包括Catboost、RandomForestRegressor和KNeighborsRegressor。使用几个指标来评估它们,如均方误差(MSE), r平方分数,平均绝对误差(MAE)和均方根误差(RMSE) (MSE)。与其他研究相比,RMSE为0.013166的模型表现较好,特别是在R2为0.915256的测试集中。在本研究中,Catboost是整体表现最好的模型,可以用来估计房价。影响房价最重要的因素是地段、居住面积和房屋条件。通过与其他研究的对比,证实了本研究的结论与现实经验是一致的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信