Predictability of Belgian residential real estate rents using tree-based ML models and IML techniques

IF 1.5 Q3 URBAN STUDIES
Ian Lenaers, Kris Boudt, Lieven De Moor
{"title":"Predictability of Belgian residential real estate rents using tree-based ML models and IML techniques","authors":"Ian Lenaers, Kris Boudt, Lieven De Moor","doi":"10.1108/ijhma-11-2022-0172","DOIUrl":null,"url":null,"abstract":"\nPurpose\nThe purpose is twofold. First, this study aims to establish that black box tree-based machine learning (ML) models have better predictive performance than a standard linear regression (LR) hedonic model for rent prediction. Second, it shows the added value of analyzing tree-based ML models with interpretable machine learning (IML) techniques.\n\n\nDesign/methodology/approach\nData on Belgian residential rental properties were collected. Tree-based ML models, random forest regression and eXtreme gradient boosting regression were applied to derive rent prediction models to compare predictive performance with a LR model. Interpretations of the tree-based models regarding important factors in predicting rent were made using SHapley Additive exPlanations (SHAP) feature importance (FI) plots and SHAP summary plots.\n\n\nFindings\nResults indicate that tree-based models perform better than a LR model for Belgian residential rent prediction. The SHAP FI plots agree that asking price, cadastral income, surface livable, number of bedrooms, number of bathrooms and variables measuring the proximity to points of interest are dominant predictors. The direction of relationships between rent and its factors is determined with SHAP summary plots. In addition to linear relationships, it emerges that nonlinear relationships exist.\n\n\nOriginality/value\nRent prediction using ML is relatively less studied than house price prediction. In addition, studying prediction models using IML techniques is relatively new in real estate economics. Moreover, to the best of the authors’ knowledge, this study is the first to derive insights of driving determinants of predicted rents from SHAP FI and SHAP summary plots.\n","PeriodicalId":14136,"journal":{"name":"International Journal of Housing Markets and Analysis","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2023-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Housing Markets and Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ijhma-11-2022-0172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"URBAN STUDIES","Score":null,"Total":0}
引用次数: 1

Abstract

Purpose The purpose is twofold. First, this study aims to establish that black box tree-based machine learning (ML) models have better predictive performance than a standard linear regression (LR) hedonic model for rent prediction. Second, it shows the added value of analyzing tree-based ML models with interpretable machine learning (IML) techniques. Design/methodology/approach Data on Belgian residential rental properties were collected. Tree-based ML models, random forest regression and eXtreme gradient boosting regression were applied to derive rent prediction models to compare predictive performance with a LR model. Interpretations of the tree-based models regarding important factors in predicting rent were made using SHapley Additive exPlanations (SHAP) feature importance (FI) plots and SHAP summary plots. Findings Results indicate that tree-based models perform better than a LR model for Belgian residential rent prediction. The SHAP FI plots agree that asking price, cadastral income, surface livable, number of bedrooms, number of bathrooms and variables measuring the proximity to points of interest are dominant predictors. The direction of relationships between rent and its factors is determined with SHAP summary plots. In addition to linear relationships, it emerges that nonlinear relationships exist. Originality/value Rent prediction using ML is relatively less studied than house price prediction. In addition, studying prediction models using IML techniques is relatively new in real estate economics. Moreover, to the best of the authors’ knowledge, this study is the first to derive insights of driving determinants of predicted rents from SHAP FI and SHAP summary plots.
基于树的ML模型和IML技术对比利时住宅房地产租金的可预测性
目的目的是双重的。首先,本研究旨在证明基于黑箱树的机器学习(ML)模型比标准线性回归(LR)特征模型具有更好的租金预测性能。其次,它展示了使用可解释机器学习(IML)技术分析基于树的ML模型的附加值。设计/方法/方法收集了比利时住宅租赁物业的数据。基于树的ML模型、随机森林回归和极限梯度增强回归被应用于推导租金预测模型,以将预测性能与LR模型进行比较。使用SHapley加性exPlanations(SHAP)特征重要性(FI)图和SHAP汇总图对租金预测中的重要因素的基于树的模型进行了解释。结果表明,在比利时住宅租金预测中,基于树的模型比LR模型表现更好。SHAP FI图一致认为,要价、地籍收入、地表宜居性、卧室数量、浴室数量和测量兴趣点接近程度的变量是主要预测因素。租金及其影响因素之间的关系方向是通过SHAP汇总图确定的。除了线性关系之外,还存在非线性关系。与房价预测相比,使用ML进行的原创/价值租金预测研究相对较少。此外,使用IML技术研究预测模型在房地产经济学中相对较新。此外,据作者所知,本研究首次从SHAP FI和SHAP汇总图中得出预测租金的驱动决定因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.80
自引率
29.40%
发文量
68
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信