Anders Hjort, J. Pensar, I. Scheel, D. E. Sommervoll
{"title":"House price prediction with gradient boosted trees under different loss functions","authors":"Anders Hjort, J. Pensar, I. Scheel, D. E. Sommervoll","doi":"10.1080/09599916.2022.2070525","DOIUrl":null,"url":null,"abstract":"ABSTRACT Many banks and credit institutions are required to assess the value of dwellings in their mortgage portfolio. This valuation often relies on an Automated Valuation Model (AVM). Moreover, these institutions often report the models accuracy by two numbers: The fraction of predictions within and range from the true values. Until recently, AVMs tended to be hedonic regression models, but lately machine learning approaches like random forest and gradient boosted trees have been increasingly applied. Both the traditional approaches and the machine learning approaches rely on minimising mean squared prediction error, and not the number of predictions in the and range. We investigate whether introducing a loss function closer to the AVMs actual loss measure improves performance in machine learning approaches, specifically for a gradient boosted tree approach. This loss function yields an improvement from to of predictions within of the true value on a data set of transactions from the Norwegian housing market between 2013 and 2015, with the biggest improvements in performance coming from the lower price segments. We also find that a weighted average of models with different loss functions improves performance further, yielding of the observations within of the true value.","PeriodicalId":45726,"journal":{"name":"Journal of Property Research","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Property Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/09599916.2022.2070525","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"URBAN STUDIES","Score":null,"Total":0}
引用次数: 11
Abstract
ABSTRACT Many banks and credit institutions are required to assess the value of dwellings in their mortgage portfolio. This valuation often relies on an Automated Valuation Model (AVM). Moreover, these institutions often report the models accuracy by two numbers: The fraction of predictions within and range from the true values. Until recently, AVMs tended to be hedonic regression models, but lately machine learning approaches like random forest and gradient boosted trees have been increasingly applied. Both the traditional approaches and the machine learning approaches rely on minimising mean squared prediction error, and not the number of predictions in the and range. We investigate whether introducing a loss function closer to the AVMs actual loss measure improves performance in machine learning approaches, specifically for a gradient boosted tree approach. This loss function yields an improvement from to of predictions within of the true value on a data set of transactions from the Norwegian housing market between 2013 and 2015, with the biggest improvements in performance coming from the lower price segments. We also find that a weighted average of models with different loss functions improves performance further, yielding of the observations within of the true value.
期刊介绍:
The Journal of Property Research is an international journal. The title reflects the expansion of research, particularly applied research, into property investment and development. The Journal of Property Research publishes papers in any area of real estate investment and development. These may be theoretical, empirical, case studies or critical literature surveys.