Explaining drivers of housing prices with nonlinear hedonic regressions

IF 4.9

Machine learning with applications Pub Date : 2025-07-04 DOI:10.1016/j.mlwa.2025.100707

Heng Wan , Pranab K. Roy Chowdhury , Jim Yoon , Parin Bhaduri , Vivek Srikrishnan , David Judi , Brent Daniel

{"title":"Explaining drivers of housing prices with nonlinear hedonic regressions","authors":"Heng Wan , Pranab K. Roy Chowdhury , Jim Yoon , Parin Bhaduri , Vivek Srikrishnan , David Judi , Brent Daniel","doi":"10.1016/j.mlwa.2025.100707","DOIUrl":null,"url":null,"abstract":"<div><div>Housing markets play a critical role in shaping the spatial and demographic evolution of urban areas. Simulating housing price dynamics can enhance projections of future urban development outcomes. However, traditional hedonic regressions for housing prices, which neglect nonlinear interactions among explanatory variables, often exhibit limited predictive performance. While machine learning (ML) methods can provide a more flexible representation of the relationships between predictors, they are often regarded as “black boxes” due to their complexity and lack of transparency. Interpretable ML techniques provide a promising route by combining the flexibility of ML methods with approaches to analyze the relationships between inputs and outputs. In this study, we employ interpretable ML to analyze the patterns driving the housing market in Baltimore, Maryland, USA. We train an Artificial Neural Network (ANN) to predict Baltimore housing prices based on structural characteristics (e.g., home size, number of stories) and locational attributes (e.g., distance to the city center). We then conduct sensitivity and Partial Dependence Plot (PDP) analyses to interpret the fitted ANN model. We find that the ML model achieves higher predictive accuracy and explains 16 % more of housing price variance than a traditional linear regression model. The interpretable ML model also reveals more nuanced and realistic nonlinear relationships between housing sales price and predictors as well as interactive effects underlying Baltimore home price dynamics. For instance, while the linear model indicates a steady housing price increase over time, our interpretable ML model detects a post-2008 decline, with smaller properties experiencing the sharpest drop.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"21 ","pages":"Article 100707"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning with applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666827025000908","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Housing markets play a critical role in shaping the spatial and demographic evolution of urban areas. Simulating housing price dynamics can enhance projections of future urban development outcomes. However, traditional hedonic regressions for housing prices, which neglect nonlinear interactions among explanatory variables, often exhibit limited predictive performance. While machine learning (ML) methods can provide a more flexible representation of the relationships between predictors, they are often regarded as “black boxes” due to their complexity and lack of transparency. Interpretable ML techniques provide a promising route by combining the flexibility of ML methods with approaches to analyze the relationships between inputs and outputs. In this study, we employ interpretable ML to analyze the patterns driving the housing market in Baltimore, Maryland, USA. We train an Artificial Neural Network (ANN) to predict Baltimore housing prices based on structural characteristics (e.g., home size, number of stories) and locational attributes (e.g., distance to the city center). We then conduct sensitivity and Partial Dependence Plot (PDP) analyses to interpret the fitted ANN model. We find that the ML model achieves higher predictive accuracy and explains 16 % more of housing price variance than a traditional linear regression model. The interpretable ML model also reveals more nuanced and realistic nonlinear relationships between housing sales price and predictors as well as interactive effects underlying Baltimore home price dynamics. For instance, while the linear model indicates a steady housing price increase over time, our interpretable ML model detects a post-2008 decline, with smaller properties experiencing the sharpest drop.

查看原文本刊更多论文

用非线性享乐回归解释房价驱动因素

住房市场在塑造城市地区的空间和人口演变方面发挥着关键作用。模拟房价动态可以增强对未来城市发展结果的预测。然而，传统的房价享乐回归，忽略了解释变量之间的非线性相互作用，往往表现出有限的预测性能。虽然机器学习（ML）方法可以提供更灵活的预测器之间关系的表示，但由于其复杂性和缺乏透明度，它们通常被视为“黑盒子”。可解释的机器学习技术通过将机器学习方法的灵活性与分析输入和输出之间关系的方法相结合，提供了一条有前途的途径。在这项研究中，我们使用可解释的机器学习来分析驱动美国马里兰州巴尔的摩住房市场的模式。我们训练了一个人工神经网络（ANN），根据结构特征（例如，房屋大小、楼层数）和位置属性（例如，到市中心的距离）来预测巴尔的摩的房价。然后，我们进行敏感性和部分依赖图（PDP）分析来解释拟合的ANN模型。我们发现，与传统的线性回归模型相比，ML模型实现了更高的预测精度，并解释了16%的房价方差。可解释的ML模型还揭示了住房销售价格和预测者之间更微妙和现实的非线性关系，以及巴尔的摩房价动态背后的互动效应。例如，虽然线性模型表明房价随着时间的推移稳步上涨，但我们的可解释ML模型检测到2008年后的下跌，其中较小的房产经历了最急剧的下跌。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Machine learning with applications Management Science and Operations Research, Artificial Intelligence, Computer Science Applications

自引率

0.00%

发文量

审稿时长

98 days