A novel integrated machine learning and inferential modeling approach to explore non-linear effects of built environment on travel: a three-wave repeated cross-sectional study
{"title":"A novel integrated machine learning and inferential modeling approach to explore non-linear effects of built environment on travel: a three-wave repeated cross-sectional study","authors":"Niaz Mahmud Zafri , Ming Zhang","doi":"10.1016/j.tbs.2025.101146","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates the dynamic effects of the built environment on travel in Austin, Texas, over a 20-year period. Using three waves of household travel surveys from 1997, 2006, and 2017, the research employs a repeated cross-sectional approach to address the limitations of traditional longitudinal and cross-sectional studies, and to more accurately estimate effect sizes. Methodologically, we introduce a novel integration of machine learning and inferential modeling to uncover non-linear relationships and threshold effects of the built environment characteristics on travel. Using Gradient Boosted Decision Trees (GBDT) and Partial Dependence Plots (PDPs), we first identify optimal threshold points in the relationships, which are then incorporated into piecewise multilevel models.</div><div>Findings from the study reveal that the built environment serves as a sustainable tool for managing travel in the long term, contributing 50 % or more to the total feature importance in predicting individual travel—surpassing the combined effects of personal and household characteristics. Improved transit accessibility, enhanced local and regional accessibility, higher population and employment densities, and greater diversity are all associated with significant reductions in travel—particularly within their identified thresholds—though the magnitude of their influence varies across time periods and shows diminishing marginal returns.</div><div>These findings highlight the potential of smart growth policies—such as expanding transit accessibility, promoting high-density and mixed-use development, and discouraging single-use development and peripheral sprawl—as effective strategies to reduce car dependency and manage travel demand. Moreover, the study demonstrates that the proposed integrated approach can effectively capture complex non-linear effects while enhancing flexibility and interpretability, reducing researcher bias, and enabling statistical inference—ultimately providing more robust and policy-relevant insights.</div></div>","PeriodicalId":51534,"journal":{"name":"Travel Behaviour and Society","volume":"42 ","pages":"Article 101146"},"PeriodicalIF":5.7000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Travel Behaviour and Society","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214367X25001644","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0
Abstract
This study investigates the dynamic effects of the built environment on travel in Austin, Texas, over a 20-year period. Using three waves of household travel surveys from 1997, 2006, and 2017, the research employs a repeated cross-sectional approach to address the limitations of traditional longitudinal and cross-sectional studies, and to more accurately estimate effect sizes. Methodologically, we introduce a novel integration of machine learning and inferential modeling to uncover non-linear relationships and threshold effects of the built environment characteristics on travel. Using Gradient Boosted Decision Trees (GBDT) and Partial Dependence Plots (PDPs), we first identify optimal threshold points in the relationships, which are then incorporated into piecewise multilevel models.
Findings from the study reveal that the built environment serves as a sustainable tool for managing travel in the long term, contributing 50 % or more to the total feature importance in predicting individual travel—surpassing the combined effects of personal and household characteristics. Improved transit accessibility, enhanced local and regional accessibility, higher population and employment densities, and greater diversity are all associated with significant reductions in travel—particularly within their identified thresholds—though the magnitude of their influence varies across time periods and shows diminishing marginal returns.
These findings highlight the potential of smart growth policies—such as expanding transit accessibility, promoting high-density and mixed-use development, and discouraging single-use development and peripheral sprawl—as effective strategies to reduce car dependency and manage travel demand. Moreover, the study demonstrates that the proposed integrated approach can effectively capture complex non-linear effects while enhancing flexibility and interpretability, reducing researcher bias, and enabling statistical inference—ultimately providing more robust and policy-relevant insights.
期刊介绍:
Travel Behaviour and Society is an interdisciplinary journal publishing high-quality original papers which report leading edge research in theories, methodologies and applications concerning transportation issues and challenges which involve the social and spatial dimensions. In particular, it provides a discussion forum for major research in travel behaviour, transportation infrastructure, transportation and environmental issues, mobility and social sustainability, transportation geographic information systems (TGIS), transportation and quality of life, transportation data collection and analysis, etc.