Cagdas Kara , Abdulkadir Ozden , Arzu Altin Yavuz , Safak Bilgic
{"title":"Modeling and variable selection of healthcare trip behaviours using statistical learning techniques","authors":"Cagdas Kara , Abdulkadir Ozden , Arzu Altin Yavuz , Safak Bilgic","doi":"10.1016/j.tranpol.2025.05.008","DOIUrl":null,"url":null,"abstract":"<div><div>In an increasingly complex and fast-paced world, understanding healthcare-related travel behavior has become a critical challenge in transportation planning. Traditional models, including the Four-Step Transportation Model (FSTM), require further refinements to better capture evolving travel patterns, particularly the growing share of home-based other trips, which include shopping, leisure, and healthcare-related journeys. Among these, healthcare trips require special attention due to the increasing proportion of the aging population and the essential nature of medical accessibility.</div><div>This study aims to identify key variables influencing healthcare-related travel behaviors using variable selection techniques, specifically Least Absolute Shrinkage and Selection Operator (Lasso) and Elastic Net (ENet). The analysis is based on zonal-level aggregated data derived from household-based hospital travel surveys conducted in Eskişehir City. The model incorporates regional socioeconomic variables, including household characteristics, vehicle ownership rates, employment ratios, and age group distributions. Model performance is evaluated using Mean Squared Error (MSE) and Mean Absolute Error (MAE) as key success criteria.</div><div>The results indicate that the ENet model achieved the lowest Mean Squared Error (MSE), reducing the error by approximately 37 % compared to the OLS model, while the Lasso model yielded the lowest Mean Absolute Error (MAE), reflecting a 38 % improvement. Both methods effectively performed variable selection, retaining 10 out of 17 predictors in the final model. Significant variables positively associated with healthcare travel frequency include the proportions of individuals aged 30–49, 50–64, and over 65, as well as family density. These results suggest that household accompaniment patterns and age-related healthcare needs increase the frequency of such trips. In contrast, negative associations were observed between healthcare travel frequency and the share of the 6–17 age group, employment ratio, average number of cars per family, and the number of healthcare centers.</div></div>","PeriodicalId":48378,"journal":{"name":"Transport Policy","volume":"170 ","pages":"Pages 12-23"},"PeriodicalIF":6.3000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transport Policy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0967070X25001878","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
In an increasingly complex and fast-paced world, understanding healthcare-related travel behavior has become a critical challenge in transportation planning. Traditional models, including the Four-Step Transportation Model (FSTM), require further refinements to better capture evolving travel patterns, particularly the growing share of home-based other trips, which include shopping, leisure, and healthcare-related journeys. Among these, healthcare trips require special attention due to the increasing proportion of the aging population and the essential nature of medical accessibility.
This study aims to identify key variables influencing healthcare-related travel behaviors using variable selection techniques, specifically Least Absolute Shrinkage and Selection Operator (Lasso) and Elastic Net (ENet). The analysis is based on zonal-level aggregated data derived from household-based hospital travel surveys conducted in Eskişehir City. The model incorporates regional socioeconomic variables, including household characteristics, vehicle ownership rates, employment ratios, and age group distributions. Model performance is evaluated using Mean Squared Error (MSE) and Mean Absolute Error (MAE) as key success criteria.
The results indicate that the ENet model achieved the lowest Mean Squared Error (MSE), reducing the error by approximately 37 % compared to the OLS model, while the Lasso model yielded the lowest Mean Absolute Error (MAE), reflecting a 38 % improvement. Both methods effectively performed variable selection, retaining 10 out of 17 predictors in the final model. Significant variables positively associated with healthcare travel frequency include the proportions of individuals aged 30–49, 50–64, and over 65, as well as family density. These results suggest that household accompaniment patterns and age-related healthcare needs increase the frequency of such trips. In contrast, negative associations were observed between healthcare travel frequency and the share of the 6–17 age group, employment ratio, average number of cars per family, and the number of healthcare centers.
期刊介绍:
Transport Policy is an international journal aimed at bridging the gap between theory and practice in transport. Its subject areas reflect the concerns of policymakers in government, industry, voluntary organisations and the public at large, providing independent, original and rigorous analysis to understand how policy decisions have been taken, monitor their effects, and suggest how they may be improved. The journal treats the transport sector comprehensively, and in the context of other sectors including energy, housing, industry and planning. All modes are covered: land, sea and air; road and rail; public and private; motorised and non-motorised; passenger and freight.