Vivek Bhosale , Miguel San Payo , Gabriel Cipriano , António R. Andrade
{"title":"影响里斯本学校通勤交通选择的关键因素——一种机器学习方法","authors":"Vivek Bhosale , Miguel San Payo , Gabriel Cipriano , António R. Andrade","doi":"10.1016/j.cstp.2025.101557","DOIUrl":null,"url":null,"abstract":"<div><div>Understanding the behaviour of students in choosing a transportation mode to school is crucial to promote Active Commuting to School (ACS) and the adoption of healthier lifestyles. Therefore, analysing all types of transportation modes with multiple factors/features is essential, though it can be a challenge in statistical modelling. The main objective of the present study was to determine the factors that contribute to the choice of a particular mode in school transportation, by using Machine Learning (ML) algorithms: Extreme Gradient Boosting (XGB), Random Forest (RF), Decision Tree (DT) and Multinomial Logistic Regression (MNL). Data from the ‘Hands Up’ Survey in Lisbon, Portugal, between 2018 and 2021, with 10 different modes of transportation were analysed. A range of factors including safety around school, socioeconomic status of schools’ parishes, school regime, school grades and the proximity of schools to the different public transportation modes were considered. The algorithms have been compared in terms of accuracy scores. The XGB algorithm shows the best performance (64 % accuracy and 0.33 Macro F1) for multi-class classification, while RT, DT and MNL provide accuracy of 40 %, 37 % and 47 % respectively. Weighted Average Feature Importance (WAFI) have been determined for all variables. For the best-performing algorithm, the XGB, the combination factor of school regime and school grade is the most relevant factor, contributing to around 21.2 % for multi-class classification. WAFI scores for each variable suggest that the proximity of schools to various public transports is an important factor contributing more than 50 % for the predominance of private car in school transportation.</div></div>","PeriodicalId":46989,"journal":{"name":"Case Studies on Transport Policy","volume":"21 ","pages":"Article 101557"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Key factors affecting transportation choices in school commuting in Lisbon – A machine learning approach\",\"authors\":\"Vivek Bhosale , Miguel San Payo , Gabriel Cipriano , António R. Andrade\",\"doi\":\"10.1016/j.cstp.2025.101557\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Understanding the behaviour of students in choosing a transportation mode to school is crucial to promote Active Commuting to School (ACS) and the adoption of healthier lifestyles. Therefore, analysing all types of transportation modes with multiple factors/features is essential, though it can be a challenge in statistical modelling. The main objective of the present study was to determine the factors that contribute to the choice of a particular mode in school transportation, by using Machine Learning (ML) algorithms: Extreme Gradient Boosting (XGB), Random Forest (RF), Decision Tree (DT) and Multinomial Logistic Regression (MNL). Data from the ‘Hands Up’ Survey in Lisbon, Portugal, between 2018 and 2021, with 10 different modes of transportation were analysed. A range of factors including safety around school, socioeconomic status of schools’ parishes, school regime, school grades and the proximity of schools to the different public transportation modes were considered. The algorithms have been compared in terms of accuracy scores. The XGB algorithm shows the best performance (64 % accuracy and 0.33 Macro F1) for multi-class classification, while RT, DT and MNL provide accuracy of 40 %, 37 % and 47 % respectively. Weighted Average Feature Importance (WAFI) have been determined for all variables. For the best-performing algorithm, the XGB, the combination factor of school regime and school grade is the most relevant factor, contributing to around 21.2 % for multi-class classification. WAFI scores for each variable suggest that the proximity of schools to various public transports is an important factor contributing more than 50 % for the predominance of private car in school transportation.</div></div>\",\"PeriodicalId\":46989,\"journal\":{\"name\":\"Case Studies on Transport Policy\",\"volume\":\"21 \",\"pages\":\"Article 101557\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Case Studies on Transport Policy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2213624X25001944\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Case Studies on Transport Policy","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2213624X25001944","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}
Key factors affecting transportation choices in school commuting in Lisbon – A machine learning approach
Understanding the behaviour of students in choosing a transportation mode to school is crucial to promote Active Commuting to School (ACS) and the adoption of healthier lifestyles. Therefore, analysing all types of transportation modes with multiple factors/features is essential, though it can be a challenge in statistical modelling. The main objective of the present study was to determine the factors that contribute to the choice of a particular mode in school transportation, by using Machine Learning (ML) algorithms: Extreme Gradient Boosting (XGB), Random Forest (RF), Decision Tree (DT) and Multinomial Logistic Regression (MNL). Data from the ‘Hands Up’ Survey in Lisbon, Portugal, between 2018 and 2021, with 10 different modes of transportation were analysed. A range of factors including safety around school, socioeconomic status of schools’ parishes, school regime, school grades and the proximity of schools to the different public transportation modes were considered. The algorithms have been compared in terms of accuracy scores. The XGB algorithm shows the best performance (64 % accuracy and 0.33 Macro F1) for multi-class classification, while RT, DT and MNL provide accuracy of 40 %, 37 % and 47 % respectively. Weighted Average Feature Importance (WAFI) have been determined for all variables. For the best-performing algorithm, the XGB, the combination factor of school regime and school grade is the most relevant factor, contributing to around 21.2 % for multi-class classification. WAFI scores for each variable suggest that the proximity of schools to various public transports is an important factor contributing more than 50 % for the predominance of private car in school transportation.