Mohamad Ali Khalil, Mahmudur Rahman Fatmi, Muntahith Orvin
{"title":"为综合城市模型开发和微观模拟人口动态:逻辑回归与机器学习技术的比较","authors":"Mohamad Ali Khalil, Mahmudur Rahman Fatmi, Muntahith Orvin","doi":"10.1007/s11116-024-10468-7","DOIUrl":null,"url":null,"abstract":"<p>Studies have shown that sociodemographic attributes significantly influence individuals' transportation choices. However, not all travel demand models do not account for this effect when predicting future travel scenarios. On the other hand, current integrated urban models (IUMs) that incorporate demographic dynamics mostly rely on conventional logit models and rule-based models. These models may not be optimal for complex modeling since they do not fully capture the non-linear relationship between inputs and output. In this research, we explore the feasibility of utilizing machine learning (ML) models to enhance the prediction of demographic dynamics within our proposed IUM—known as ‘STELARS’, in conjunction with conventional logit models. To address the challenge of the black-box nature of ML, we employ an explainable AI technique (xAI) to gain insights into the influence of the factors and compare them with the interpretation revealed by the logit models. Three demographic components are considered: marriage/common-law formation, separation and divorce, and childbirth events, while other components were developed using rate-based models. The results (on the testing dataset) indicate that ML models outperform conventional logit models in terms of overall accuracy by a margin of up-to 3%. However, when considering the true positive accuracy (correctly predicting the event of interest), a significant improvement of 30–48% is observed. Additionally, the xAI analysis reveals consistent interpretation with the logit model. Subsequently, we implemented our demographic dynamics module within our integrated urban modeling system to predict population changes in the Okanagan region of Canada. The multi-year validation of the simulation results against Census data suggests a reasonably close prediction of the observed population. We also optimize the runtime of the demographic dynamics module using vectorization, reducing the simulation time for the demographic changes in our study area (comprising approximately 200,000 individuals living in 85,000 households) to just about 100 s for the total 10 years of simulation. The development and implementation of this advanced demographic dynamics module to accurately predict the life events of individuals adds a fundamental capacity to the STELARS to be built as an event-based microsimulation model.</p>","PeriodicalId":49419,"journal":{"name":"Transportation","volume":"5 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2024-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing and microsimulating demographic dynamics for an integrated urban model: a comparison between logistic regression and machine learning techniques\",\"authors\":\"Mohamad Ali Khalil, Mahmudur Rahman Fatmi, Muntahith Orvin\",\"doi\":\"10.1007/s11116-024-10468-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Studies have shown that sociodemographic attributes significantly influence individuals' transportation choices. However, not all travel demand models do not account for this effect when predicting future travel scenarios. On the other hand, current integrated urban models (IUMs) that incorporate demographic dynamics mostly rely on conventional logit models and rule-based models. These models may not be optimal for complex modeling since they do not fully capture the non-linear relationship between inputs and output. In this research, we explore the feasibility of utilizing machine learning (ML) models to enhance the prediction of demographic dynamics within our proposed IUM—known as ‘STELARS’, in conjunction with conventional logit models. To address the challenge of the black-box nature of ML, we employ an explainable AI technique (xAI) to gain insights into the influence of the factors and compare them with the interpretation revealed by the logit models. Three demographic components are considered: marriage/common-law formation, separation and divorce, and childbirth events, while other components were developed using rate-based models. The results (on the testing dataset) indicate that ML models outperform conventional logit models in terms of overall accuracy by a margin of up-to 3%. However, when considering the true positive accuracy (correctly predicting the event of interest), a significant improvement of 30–48% is observed. Additionally, the xAI analysis reveals consistent interpretation with the logit model. Subsequently, we implemented our demographic dynamics module within our integrated urban modeling system to predict population changes in the Okanagan region of Canada. The multi-year validation of the simulation results against Census data suggests a reasonably close prediction of the observed population. We also optimize the runtime of the demographic dynamics module using vectorization, reducing the simulation time for the demographic changes in our study area (comprising approximately 200,000 individuals living in 85,000 households) to just about 100 s for the total 10 years of simulation. The development and implementation of this advanced demographic dynamics module to accurately predict the life events of individuals adds a fundamental capacity to the STELARS to be built as an event-based microsimulation model.</p>\",\"PeriodicalId\":49419,\"journal\":{\"name\":\"Transportation\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2024-02-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11116-024-10468-7\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11116-024-10468-7","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Developing and microsimulating demographic dynamics for an integrated urban model: a comparison between logistic regression and machine learning techniques
Studies have shown that sociodemographic attributes significantly influence individuals' transportation choices. However, not all travel demand models do not account for this effect when predicting future travel scenarios. On the other hand, current integrated urban models (IUMs) that incorporate demographic dynamics mostly rely on conventional logit models and rule-based models. These models may not be optimal for complex modeling since they do not fully capture the non-linear relationship between inputs and output. In this research, we explore the feasibility of utilizing machine learning (ML) models to enhance the prediction of demographic dynamics within our proposed IUM—known as ‘STELARS’, in conjunction with conventional logit models. To address the challenge of the black-box nature of ML, we employ an explainable AI technique (xAI) to gain insights into the influence of the factors and compare them with the interpretation revealed by the logit models. Three demographic components are considered: marriage/common-law formation, separation and divorce, and childbirth events, while other components were developed using rate-based models. The results (on the testing dataset) indicate that ML models outperform conventional logit models in terms of overall accuracy by a margin of up-to 3%. However, when considering the true positive accuracy (correctly predicting the event of interest), a significant improvement of 30–48% is observed. Additionally, the xAI analysis reveals consistent interpretation with the logit model. Subsequently, we implemented our demographic dynamics module within our integrated urban modeling system to predict population changes in the Okanagan region of Canada. The multi-year validation of the simulation results against Census data suggests a reasonably close prediction of the observed population. We also optimize the runtime of the demographic dynamics module using vectorization, reducing the simulation time for the demographic changes in our study area (comprising approximately 200,000 individuals living in 85,000 households) to just about 100 s for the total 10 years of simulation. The development and implementation of this advanced demographic dynamics module to accurately predict the life events of individuals adds a fundamental capacity to the STELARS to be built as an event-based microsimulation model.
期刊介绍:
In our first issue, published in 1972, we explained that this Journal is intended to promote the free and vigorous exchange of ideas and experience among the worldwide community actively concerned with transportation policy, planning and practice. That continues to be our mission, with a clear focus on topics concerned with research and practice in transportation policy and planning, around the world.
These four words, policy and planning, research and practice are our key words. While we have a particular focus on transportation policy analysis and travel behaviour in the context of ground transportation, we willingly consider all good quality papers that are highly relevant to transportation policy, planning and practice with a clear focus on innovation, on extending the international pool of knowledge and understanding. Our interest is not only with transportation policies - and systems and services – but also with their social, economic and environmental impacts, However, papers about the application of established procedures to, or the development of plans or policies for, specific locations are unlikely to prove acceptable unless they report experience which will be of real benefit those working elsewhere. Papers concerned with the engineering, safety and operational management of transportation systems are outside our scope.