{"title":"Data-Driven Predictive Modeling of Citywide Crowd Flow for Urban Safety Management: A Case Study of Beijing, China","authors":"He Jiang, Xuxilu Zhang, Yao Dong, Jianzhou Wang","doi":"10.1002/for.3216","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Crowd flow forecasting is vital for urban planning, resource allocation, and public safety, particularly in the context of the COVID-19 pandemic. However, traditional predictive models struggle to capture the complex, nonlinear spatial–temporal relationships inherent in crowd flow data due to its irregular volatility. To address these limitations, this paper proposes the innovative citywide crowd flow prediction (CCFP) model, which merges statistical rules with machine learning techniques (XGBoost, LightGBM, and CatBoost). The CCFP model is specifically designed to leverage spatial dependencies and two-level periodicity (weekly and daily) in population flow to predict crowd flow indexes (\n<span></span><math>\n <semantics>\n <mrow>\n <mi>C</mi>\n <mi>F</mi>\n <mi>I</mi>\n </mrow>\n <annotation>$$ CFI $$</annotation>\n </semantics></math>) within specific areas. We employ an urban area graph created using the Node2Vec algorithm to capture the temporal and spatial nuances of human flow patterns. Notably, this study innovatively incorporates migration, weather, and epidemic data into machine-learning models for feature extraction. Moreover, it introduces weighted factors—\n<span></span><math>\n <semantics>\n <mrow>\n <mi>g</mi>\n <mi>r</mi>\n <mi>o</mi>\n <mi>w</mi>\n <mi>t</mi>\n <mi>h</mi>\n <mo>,</mo>\n <mspace></mspace>\n <mi>b</mi>\n <mi>a</mi>\n <mi>s</mi>\n <mi>e</mi>\n <mo>,</mo>\n <mspace></mspace>\n <mi>w</mi>\n <mi>e</mi>\n <mi>e</mi>\n <mi>k</mi>\n </mrow>\n <annotation>$$ growth, base, week $$</annotation>\n </semantics></math>, and \n<span></span><math>\n <semantics>\n <mrow>\n <mi>d</mi>\n <mi>a</mi>\n <mi>y</mi>\n </mrow>\n <annotation>$$ day $$</annotation>\n </semantics></math>—to enhance the accuracy of \n<span></span><math>\n <semantics>\n <mrow>\n <mi>C</mi>\n <mi>F</mi>\n <mi>I</mi>\n </mrow>\n <annotation>$$ CFI $$</annotation>\n </semantics></math> prediction. Among the combined models, CCFP outperforms others with remarkable scientific precision (root mean squared error, \n<span></span><math>\n <semantics>\n <mrow>\n <mi>R</mi>\n <mi>M</mi>\n <mi>S</mi>\n <mi>E</mi>\n <mo>=</mo>\n <mn>2.04</mn>\n </mrow>\n <annotation>$$ RMSE&#x0003D;2.04 $$</annotation>\n </semantics></math>; mean absolute error, \n<span></span><math>\n <semantics>\n <mrow>\n <mi>M</mi>\n <mi>A</mi>\n <mi>E</mi>\n <mo>=</mo>\n <mn>0.81</mn>\n </mrow>\n <annotation>$$ MAE&#x0003D;0.81 $$</annotation>\n </semantics></math>; mean absolute percentage error, \n<span></span><math>\n <semantics>\n <mrow>\n <mi>M</mi>\n <mi>A</mi>\n <mi>P</mi>\n <mi>E</mi>\n <mo>=</mo>\n <mn>0.13</mn>\n </mrow>\n <annotation>$$ MAPE&#x0003D;0.13 $$</annotation>\n </semantics></math>). Overall, the CCFP model represents a significant advancement in crowd flow prediction, offering valuable insights for urban safety management and city planning during pandemics.</p>\n </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 2","pages":"730-752"},"PeriodicalIF":3.4000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Forecasting","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/for.3216","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Crowd flow forecasting is vital for urban planning, resource allocation, and public safety, particularly in the context of the COVID-19 pandemic. However, traditional predictive models struggle to capture the complex, nonlinear spatial–temporal relationships inherent in crowd flow data due to its irregular volatility. To address these limitations, this paper proposes the innovative citywide crowd flow prediction (CCFP) model, which merges statistical rules with machine learning techniques (XGBoost, LightGBM, and CatBoost). The CCFP model is specifically designed to leverage spatial dependencies and two-level periodicity (weekly and daily) in population flow to predict crowd flow indexes (
) within specific areas. We employ an urban area graph created using the Node2Vec algorithm to capture the temporal and spatial nuances of human flow patterns. Notably, this study innovatively incorporates migration, weather, and epidemic data into machine-learning models for feature extraction. Moreover, it introduces weighted factors—
, and
—to enhance the accuracy of
prediction. Among the combined models, CCFP outperforms others with remarkable scientific precision (root mean squared error,
; mean absolute error,
; mean absolute percentage error,
). Overall, the CCFP model represents a significant advancement in crowd flow prediction, offering valuable insights for urban safety management and city planning during pandemics.
人流预测对城市规划、资源分配和公共安全至关重要,尤其是在 COVID-19 大流行的背景下。然而,由于人流数据的不规则波动性,传统的预测模型难以捕捉人流数据中固有的复杂、非线性时空关系。为了解决这些局限性,本文提出了创新性的全市人群流量预测模型(CCFP),该模型融合了统计规则和机器学习技术(XGBoost、LightGBM 和 CatBoost)。CCFP 模型专门设计用于利用人口流动的空间依赖性和两级周期性(每周和每天)来预测特定区域内的人流指数(C F I $$ CFI $$)。我们采用 Node2Vec 算法创建的城市区域图来捕捉人流模式在时间和空间上的细微差别。值得注意的是,本研究创新性地将移民、天气和流行病数据纳入机器学习模型,以提取特征。此外,它还引入了加权因子--增长、基数、周$$和日$$,以提高 C F I $$ CFI $$预测的准确性。在组合模型中,CCFP 以显著的科学精度(均方根误差,R M S E = 2.04 $$ RMSE=2.04 $$;平均绝对误差,M A E = 0.81 $$ MAE=0.81 $$;平均绝对百分比误差,M A P E = 0.13 $$ MAPE=0.13 $$)优于其他模型。总之,CCFP 模型代表了人流预测领域的重大进步,为大流行病期间的城市安全管理和城市规划提供了宝贵的见解。
期刊介绍:
The Journal of Forecasting is an international journal that publishes refereed papers on forecasting. It is multidisciplinary, welcoming papers dealing with any aspect of forecasting: theoretical, practical, computational and methodological. A broad interpretation of the topic is taken with approaches from various subject areas, such as statistics, economics, psychology, systems engineering and social sciences, all encouraged. Furthermore, the Journal welcomes a wide diversity of applications in such fields as business, government, technology and the environment. Of particular interest are papers dealing with modelling issues and the relationship of forecasting systems to decision-making processes.