{"title":"Forecasting urban travel demand with geo-AI: a combination of GIS and machine learning techniques utilizing uber data in New York City","authors":"Sana Haery, Alireza Mahpour, Alireza Vafaeinejad","doi":"10.1007/s12665-024-11900-y","DOIUrl":null,"url":null,"abstract":"<div><p>Presently, online ride-hailing firms and municipal transportation systems substantially influence urban traffic management. Considering the increasing use of these services, it is essential to assess and predict travel demand to optimize service efficiency. This study analyzes Uber data from New York City, employing automated machine learning algorithms to assess and forecast ride demand inside the city. In conventional transportation planning, forecasting travelers’ destination selections is a crucial stage. Traditional methodologies, such as physical models like gravity models, are constrained in their capacity to encompass the comprehensive array of elements affecting travel behavior, frequently addressing just two or three variables. This study employs machine learning methodologies for predicting passenger destination choices, illustrating that these algorithms can include a wider array of variables, hence providing enhanced forecast accuracy relative to conventional approaches. The utilization of ArcGIS Pro and Python modules for automation enhanced the efficiency of spatial analysis. A range of machine learning methodologies, such as decision trees, Light-GBM, XG-Boost, Cat-Boost, and hybrid models, were utilized to predict demand. The selected source demand forecasting model is an ensemble model, with a R<sup>2</sup> of 0.94 and a Mean Absolute Error of 91.63. The optimal model for predicting destination demand was identified as a composite model comprising eight distinct algorithms. The model’s R<sup>2</sup> is 0.95, while the Mean Absolute Error is 67.12. Moreover, the examination of environmental factors indicated that proximity to recreational activities, median age, and population density had the most substantial influence on predicting travel demand.</p></div>","PeriodicalId":542,"journal":{"name":"Environmental Earth Sciences","volume":"83 20","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Earth Sciences","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s12665-024-11900-y","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Presently, online ride-hailing firms and municipal transportation systems substantially influence urban traffic management. Considering the increasing use of these services, it is essential to assess and predict travel demand to optimize service efficiency. This study analyzes Uber data from New York City, employing automated machine learning algorithms to assess and forecast ride demand inside the city. In conventional transportation planning, forecasting travelers’ destination selections is a crucial stage. Traditional methodologies, such as physical models like gravity models, are constrained in their capacity to encompass the comprehensive array of elements affecting travel behavior, frequently addressing just two or three variables. This study employs machine learning methodologies for predicting passenger destination choices, illustrating that these algorithms can include a wider array of variables, hence providing enhanced forecast accuracy relative to conventional approaches. The utilization of ArcGIS Pro and Python modules for automation enhanced the efficiency of spatial analysis. A range of machine learning methodologies, such as decision trees, Light-GBM, XG-Boost, Cat-Boost, and hybrid models, were utilized to predict demand. The selected source demand forecasting model is an ensemble model, with a R2 of 0.94 and a Mean Absolute Error of 91.63. The optimal model for predicting destination demand was identified as a composite model comprising eight distinct algorithms. The model’s R2 is 0.95, while the Mean Absolute Error is 67.12. Moreover, the examination of environmental factors indicated that proximity to recreational activities, median age, and population density had the most substantial influence on predicting travel demand.
期刊介绍:
Environmental Earth Sciences is an international multidisciplinary journal concerned with all aspects of interaction between humans, natural resources, ecosystems, special climates or unique geographic zones, and the earth:
Water and soil contamination caused by waste management and disposal practices
Environmental problems associated with transportation by land, air, or water
Geological processes that may impact biosystems or humans
Man-made or naturally occurring geological or hydrological hazards
Environmental problems associated with the recovery of materials from the earth
Environmental problems caused by extraction of minerals, coal, and ores, as well as oil and gas, water and alternative energy sources
Environmental impacts of exploration and recultivation – Environmental impacts of hazardous materials
Management of environmental data and information in data banks and information systems
Dissemination of knowledge on techniques, methods, approaches and experiences to improve and remediate the environment
In pursuit of these topics, the geoscientific disciplines are invited to contribute their knowledge and experience. Major disciplines include: hydrogeology, hydrochemistry, geochemistry, geophysics, engineering geology, remediation science, natural resources management, environmental climatology and biota, environmental geography, soil science and geomicrobiology.