{"title":"Improving ridership by predicting train occupancy levels","authors":"Muhammad Awais Shafique","doi":"10.1016/j.jpubtr.2024.100092","DOIUrl":null,"url":null,"abstract":"<div><p>With the frequent global breakouts of infectious diseases such as Covid-19 and the likes, passengers feel unsafe traveling in crowded trains. The reluctance to share public transport with others due to the risk of disease transmission may lower the ridership as well as decrease the comfort level of passengers. Providing them with future crowdedness levels may allow them to plan accordingly, hence regaining the lost confidence and improving their patronage. This study explores the less frequently investigated relationship among occupancy levels at a particular station over several train runs, to predict the future occupancy level with a delay of one run (day). Tackling the issue as a classification problem rather than a regression problem, train occupancy data, station data, and weather data are merged to develop the final dataset. Training data is stepwise increased from 1 month to 3 months. Similarly, 1–5 days of known occupancy levels are added to each data instance. Among the three classifiers used, XGBoost provides the best results. Some practical challenges to occupancy level prediction are also discussed at the end.</p></div>","PeriodicalId":47173,"journal":{"name":"Journal of Public Transportation","volume":"26 ","pages":"Article 100092"},"PeriodicalIF":2.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1077291X24000122/pdfft?md5=09a84209d146e1f92290710bd880a7ca&pid=1-s2.0-S1077291X24000122-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Public Transportation","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1077291X24000122","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0
Abstract
With the frequent global breakouts of infectious diseases such as Covid-19 and the likes, passengers feel unsafe traveling in crowded trains. The reluctance to share public transport with others due to the risk of disease transmission may lower the ridership as well as decrease the comfort level of passengers. Providing them with future crowdedness levels may allow them to plan accordingly, hence regaining the lost confidence and improving their patronage. This study explores the less frequently investigated relationship among occupancy levels at a particular station over several train runs, to predict the future occupancy level with a delay of one run (day). Tackling the issue as a classification problem rather than a regression problem, train occupancy data, station data, and weather data are merged to develop the final dataset. Training data is stepwise increased from 1 month to 3 months. Similarly, 1–5 days of known occupancy levels are added to each data instance. Among the three classifiers used, XGBoost provides the best results. Some practical challenges to occupancy level prediction are also discussed at the end.
期刊介绍:
The Journal of Public Transportation, affiliated with the Center for Urban Transportation Research, is an international peer-reviewed open access journal focused on various forms of public transportation. It publishes original research from diverse academic disciplines, including engineering, economics, planning, and policy, emphasizing innovative solutions to transportation challenges. Content covers mobility services available to the general public, such as line-based services and shared fleets, offering insights beneficial to passengers, agencies, service providers, and communities.