{"title":"Improving ridership by predicting train occupancy levels","authors":"Muhammad Awais Shafique","doi":"10.1016/j.jpubtr.2024.100092","DOIUrl":null,"url":null,"abstract":"<div><p>With the frequent global breakouts of infectious diseases such as Covid-19 and the likes, passengers feel unsafe traveling in crowded trains. The reluctance to share public transport with others due to the risk of disease transmission may lower the ridership as well as decrease the comfort level of passengers. Providing them with future crowdedness levels may allow them to plan accordingly, hence regaining the lost confidence and improving their patronage. This study explores the less frequently investigated relationship among occupancy levels at a particular station over several train runs, to predict the future occupancy level with a delay of one run (day). Tackling the issue as a classification problem rather than a regression problem, train occupancy data, station data, and weather data are merged to develop the final dataset. Training data is stepwise increased from 1 month to 3 months. Similarly, 1–5 days of known occupancy levels are added to each data instance. Among the three classifiers used, XGBoost provides the best results. Some practical challenges to occupancy level prediction are also discussed at the end.</p></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1077291X24000122/pdfft?md5=09a84209d146e1f92290710bd880a7ca&pid=1-s2.0-S1077291X24000122-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1077291X24000122","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
With the frequent global breakouts of infectious diseases such as Covid-19 and the likes, passengers feel unsafe traveling in crowded trains. The reluctance to share public transport with others due to the risk of disease transmission may lower the ridership as well as decrease the comfort level of passengers. Providing them with future crowdedness levels may allow them to plan accordingly, hence regaining the lost confidence and improving their patronage. This study explores the less frequently investigated relationship among occupancy levels at a particular station over several train runs, to predict the future occupancy level with a delay of one run (day). Tackling the issue as a classification problem rather than a regression problem, train occupancy data, station data, and weather data are merged to develop the final dataset. Training data is stepwise increased from 1 month to 3 months. Similarly, 1–5 days of known occupancy levels are added to each data instance. Among the three classifiers used, XGBoost provides the best results. Some practical challenges to occupancy level prediction are also discussed at the end.