Erika Ritzelle P. Bondoc, Francis Percival M. Caparas, John Eddie D. Macias, Vileser T. Naculangga, Jheanel E. Estrada
{"title":"MMARRS:基于文本分析的多式联运和单式公共交通智能路线推荐和道路交通信息系统","authors":"Erika Ritzelle P. Bondoc, Francis Percival M. Caparas, John Eddie D. Macias, Vileser T. Naculangga, Jheanel E. Estrada","doi":"10.1109/HNICEM.2018.8666343","DOIUrl":null,"url":null,"abstract":"Public commuting in the Philippines, particularly in the Metro Manila setting, continuously rises as a crucial problem due to also constantly voluminously increasing traffic congestion. Hence, this study proposes an intelligent route recommendation and road traffic information system that uses LTFRB data on 1) fare matrix computation (LRT 1, LRT 2, MRT 3, Bus, and Jeep) through utilizing Sakay.ph GTFS data, as well as 2) taxi pricing computation, and MMDA data (from their official Twitter account) on real-time traffic situation. Such data are used 1) to process the top three route recommendation choices, 2) to create attribute-based bag of words, extract appropriate dataset features, and classify the traffic congestion mode (Light, Light to Moderate, Moderate, Moderate to Heavy, and Heavy) in the involved road/s using Latent Dirichlet Allocation (LDA), and 3) to rebuild the system model automatically in a certain time interval. In this study, 1) the traffic-related tweets from the official Twitter account of MMDA are fetched using Twitter Streaming API and filtered using Named Entity Recognition; 2) the filtered data are preprocessed by applying tokenization, frequency counting, and removal of unnecessary symbols; 3) the features from the preprocessed data are then extracted using Latent Dirichlet Allocation and are hereby used to identify the significant topic segments (time, day, lane of road, road direction, location and traffic mode); and 4) Linear Regression was used for pattern recognition. The results found are as follows: 1) 84% for the accuracy, 85% for the precision, and 83% for the recall garnered for the applied methodology using k-NN as the chosen classification model; 2) the advantage of supervised data acquisition over unsupervised data acquisition; and 3) traffic mode-based pattern extraction and evaluation. These results show the usability and practicality of the study to public commuting.","PeriodicalId":426103,"journal":{"name":"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"MMARRS: An Intelligent Route Recommendation and Road Traffic Information System for Multimodal and Unimodal Public Transportation using Text Analysis\",\"authors\":\"Erika Ritzelle P. Bondoc, Francis Percival M. Caparas, John Eddie D. Macias, Vileser T. Naculangga, Jheanel E. Estrada\",\"doi\":\"10.1109/HNICEM.2018.8666343\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Public commuting in the Philippines, particularly in the Metro Manila setting, continuously rises as a crucial problem due to also constantly voluminously increasing traffic congestion. Hence, this study proposes an intelligent route recommendation and road traffic information system that uses LTFRB data on 1) fare matrix computation (LRT 1, LRT 2, MRT 3, Bus, and Jeep) through utilizing Sakay.ph GTFS data, as well as 2) taxi pricing computation, and MMDA data (from their official Twitter account) on real-time traffic situation. Such data are used 1) to process the top three route recommendation choices, 2) to create attribute-based bag of words, extract appropriate dataset features, and classify the traffic congestion mode (Light, Light to Moderate, Moderate, Moderate to Heavy, and Heavy) in the involved road/s using Latent Dirichlet Allocation (LDA), and 3) to rebuild the system model automatically in a certain time interval. In this study, 1) the traffic-related tweets from the official Twitter account of MMDA are fetched using Twitter Streaming API and filtered using Named Entity Recognition; 2) the filtered data are preprocessed by applying tokenization, frequency counting, and removal of unnecessary symbols; 3) the features from the preprocessed data are then extracted using Latent Dirichlet Allocation and are hereby used to identify the significant topic segments (time, day, lane of road, road direction, location and traffic mode); and 4) Linear Regression was used for pattern recognition. The results found are as follows: 1) 84% for the accuracy, 85% for the precision, and 83% for the recall garnered for the applied methodology using k-NN as the chosen classification model; 2) the advantage of supervised data acquisition over unsupervised data acquisition; and 3) traffic mode-based pattern extraction and evaluation. These results show the usability and practicality of the study to public commuting.\",\"PeriodicalId\":426103,\"journal\":{\"name\":\"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HNICEM.2018.8666343\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM.2018.8666343","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
在菲律宾,特别是在马尼拉大都会,由于交通拥堵不断增加,公共通勤作为一个关键问题不断上升。因此,本研究以Sakay为工具,利用LTFRB资料计算1)车费矩阵(LRT 1、LRT 2、MRT 3、Bus、Jeep),提出一套智慧路线推荐与道路交通资讯系统。ph GTFS数据,以及2)出租车定价计算,以及MMDA数据(来自他们的官方Twitter账户)实时交通状况。利用这些数据1)处理前3个路线推荐选择,2)创建基于属性的词袋,提取适当的数据集特征,并使用Latent Dirichlet Allocation (LDA)对所涉及道路/s的交通拥堵模式(Light、Light to Moderate、Moderate、Moderate to Heavy和Heavy)进行分类,3)在一定的时间间隔内自动重建系统模型。在本研究中,1)使用Twitter Streaming API提取MMDA官方Twitter账号的流量相关推文,并使用Named Entity Recognition进行过滤;2)对过滤后的数据进行标记化、频率计数和去除不必要符号的预处理;3)利用Latent Dirichlet Allocation提取预处理数据中的特征,识别显著主题段(时间、日期、道路车道、道路方向、位置和交通方式);4)采用线性回归进行模式识别。结果表明:1)使用k-NN作为选择的分类模型的应用方法的准确率为84%,精密度为85%,召回率为83%;2)有监督数据采集相对于无监督数据采集的优势;3)基于交通模式的模式提取与评价。结果表明,本研究对公共通勤具有实用性和实用性。
MMARRS: An Intelligent Route Recommendation and Road Traffic Information System for Multimodal and Unimodal Public Transportation using Text Analysis
Public commuting in the Philippines, particularly in the Metro Manila setting, continuously rises as a crucial problem due to also constantly voluminously increasing traffic congestion. Hence, this study proposes an intelligent route recommendation and road traffic information system that uses LTFRB data on 1) fare matrix computation (LRT 1, LRT 2, MRT 3, Bus, and Jeep) through utilizing Sakay.ph GTFS data, as well as 2) taxi pricing computation, and MMDA data (from their official Twitter account) on real-time traffic situation. Such data are used 1) to process the top three route recommendation choices, 2) to create attribute-based bag of words, extract appropriate dataset features, and classify the traffic congestion mode (Light, Light to Moderate, Moderate, Moderate to Heavy, and Heavy) in the involved road/s using Latent Dirichlet Allocation (LDA), and 3) to rebuild the system model automatically in a certain time interval. In this study, 1) the traffic-related tweets from the official Twitter account of MMDA are fetched using Twitter Streaming API and filtered using Named Entity Recognition; 2) the filtered data are preprocessed by applying tokenization, frequency counting, and removal of unnecessary symbols; 3) the features from the preprocessed data are then extracted using Latent Dirichlet Allocation and are hereby used to identify the significant topic segments (time, day, lane of road, road direction, location and traffic mode); and 4) Linear Regression was used for pattern recognition. The results found are as follows: 1) 84% for the accuracy, 85% for the precision, and 83% for the recall garnered for the applied methodology using k-NN as the chosen classification model; 2) the advantage of supervised data acquisition over unsupervised data acquisition; and 3) traffic mode-based pattern extraction and evaluation. These results show the usability and practicality of the study to public commuting.