基于文本分析的马尼拉大都市最拥堵道路智能交通信息系统

Erika Ritzelle P. Bondoc, Francis Percival M. Caparas, John Eddie D. Macias, Vileser T. Naculangga, Jheanel E. Estrada
{"title":"基于文本分析的马尼拉大都市最拥堵道路智能交通信息系统","authors":"Erika Ritzelle P. Bondoc, Francis Percival M. Caparas, John Eddie D. Macias, Vileser T. Naculangga, Jheanel E. Estrada","doi":"10.1109/HNICEM.2018.8666416","DOIUrl":null,"url":null,"abstract":"The increasing number of vehicles for every year in the Philippines produces a bigger percentage of causing severe traffic congestion in the three leading roads (Circumferential Road 5, EDSA, and Commonwealth Avenue) in Metro Manila. The researchers observed that most of the implemented technology used to detect traffic is by using a surveillance camera which requires labor and costly, while other web and mobile traffic information system used crowdsourcing and mostly owned by private. The researchers aim to develop a real-time road traffic information system by using the traffic-related tweets from MMDA (Metropolitan Manila Development Authority) official Twitter account to aid the current problem of providing sufficient road traffic information in the country. In this research, the traffic-related tweets from MMDA data are fetched using Twitter Streaming API, filtered by using Named Entity Recognition, preprocessed by applying tokenization, frequency counting and removal of unnecessary symbols, features were extracted by using Latent Dirichlet Allocation to identify the significant topic segments (time, day, lane of road, road direction, location and traffic mode) and Linear Regression was used for pattern recognition. Also, this study compares four machine learning algorithms (Naïve Bayes, Decision Tree, Random Tree, and k-Nearest Neighbor) for the proper identification of the most effective algorithm that will be used to for traffic mode classification. The experimental result shows that k-NN produced the best performance compared to other algorithms with 84.00% classification accuracy in anticipating the traffic congestion modes (Light, Light to Moderate, Moderate, Moderate to Heavy, and Heavy) in Metro Manila, Philippines.","PeriodicalId":426103,"journal":{"name":"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"An Intelligent Road Traffic Information System using Text Analysis in the Most Congested Roads in Metro Manila\",\"authors\":\"Erika Ritzelle P. Bondoc, Francis Percival M. Caparas, John Eddie D. Macias, Vileser T. Naculangga, Jheanel E. Estrada\",\"doi\":\"10.1109/HNICEM.2018.8666416\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasing number of vehicles for every year in the Philippines produces a bigger percentage of causing severe traffic congestion in the three leading roads (Circumferential Road 5, EDSA, and Commonwealth Avenue) in Metro Manila. The researchers observed that most of the implemented technology used to detect traffic is by using a surveillance camera which requires labor and costly, while other web and mobile traffic information system used crowdsourcing and mostly owned by private. The researchers aim to develop a real-time road traffic information system by using the traffic-related tweets from MMDA (Metropolitan Manila Development Authority) official Twitter account to aid the current problem of providing sufficient road traffic information in the country. In this research, the traffic-related tweets from MMDA data are fetched using Twitter Streaming API, filtered by using Named Entity Recognition, preprocessed by applying tokenization, frequency counting and removal of unnecessary symbols, features were extracted by using Latent Dirichlet Allocation to identify the significant topic segments (time, day, lane of road, road direction, location and traffic mode) and Linear Regression was used for pattern recognition. Also, this study compares four machine learning algorithms (Naïve Bayes, Decision Tree, Random Tree, and k-Nearest Neighbor) for the proper identification of the most effective algorithm that will be used to for traffic mode classification. The experimental result shows that k-NN produced the best performance compared to other algorithms with 84.00% classification accuracy in anticipating the traffic congestion modes (Light, Light to Moderate, Moderate, Moderate to Heavy, and Heavy) in Metro Manila, Philippines.\",\"PeriodicalId\":426103,\"journal\":{\"name\":\"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HNICEM.2018.8666416\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology,Communication and Control, Environment and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM.2018.8666416","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

菲律宾每年不断增加的车辆数量在马尼拉大都会的三条主要道路(环形5号公路,EDSA和联邦大道)造成严重交通拥堵的比例更大。研究人员观察到,大多数用于检测交通的实施技术都是通过使用监控摄像头,这需要人工和昂贵的成本,而其他网络和移动交通信息系统则采用众包方式,而且大多由私人拥有。研究人员的目标是开发一个实时道路交通信息系统,通过使用来自MMDA(马尼拉大都会发展局)官方Twitter账户的交通相关推文来帮助解决目前该国提供足够道路交通信息的问题。本研究使用Twitter Streaming API提取MMDA数据中的交通相关推文,使用命名实体识别进行过滤,使用标记化、频率计数和去除不必要符号进行预处理,使用Latent Dirichlet Allocation提取特征以识别重要主题片段(时间、日期、道路车道、道路方向、位置和交通方式),并使用线性回归进行模式识别。此外,本研究比较了四种机器学习算法(Naïve贝叶斯,决策树,随机树和k近邻),以正确识别将用于交通模式分类的最有效算法。实验结果表明,与其他算法相比,k-NN在预测菲律宾马尼拉大都会的交通拥堵模式(轻度、轻度到中度、中度、中度到重度和重度)方面的表现最好,分类准确率为84.00%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Intelligent Road Traffic Information System using Text Analysis in the Most Congested Roads in Metro Manila
The increasing number of vehicles for every year in the Philippines produces a bigger percentage of causing severe traffic congestion in the three leading roads (Circumferential Road 5, EDSA, and Commonwealth Avenue) in Metro Manila. The researchers observed that most of the implemented technology used to detect traffic is by using a surveillance camera which requires labor and costly, while other web and mobile traffic information system used crowdsourcing and mostly owned by private. The researchers aim to develop a real-time road traffic information system by using the traffic-related tweets from MMDA (Metropolitan Manila Development Authority) official Twitter account to aid the current problem of providing sufficient road traffic information in the country. In this research, the traffic-related tweets from MMDA data are fetched using Twitter Streaming API, filtered by using Named Entity Recognition, preprocessed by applying tokenization, frequency counting and removal of unnecessary symbols, features were extracted by using Latent Dirichlet Allocation to identify the significant topic segments (time, day, lane of road, road direction, location and traffic mode) and Linear Regression was used for pattern recognition. Also, this study compares four machine learning algorithms (Naïve Bayes, Decision Tree, Random Tree, and k-Nearest Neighbor) for the proper identification of the most effective algorithm that will be used to for traffic mode classification. The experimental result shows that k-NN produced the best performance compared to other algorithms with 84.00% classification accuracy in anticipating the traffic congestion modes (Light, Light to Moderate, Moderate, Moderate to Heavy, and Heavy) in Metro Manila, Philippines.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信