Leveraging Social Media as a Source of Mobility Intelligence: An NLP-Based Approach

IF 4.6 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Open Journal of Intelligent Transportation Systems Pub Date : 2023-08-24 DOI:10.1109/OJITS.2023.3308210

Tânia Fontes;Francisco Murços;Eduardo Carneiro;Joel Ribeiro;Rosaldo J. F. Rossetti

{"title":"Leveraging Social Media as a Source of Mobility Intelligence: An NLP-Based Approach","authors":"Tânia Fontes;Francisco Murços;Eduardo Carneiro;Joel Ribeiro;Rosaldo J. F. Rossetti","doi":"10.1109/OJITS.2023.3308210","DOIUrl":null,"url":null,"abstract":"This work presents a deep learning framework for analyzing urban mobility by extracting knowledge from messages collected from Twitter. The framework, which is designed to handle large-scale data and adapt automatically to new contexts, comprises three main modules: data collection and system configuration, data analytics, and aggregation and visualization. The text data is pre-processed using NLP techniques to remove informal words, slang, and misspellings. A pre-trained, unsupervised word embedding model, BERT, is used to classify travel-related tweets using a unigram approach with three dictionaries of travel-related target words: small, medium, and big. Public opinion is evaluated using VADER to classify travel-related tweets according to their sentiments. The mobility of three major cities was assessed: London, Melbourne, and New York. The framework demonstrates consistently high average performance, with a Precision of 0.80 for text classification and 0.77 for sentiment analysis. The framework can aggregate sparse information from social media and provide updated information in near real-time with high spatial resolution, enabling easy identification of traffic-related events. The framework is helpful for transportation decision-makers in operational control, tactical-strategic planning, and policy evaluation. For example, it can be used to improve the management of resources during traffic congestion or emergencies.","PeriodicalId":100631,"journal":{"name":"IEEE Open Journal of Intelligent Transportation Systems","volume":"4 ","pages":"663-681"},"PeriodicalIF":4.6000,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8784355/9999144/10229505.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Intelligent Transportation Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10229505/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 1

Abstract

This work presents a deep learning framework for analyzing urban mobility by extracting knowledge from messages collected from Twitter. The framework, which is designed to handle large-scale data and adapt automatically to new contexts, comprises three main modules: data collection and system configuration, data analytics, and aggregation and visualization. The text data is pre-processed using NLP techniques to remove informal words, slang, and misspellings. A pre-trained, unsupervised word embedding model, BERT, is used to classify travel-related tweets using a unigram approach with three dictionaries of travel-related target words: small, medium, and big. Public opinion is evaluated using VADER to classify travel-related tweets according to their sentiments. The mobility of three major cities was assessed: London, Melbourne, and New York. The framework demonstrates consistently high average performance, with a Precision of 0.80 for text classification and 0.77 for sentiment analysis. The framework can aggregate sparse information from social media and provide updated information in near real-time with high spatial resolution, enabling easy identification of traffic-related events. The framework is helpful for transportation decision-makers in operational control, tactical-strategic planning, and policy evaluation. For example, it can be used to improve the management of resources during traffic congestion or emergencies.

查看原文本刊更多论文

利用社交媒体作为移动智能的来源:基于nlp的方法

这项工作提出了一个深度学习框架，通过从Twitter收集的消息中提取知识来分析城市流动性。该框架旨在处理大规模数据并自动适应新环境，包括三个主要模块:数据收集和系统配置、数据分析以及聚合和可视化。文本数据使用NLP技术进行预处理，以删除非正式单词、俚语和拼写错误。一个预训练的、无监督的词嵌入模型，BERT，使用一元图的方法对与旅游相关的推文进行分类，其中有三个与旅游相关的目标词字典:小、中、大。公众舆论评估使用VADER根据他们的情绪对旅游相关的推文进行分类。评估了三个主要城市的流动性:伦敦、墨尔本和纽约。该框架表现出一贯的高平均性能，文本分类的精度为0.80，情感分析的精度为0.77。该框架可以聚合来自社交媒体的稀疏信息，提供近实时、高空间分辨率的更新信息，方便识别交通相关事件。该框架有助于交通决策者进行运营控制、战术战略规划和政策评估。例如，在交通拥堵或突发事件时，可以使用它来改进资源管理。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Open Journal of Intelligent Transportation Systems

CiteScore

5.40

自引率

0.00%

发文量