{"title":"A Deep Learning Approach to Outbreak related Tweet Detection","authors":"B. Jayawardhana, R. Rajapakse","doi":"10.1109/icac51239.2020.9357274","DOIUrl":null,"url":null,"abstract":"Due to the popularity of social media around the world, people use to report and discuss real-world events, personal health complications, and disaster situations through these platforms. These social media data streams can be used to track and detect different types of outbreaks. A mechanism is needed to identify outbreak-related tweets to predict the outbreak in advance. In this paper, we propose a deep learning model that can detect tweets related to different outbreaks Epidemics, Public Disorders, and Disasters. GloVe (Global Vectors for Word Representation) embeddings are used as the feature extraction technique as it can capture the semantic meanings of the tweets. Long Short-term Memory (LSTM) which is a specialized Recurrent Neural Network architecture is used as the classification algorithm. In the process, first, outbreak-related tweets were manually collected and curated. Pretrained GloVe word embeddings of 100 dimensions were then used to represent the words of the tweets. As the next step, a Deep Learning Model was trained by using LSTM technique on the curated dataset. Finally, the performance of the model was evaluated using a different dataset. With the results, it can be concluded that the proposed deep learning model is an accurate approach for outbreak-related tweet detection.","PeriodicalId":253040,"journal":{"name":"2020 2nd International Conference on Advancements in Computing (ICAC)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Advancements in Computing (ICAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icac51239.2020.9357274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Due to the popularity of social media around the world, people use to report and discuss real-world events, personal health complications, and disaster situations through these platforms. These social media data streams can be used to track and detect different types of outbreaks. A mechanism is needed to identify outbreak-related tweets to predict the outbreak in advance. In this paper, we propose a deep learning model that can detect tweets related to different outbreaks Epidemics, Public Disorders, and Disasters. GloVe (Global Vectors for Word Representation) embeddings are used as the feature extraction technique as it can capture the semantic meanings of the tweets. Long Short-term Memory (LSTM) which is a specialized Recurrent Neural Network architecture is used as the classification algorithm. In the process, first, outbreak-related tweets were manually collected and curated. Pretrained GloVe word embeddings of 100 dimensions were then used to represent the words of the tweets. As the next step, a Deep Learning Model was trained by using LSTM technique on the curated dataset. Finally, the performance of the model was evaluated using a different dataset. With the results, it can be concluded that the proposed deep learning model is an accurate approach for outbreak-related tweet detection.
由于社交媒体在全球的普及,人们习惯于通过这些平台报道和讨论现实世界的事件、个人健康并发症和灾难情况。这些社交媒体数据流可用于跟踪和检测不同类型的疫情。需要一种机制来识别与爆发相关的推文,以便提前预测爆发。在本文中,我们提出了一个深度学习模型,可以检测与不同爆发、流行病、公共疾病和灾难相关的推文。使用GloVe (Global Vectors for Word Representation)嵌入作为特征提取技术,因为它可以捕获推文的语义。长短期记忆(LSTM)是一种特殊的递归神经网络结构。在这个过程中,首先,与疫情爆发相关的推文被手动收集和整理。然后使用100维的预训练手套词嵌入来表示推文的单词。下一步,使用LSTM技术在整理的数据集上训练深度学习模型。最后,使用不同的数据集评估模型的性能。由此可以得出结论,所提出的深度学习模型是一种准确的爆发相关推文检测方法。