Sem-TED: Semantic Twitter Event Detection and Adapting with News Stories

2022 8th International Conference on Web Research (ICWR) Pub Date : 2022-05-11 DOI:10.1109/ICWR54782.2022.9786234

Zahra Akhgari, M. Malekimajd, H. Rahmani

{"title":"Sem-TED: Semantic Twitter Event Detection and Adapting with News Stories","authors":"Zahra Akhgari, M. Malekimajd, H. Rahmani","doi":"10.1109/ICWR54782.2022.9786234","DOIUrl":null,"url":null,"abstract":"Public acceptance of social networks has made the analysis of these networks essential. Event detection in these networks including Twitter is one of the most momentous subjects in the field of natural language processing and text mining. In this paper, we investigated how to link popular social media topics and news stories using transformer models and neural networks. Accordingly, this study consists of two parts: First, detecting popular topics and, second, linking them to the news. Event detection techniques have been applied to detect popular topics, while an event detection method comprises text preprocessing, text embedding using Sentence Transformer, dimension reduction using the UMAP algorithm, and grouping them using the HDBSCAN algorithm. To examine relevance or non-relevance between the news and topics, a single-layer perceptron neural network is applied, in which the output of the model indicates relevance or nonrelevance. We have implemented the mentioned parts and have investigated them on a small sampling of two known datasets. The evaluation outcomes reveal that the first part leads to an average improvement of 8% compared to the entity-based methods. Moreover, the results of the second part demonstrate that the used neural network in this study has a better performance comparing several other methods.","PeriodicalId":355187,"journal":{"name":"2022 8th International Conference on Web Research (ICWR)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR54782.2022.9786234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Public acceptance of social networks has made the analysis of these networks essential. Event detection in these networks including Twitter is one of the most momentous subjects in the field of natural language processing and text mining. In this paper, we investigated how to link popular social media topics and news stories using transformer models and neural networks. Accordingly, this study consists of two parts: First, detecting popular topics and, second, linking them to the news. Event detection techniques have been applied to detect popular topics, while an event detection method comprises text preprocessing, text embedding using Sentence Transformer, dimension reduction using the UMAP algorithm, and grouping them using the HDBSCAN algorithm. To examine relevance or non-relevance between the news and topics, a single-layer perceptron neural network is applied, in which the output of the model indicates relevance or nonrelevance. We have implemented the mentioned parts and have investigated them on a small sampling of two known datasets. The evaluation outcomes reveal that the first part leads to an average improvement of 8% compared to the entity-based methods. Moreover, the results of the second part demonstrate that the used neural network in this study has a better performance comparing several other methods.

查看原文本刊更多论文

语义Twitter事件检测和适应新闻故事

公众对社交网络的接受使得对这些网络的分析变得至关重要。包括Twitter在内的这些网络的事件检测是自然语言处理和文本挖掘领域最重要的课题之一。在本文中，我们研究了如何使用变压器模型和神经网络将流行的社交媒体话题和新闻故事联系起来。因此，本研究包括两个部分:第一，发现热门话题，第二，将其与新闻联系起来。事件检测技术已被应用于流行话题的检测，而事件检测方法包括文本预处理、使用Sentence Transformer的文本嵌入、使用UMAP算法的降维以及使用HDBSCAN算法的分组。为了检查新闻和主题之间的相关性或非相关性，应用了单层感知器神经网络，其中模型的输出表明相关性或非相关性。我们已经实现了上述部分，并在两个已知数据集的小样本上进行了调查。评估结果显示，与基于实体的方法相比，第一部分导致平均8%的改进。此外，第二部分的结果表明，与其他几种方法相比，本研究中使用的神经网络具有更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 8th International Conference on Web Research (ICWR)

自引率

0.00%

发文量