Sem-TED: Semantic Twitter Event Detection and Adapting with News Stories

Zahra Akhgari, M. Malekimajd, H. Rahmani
{"title":"Sem-TED: Semantic Twitter Event Detection and Adapting with News Stories","authors":"Zahra Akhgari, M. Malekimajd, H. Rahmani","doi":"10.1109/ICWR54782.2022.9786234","DOIUrl":null,"url":null,"abstract":"Public acceptance of social networks has made the analysis of these networks essential. Event detection in these networks including Twitter is one of the most momentous subjects in the field of natural language processing and text mining. In this paper, we investigated how to link popular social media topics and news stories using transformer models and neural networks. Accordingly, this study consists of two parts: First, detecting popular topics and, second, linking them to the news. Event detection techniques have been applied to detect popular topics, while an event detection method comprises text preprocessing, text embedding using Sentence Transformer, dimension reduction using the UMAP algorithm, and grouping them using the HDBSCAN algorithm. To examine relevance or non-relevance between the news and topics, a single-layer perceptron neural network is applied, in which the output of the model indicates relevance or nonrelevance. We have implemented the mentioned parts and have investigated them on a small sampling of two known datasets. The evaluation outcomes reveal that the first part leads to an average improvement of 8% compared to the entity-based methods. Moreover, the results of the second part demonstrate that the used neural network in this study has a better performance comparing several other methods.","PeriodicalId":355187,"journal":{"name":"2022 8th International Conference on Web Research (ICWR)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR54782.2022.9786234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Public acceptance of social networks has made the analysis of these networks essential. Event detection in these networks including Twitter is one of the most momentous subjects in the field of natural language processing and text mining. In this paper, we investigated how to link popular social media topics and news stories using transformer models and neural networks. Accordingly, this study consists of two parts: First, detecting popular topics and, second, linking them to the news. Event detection techniques have been applied to detect popular topics, while an event detection method comprises text preprocessing, text embedding using Sentence Transformer, dimension reduction using the UMAP algorithm, and grouping them using the HDBSCAN algorithm. To examine relevance or non-relevance between the news and topics, a single-layer perceptron neural network is applied, in which the output of the model indicates relevance or nonrelevance. We have implemented the mentioned parts and have investigated them on a small sampling of two known datasets. The evaluation outcomes reveal that the first part leads to an average improvement of 8% compared to the entity-based methods. Moreover, the results of the second part demonstrate that the used neural network in this study has a better performance comparing several other methods.
语义Twitter事件检测和适应新闻故事
公众对社交网络的接受使得对这些网络的分析变得至关重要。包括Twitter在内的这些网络的事件检测是自然语言处理和文本挖掘领域最重要的课题之一。在本文中,我们研究了如何使用变压器模型和神经网络将流行的社交媒体话题和新闻故事联系起来。因此,本研究包括两个部分:第一,发现热门话题,第二,将其与新闻联系起来。事件检测技术已被应用于流行话题的检测,而事件检测方法包括文本预处理、使用Sentence Transformer的文本嵌入、使用UMAP算法的降维以及使用HDBSCAN算法的分组。为了检查新闻和主题之间的相关性或非相关性,应用了单层感知器神经网络,其中模型的输出表明相关性或非相关性。我们已经实现了上述部分,并在两个已知数据集的小样本上进行了调查。评估结果显示,与基于实体的方法相比,第一部分导致平均8%的改进。此外,第二部分的结果表明,与其他几种方法相比,本研究中使用的神经网络具有更好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信