On the locality of keywords in Twitter streams

H. Abdelhaq, Michael Gertz
{"title":"On the locality of keywords in Twitter streams","authors":"H. Abdelhaq, Michael Gertz","doi":"10.1145/2676552.2676554","DOIUrl":null,"url":null,"abstract":"The continuously increasing popularity of social media sites such as Twitter and Facebook has recently led to a number of approaches to detect and extract event information from social media streams. Such events play an important role, e.g., in supporting location-based services and improving situational awareness. Moreover, the introduction of GPS-equipped communication devises has led to an increase in the percentage of geo-tagged messages. These help to detect localized events, i.e., events occurring at a certain location, such as sport events or accidents. The main entities that indicate a localized event are local keywords that exhibit a surge in usage at the event location. In this paper, we propose an approach to extract local keywords from a Twitter stream by (1) identifying local keywords, and (2) estimating the central location of each keyword. This extraction process is performed in an online fashion using a sliding window on the Twitter stream. In addition, we address the problem of spatial outliers that adversely affect a proper identification of local keywords. Outliers occur when people far away from an event location use related keywords in their Tweets. We handle this problem by adjusting the spatial distribution of keywords based on their co-occurrence with place names that may refer to the location of an event. We evaluate the performance of our framework to reliably and efficiently extracting local keywords and estimating their central locations using a Twitter dataset.","PeriodicalId":272840,"journal":{"name":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on GeoStreaming","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on GeoStreaming","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2676552.2676554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

The continuously increasing popularity of social media sites such as Twitter and Facebook has recently led to a number of approaches to detect and extract event information from social media streams. Such events play an important role, e.g., in supporting location-based services and improving situational awareness. Moreover, the introduction of GPS-equipped communication devises has led to an increase in the percentage of geo-tagged messages. These help to detect localized events, i.e., events occurring at a certain location, such as sport events or accidents. The main entities that indicate a localized event are local keywords that exhibit a surge in usage at the event location. In this paper, we propose an approach to extract local keywords from a Twitter stream by (1) identifying local keywords, and (2) estimating the central location of each keyword. This extraction process is performed in an online fashion using a sliding window on the Twitter stream. In addition, we address the problem of spatial outliers that adversely affect a proper identification of local keywords. Outliers occur when people far away from an event location use related keywords in their Tweets. We handle this problem by adjusting the spatial distribution of keywords based on their co-occurrence with place names that may refer to the location of an event. We evaluate the performance of our framework to reliably and efficiently extracting local keywords and estimating their central locations using a Twitter dataset.
论推特信息流中关键词的位置性
随着 Twitter 和 Facebook 等社交媒体网站的不断普及,最近出现了许多从社交媒体流中检测和提取事件信息的方法。这些事件在支持定位服务和提高态势感知等方面发挥着重要作用。此外,配备全球定位系统的通信设备的引入也导致了地理标记信息比例的增加。这些信息有助于检测本地化事件,即在某个地点发生的事件,如体育赛事或事故。表明本地化事件的主要实体是在事件发生地使用率激增的本地关键词。在本文中,我们提出了一种从 Twitter 信息流中提取本地关键词的方法,具体方法是:(1)识别本地关键词;(2)估计每个关键词的中心位置。这一提取过程是利用 Twitter 流上的滑动窗口以在线方式进行的。此外,我们还解决了空间异常值对正确识别本地关键词产生不利影响的问题。当远离事件地点的人在其推文中使用相关关键词时,就会出现离群值。我们根据关键词与地名的共现情况来调整关键词的空间分布,从而解决这一问题。我们使用 Twitter 数据集评估了我们的框架在可靠、高效地提取本地关键词并估计其中心位置方面的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信