发现和早期预测社交媒体紧急信息的流行演变模式

IF 2.4 3区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Delin Yuan, Yang Li
{"title":"发现和早期预测社交媒体紧急信息的流行演变模式","authors":"Delin Yuan, Yang Li","doi":"10.1108/ajim-10-2023-0450","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.</p><!--/ Abstract__block -->\n<h3>Design/methodology/approach</h3>\n<p>We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.</p><!--/ Abstract__block -->\n<h3>Findings</h3>\n<p>The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.</p><!--/ Abstract__block -->\n<h3>Originality/value</h3>\n<p>This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.</p><!--/ Abstract__block -->","PeriodicalId":53152,"journal":{"name":"Aslib Journal of Information Management","volume":"72 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discovering and early predicting popularity evolution patterns of social media emergency information\",\"authors\":\"Delin Yuan, Yang Li\",\"doi\":\"10.1108/ajim-10-2023-0450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3>Purpose</h3>\\n<p>When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.</p><!--/ Abstract__block -->\\n<h3>Design/methodology/approach</h3>\\n<p>We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.</p><!--/ Abstract__block -->\\n<h3>Findings</h3>\\n<p>The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.</p><!--/ Abstract__block -->\\n<h3>Originality/value</h3>\\n<p>This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.</p><!--/ Abstract__block -->\",\"PeriodicalId\":53152,\"journal\":{\"name\":\"Aslib Journal of Information Management\",\"volume\":\"72 1\",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aslib Journal of Information Management\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://doi.org/10.1108/ajim-10-2023-0450\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aslib Journal of Information Management","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/ajim-10-2023-0450","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

目的当突发事件发生时,特定时间段内公众对社交媒体上突发事件信息的关注度会形成突发事件信息的流行度演变规律。我们收集了新浪微博平台上与 COVID-19 疫情相关的数据,并应用 K-Shape 聚类算法识别出五种不同的突发事件信息流行度演变模式。这些模式包括强双峰、弱双峰、短时单峰、慢升温单峰和慢衰减单峰。面向早期监测和预警,我们开发了一套综合特征系统,包括发布者特征、信息特征和早期特征。在早期特征中,数据测量是在紧急信息发布后 1 小时的时间窗口内进行的。考虑到实时响应和分析速度,我们采用了经典的机器学习方法来预测相关模式。结果最佳预测模型和随机森林(RF)的综合预测结果表明其性能令人印象深刻,精确度、召回率和 F1 分数均达到 88%。此外,每个模式预测的 F1 值都超过了 87%。特征重要性分析结果表明,早期特征对模式预测的贡献最大,其次是信息特征和发布者特征。原创性/价值 本研究从时间维度揭示了社交媒体突发事件信息流行度的增长与下降、出现与消失的现象和特殊规律,找出了社交媒体突发事件信息流行度演变的模式。同时,对相关规律进行早期预测,探究其背后的作用因素。这些发现有助于社会媒体突发事件信息发布策略的制定、网络舆论引导和风险监测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Discovering and early predicting popularity evolution patterns of social media emergency information

Purpose

When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.

Design/methodology/approach

We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.

Findings

The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.

Originality/value

This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Aslib Journal of Information Management
Aslib Journal of Information Management COMPUTER SCIENCE, INFORMATION SYSTEMS-
CiteScore
5.30
自引率
19.20%
发文量
79
期刊介绍: Aslib Journal of Information Management covers a broad range of issues in the field, including economic, behavioural, social, ethical, technological, international, business-related, political and management-orientated factors. Contributors are encouraged to spell out the practical implications of their work. Aslib Journal of Information Management Areas of interest include topics such as social media, data protection, search engines, information retrieval, digital libraries, information behaviour, intellectual property and copyright, information industry, digital repositories and information policy and governance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信