{"title":"发现和早期预测社交媒体紧急信息的流行演变模式","authors":"Delin Yuan, Yang Li","doi":"10.1108/ajim-10-2023-0450","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.</p><!--/ Abstract__block -->\n<h3>Design/methodology/approach</h3>\n<p>We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.</p><!--/ Abstract__block -->\n<h3>Findings</h3>\n<p>The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.</p><!--/ Abstract__block -->\n<h3>Originality/value</h3>\n<p>This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.</p><!--/ Abstract__block -->","PeriodicalId":53152,"journal":{"name":"Aslib Journal of Information Management","volume":"72 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discovering and early predicting popularity evolution patterns of social media emergency information\",\"authors\":\"Delin Yuan, Yang Li\",\"doi\":\"10.1108/ajim-10-2023-0450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3>Purpose</h3>\\n<p>When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.</p><!--/ Abstract__block -->\\n<h3>Design/methodology/approach</h3>\\n<p>We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.</p><!--/ Abstract__block -->\\n<h3>Findings</h3>\\n<p>The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.</p><!--/ Abstract__block -->\\n<h3>Originality/value</h3>\\n<p>This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.</p><!--/ Abstract__block -->\",\"PeriodicalId\":53152,\"journal\":{\"name\":\"Aslib Journal of Information Management\",\"volume\":\"72 1\",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aslib Journal of Information Management\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://doi.org/10.1108/ajim-10-2023-0450\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aslib Journal of Information Management","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/ajim-10-2023-0450","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
目的当突发事件发生时,特定时间段内公众对社交媒体上突发事件信息的关注度会形成突发事件信息的流行度演变规律。我们收集了新浪微博平台上与 COVID-19 疫情相关的数据,并应用 K-Shape 聚类算法识别出五种不同的突发事件信息流行度演变模式。这些模式包括强双峰、弱双峰、短时单峰、慢升温单峰和慢衰减单峰。面向早期监测和预警,我们开发了一套综合特征系统,包括发布者特征、信息特征和早期特征。在早期特征中,数据测量是在紧急信息发布后 1 小时的时间窗口内进行的。考虑到实时响应和分析速度,我们采用了经典的机器学习方法来预测相关模式。结果最佳预测模型和随机森林(RF)的综合预测结果表明其性能令人印象深刻,精确度、召回率和 F1 分数均达到 88%。此外,每个模式预测的 F1 值都超过了 87%。特征重要性分析结果表明,早期特征对模式预测的贡献最大,其次是信息特征和发布者特征。原创性/价值 本研究从时间维度揭示了社交媒体突发事件信息流行度的增长与下降、出现与消失的现象和特殊规律,找出了社交媒体突发事件信息流行度演变的模式。同时,对相关规律进行早期预测,探究其背后的作用因素。这些发现有助于社会媒体突发事件信息发布策略的制定、网络舆论引导和风险监测。
Discovering and early predicting popularity evolution patterns of social media emergency information
Purpose
When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.
Design/methodology/approach
We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.
Findings
The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.
Originality/value
This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.
期刊介绍:
Aslib Journal of Information Management covers a broad range of issues in the field, including economic, behavioural, social, ethical, technological, international, business-related, political and management-orientated factors. Contributors are encouraged to spell out the practical implications of their work. Aslib Journal of Information Management Areas of interest include topics such as social media, data protection, search engines, information retrieval, digital libraries, information behaviour, intellectual property and copyright, information industry, digital repositories and information policy and governance.