{"title":"Discovering and early predicting popularity evolution patterns of social media emergency information","authors":"Delin Yuan, Yang Li","doi":"10.1108/ajim-10-2023-0450","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.</p><!--/ Abstract__block -->\n<h3>Design/methodology/approach</h3>\n<p>We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.</p><!--/ Abstract__block -->\n<h3>Findings</h3>\n<p>The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.</p><!--/ Abstract__block -->\n<h3>Originality/value</h3>\n<p>This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.</p><!--/ Abstract__block -->","PeriodicalId":53152,"journal":{"name":"Aslib Journal of Information Management","volume":"72 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aslib Journal of Information Management","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/ajim-10-2023-0450","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
When emergencies occur, the attention of the public towards emergency information on social media in a specific time period forms the emergency information popularity evolution patterns. The purpose of this study is to discover the popularity evolution patterns of social media emergency information and make early predictions.
Design/methodology/approach
We collected the data related to the COVID-19 epidemic on the Sina Weibo platform and applied the K-Shape clustering algorithm to identify five distinct patterns of emergency information popularity evolution patterns. These patterns include strong twin peaks, weak twin peaks, short-lived single peak, slow-to-warm-up single peak and slow-to-decay single peak. Oriented toward early monitoring and warning, we developed a comprehensive characteristic system that incorporates publisher features, information features and early features. In the early features, data measurements are taken within a 1-h time window after the release of emergency information. Considering real-time response and analysis speed, we employed classical machine learning methods to predict the relevant patterns. Multiple classification models were trained and evaluated for this purpose.
Findings
The combined prediction results of the best prediction model and random forest (RF) demonstrate impressive performance, with precision, recall and F1-score reaching 88%. Moreover, the F1 value for each pattern prediction surpasses 87%. The results of the feature importance analysis show that the early features contribute the most to the pattern prediction, followed by the information features and publisher features. Among them, the release time in the information features exhibits the most substantial contribution to the prediction outcome.
Originality/value
This study reveals the phenomena and special patterns of growth and decline, appearance and disappearance of social media emergency information popularity from the time dimension and identifies the patterns of social media emergency information popularity evolution. Meanwhile, early prediction of related patterns is made to explore the role factors behind them. These findings contribute to the formulation of social media emergency information release strategies, online public opinion guidance and risk monitoring.
期刊介绍:
Aslib Journal of Information Management covers a broad range of issues in the field, including economic, behavioural, social, ethical, technological, international, business-related, political and management-orientated factors. Contributors are encouraged to spell out the practical implications of their work. Aslib Journal of Information Management Areas of interest include topics such as social media, data protection, search engines, information retrieval, digital libraries, information behaviour, intellectual property and copyright, information industry, digital repositories and information policy and governance.