Hao Luo, Guixiang Cheng, Zhongying Deng, Haiyang Chi, Xin Yan
{"title":"文本信息流融合与信息传播规模预测的全局视角","authors":"Hao Luo, Guixiang Cheng, Zhongying Deng, Haiyang Chi, Xin Yan","doi":"10.1111/exsy.70089","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Accurately predicting the size of information dissemination has important theoretical and practical significance for formulating content distribution strategies, optimising network resource allocation and conducting effective public opinion management. Current research on the cascade growth size of social media information dissemination mostly focuses on network structure and user behaviour analysis. Still, it neglects the crucial role of textual information in driving information dissemination. We propose a deep learning framework called CasText, which integrates multisource features such as text information, global propagation graphs and local propagation structures to more accurately predict the size of information propagation. Using Sentence-BERT to extract deep semantic features of text and combining it with GNN, precise capture of the interaction between text information and cascading structures has been achieved; using DeepWalk to view the entire social network as a complex graphic structure, high-dimensional feature representations of each social media user can be automatically learned. This global perspective helps to reveal broader dissemination patterns and potential influence paths, thereby improving the accuracy of predicting the size of future information dissemination. In multiple comparative experiments based on a real Weibo cascaded text retweeting dataset, the CasText model improved the MSLE index by 3.1% compared to the baseline model, significantly demonstrating the effectiveness of multisource feature fusion in predicting information dissemination size. We further confirmed the importance of text information, global propagation graphs and local propagation embeddings in improving model performance through ablation experiments.</p>\n </div>","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 8","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CasText: Fusion of Text Information Flow and Global Perspective for Predicting the Size of Information Dissemination\",\"authors\":\"Hao Luo, Guixiang Cheng, Zhongying Deng, Haiyang Chi, Xin Yan\",\"doi\":\"10.1111/exsy.70089\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Accurately predicting the size of information dissemination has important theoretical and practical significance for formulating content distribution strategies, optimising network resource allocation and conducting effective public opinion management. Current research on the cascade growth size of social media information dissemination mostly focuses on network structure and user behaviour analysis. Still, it neglects the crucial role of textual information in driving information dissemination. We propose a deep learning framework called CasText, which integrates multisource features such as text information, global propagation graphs and local propagation structures to more accurately predict the size of information propagation. Using Sentence-BERT to extract deep semantic features of text and combining it with GNN, precise capture of the interaction between text information and cascading structures has been achieved; using DeepWalk to view the entire social network as a complex graphic structure, high-dimensional feature representations of each social media user can be automatically learned. This global perspective helps to reveal broader dissemination patterns and potential influence paths, thereby improving the accuracy of predicting the size of future information dissemination. In multiple comparative experiments based on a real Weibo cascaded text retweeting dataset, the CasText model improved the MSLE index by 3.1% compared to the baseline model, significantly demonstrating the effectiveness of multisource feature fusion in predicting information dissemination size. We further confirmed the importance of text information, global propagation graphs and local propagation embeddings in improving model performance through ablation experiments.</p>\\n </div>\",\"PeriodicalId\":51053,\"journal\":{\"name\":\"Expert Systems\",\"volume\":\"42 8\",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70089\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70089","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
CasText: Fusion of Text Information Flow and Global Perspective for Predicting the Size of Information Dissemination
Accurately predicting the size of information dissemination has important theoretical and practical significance for formulating content distribution strategies, optimising network resource allocation and conducting effective public opinion management. Current research on the cascade growth size of social media information dissemination mostly focuses on network structure and user behaviour analysis. Still, it neglects the crucial role of textual information in driving information dissemination. We propose a deep learning framework called CasText, which integrates multisource features such as text information, global propagation graphs and local propagation structures to more accurately predict the size of information propagation. Using Sentence-BERT to extract deep semantic features of text and combining it with GNN, precise capture of the interaction between text information and cascading structures has been achieved; using DeepWalk to view the entire social network as a complex graphic structure, high-dimensional feature representations of each social media user can be automatically learned. This global perspective helps to reveal broader dissemination patterns and potential influence paths, thereby improving the accuracy of predicting the size of future information dissemination. In multiple comparative experiments based on a real Weibo cascaded text retweeting dataset, the CasText model improved the MSLE index by 3.1% compared to the baseline model, significantly demonstrating the effectiveness of multisource feature fusion in predicting information dissemination size. We further confirmed the importance of text information, global propagation graphs and local propagation embeddings in improving model performance through ablation experiments.
期刊介绍:
Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper.
As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.