{"title":"\"Towards Re-Inventing Psychohistory\": Predicting the Popularity of Tomorrow's News from Yesterday's Twitter and News Feeds.","authors":"Jiachen Sun, Peter Gloor","doi":"10.1007/s11518-020-5470-4","DOIUrl":null,"url":null,"abstract":"<p><p>Rapid advances in machine learning combined with wide availability of online social media have created considerable research activity in predicting what might be the news of tomorrow based on an analysis of the past. In this work, we present a deep learning forecasting framework which is capable to predict tomorrow's news topics on Twitter and news feeds based on yesterday's content and topic-interaction features. The proposed framework starts by generating topics from words using word embeddings and K-means clustering. Then temporal topic-networks are constructed where two topics are linked if the same user has worked on both topics. Structural and dynamic metrics calculated from networks along with content features and past activity, are used as input of a long short-term memory (LSTM) model, which predicts the number of mentions of a specific topic on the subsequent day. Utilizing dependencies among topics, our experiments on two Twitter datasets and the HuffPost news dataset demonstrate that selecting a topic's historical local neighbors in the topic-network as extra features greatly improves the prediction accuracy and outperforms existing baselines.</p>","PeriodicalId":17150,"journal":{"name":"Journal of Systems Science and Systems Engineering","volume":"30 1","pages":"85-104"},"PeriodicalIF":1.7000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s11518-020-5470-4","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems Science and Systems Engineering","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1007/s11518-020-5470-4","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/11/17 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Rapid advances in machine learning combined with wide availability of online social media have created considerable research activity in predicting what might be the news of tomorrow based on an analysis of the past. In this work, we present a deep learning forecasting framework which is capable to predict tomorrow's news topics on Twitter and news feeds based on yesterday's content and topic-interaction features. The proposed framework starts by generating topics from words using word embeddings and K-means clustering. Then temporal topic-networks are constructed where two topics are linked if the same user has worked on both topics. Structural and dynamic metrics calculated from networks along with content features and past activity, are used as input of a long short-term memory (LSTM) model, which predicts the number of mentions of a specific topic on the subsequent day. Utilizing dependencies among topics, our experiments on two Twitter datasets and the HuffPost news dataset demonstrate that selecting a topic's historical local neighbors in the topic-network as extra features greatly improves the prediction accuracy and outperforms existing baselines.
期刊介绍:
Journal of Systems Science and Systems Engineering is an international journal published bimonthly. It aims to foster new thinking and research, to help decision makers to understand the mechanism and complexity of economic, engineering, management, social and technological systems, and learn new developments in theory and practice that could help to improve the performance of systems.
The Journal publishes papers that address the theory, methodology and applications relating to systems science and systems engineering; applications and practical experience of systems engineering in various fields of industry, agriculture, service sector, environment, finance, operating management, E-commerce, logistics, information systems. Technical notes solving practical problems and reviews are also welcome.