“走向重新发明心理历史”:从昨天的推特和新闻源预测明天新闻的受欢迎程度。

IF 1.7 4区 管理学 Q3 OPERATIONS RESEARCH & MANAGEMENT SCIENCE
Jiachen Sun, Peter Gloor
{"title":"“走向重新发明心理历史”:从昨天的推特和新闻源预测明天新闻的受欢迎程度。","authors":"Jiachen Sun,&nbsp;Peter Gloor","doi":"10.1007/s11518-020-5470-4","DOIUrl":null,"url":null,"abstract":"<p><p>Rapid advances in machine learning combined with wide availability of online social media have created considerable research activity in predicting what might be the news of tomorrow based on an analysis of the past. In this work, we present a deep learning forecasting framework which is capable to predict tomorrow's news topics on Twitter and news feeds based on yesterday's content and topic-interaction features. The proposed framework starts by generating topics from words using word embeddings and K-means clustering. Then temporal topic-networks are constructed where two topics are linked if the same user has worked on both topics. Structural and dynamic metrics calculated from networks along with content features and past activity, are used as input of a long short-term memory (LSTM) model, which predicts the number of mentions of a specific topic on the subsequent day. Utilizing dependencies among topics, our experiments on two Twitter datasets and the HuffPost news dataset demonstrate that selecting a topic's historical local neighbors in the topic-network as extra features greatly improves the prediction accuracy and outperforms existing baselines.</p>","PeriodicalId":17150,"journal":{"name":"Journal of Systems Science and Systems Engineering","volume":"30 1","pages":"85-104"},"PeriodicalIF":1.7000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s11518-020-5470-4","citationCount":"0","resultStr":"{\"title\":\"\\\"Towards Re-Inventing Psychohistory\\\": Predicting the Popularity of Tomorrow's News from Yesterday's Twitter and News Feeds.\",\"authors\":\"Jiachen Sun,&nbsp;Peter Gloor\",\"doi\":\"10.1007/s11518-020-5470-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Rapid advances in machine learning combined with wide availability of online social media have created considerable research activity in predicting what might be the news of tomorrow based on an analysis of the past. In this work, we present a deep learning forecasting framework which is capable to predict tomorrow's news topics on Twitter and news feeds based on yesterday's content and topic-interaction features. The proposed framework starts by generating topics from words using word embeddings and K-means clustering. Then temporal topic-networks are constructed where two topics are linked if the same user has worked on both topics. Structural and dynamic metrics calculated from networks along with content features and past activity, are used as input of a long short-term memory (LSTM) model, which predicts the number of mentions of a specific topic on the subsequent day. Utilizing dependencies among topics, our experiments on two Twitter datasets and the HuffPost news dataset demonstrate that selecting a topic's historical local neighbors in the topic-network as extra features greatly improves the prediction accuracy and outperforms existing baselines.</p>\",\"PeriodicalId\":17150,\"journal\":{\"name\":\"Journal of Systems Science and Systems Engineering\",\"volume\":\"30 1\",\"pages\":\"85-104\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1007/s11518-020-5470-4\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Systems Science and Systems Engineering\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://doi.org/10.1007/s11518-020-5470-4\",\"RegionNum\":4,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/11/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"OPERATIONS RESEARCH & MANAGEMENT SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems Science and Systems Engineering","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1007/s11518-020-5470-4","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/11/17 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
引用次数: 0

摘要

机器学习的快速发展与广泛可用的在线社交媒体相结合,创造了大量的研究活动,根据对过去的分析来预测明天可能发生的新闻。在这项工作中,我们提出了一个深度学习预测框架,该框架能够根据昨天的内容和主题交互特征预测Twitter和新闻提要上明天的新闻主题。该框架首先使用词嵌入和K-means聚类从词中生成主题。然后构建时间主题网络,如果同一用户在两个主题上工作,则将两个主题链接在一起。从网络中计算出的结构和动态指标以及内容特征和过去的活动,被用作长短期记忆(LSTM)模型的输入,该模型预测了第二天特定主题的提及次数。利用主题之间的依赖关系,我们在两个Twitter数据集和HuffPost新闻数据集上的实验表明,在主题网络中选择主题的历史本地邻居作为额外的特征大大提高了预测精度,并且优于现有的基线。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
"Towards Re-Inventing Psychohistory": Predicting the Popularity of Tomorrow's News from Yesterday's Twitter and News Feeds.

Rapid advances in machine learning combined with wide availability of online social media have created considerable research activity in predicting what might be the news of tomorrow based on an analysis of the past. In this work, we present a deep learning forecasting framework which is capable to predict tomorrow's news topics on Twitter and news feeds based on yesterday's content and topic-interaction features. The proposed framework starts by generating topics from words using word embeddings and K-means clustering. Then temporal topic-networks are constructed where two topics are linked if the same user has worked on both topics. Structural and dynamic metrics calculated from networks along with content features and past activity, are used as input of a long short-term memory (LSTM) model, which predicts the number of mentions of a specific topic on the subsequent day. Utilizing dependencies among topics, our experiments on two Twitter datasets and the HuffPost news dataset demonstrate that selecting a topic's historical local neighbors in the topic-network as extra features greatly improves the prediction accuracy and outperforms existing baselines.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Systems Science and Systems Engineering
Journal of Systems Science and Systems Engineering 管理科学-运筹学与管理科学
CiteScore
2.70
自引率
16.70%
发文量
23
审稿时长
>12 weeks
期刊介绍: Journal of Systems Science and Systems Engineering is an international journal published bimonthly. It aims to foster new thinking and research, to help decision makers to understand the mechanism and complexity of economic, engineering, management, social and technological systems, and learn new developments in theory and practice that could help to improve the performance of systems. The Journal publishes papers that address the theory, methodology and applications relating to systems science and systems engineering; applications and practical experience of systems engineering in various fields of industry, agriculture, service sector, environment, finance, operating management, E-commerce, logistics, information systems. Technical notes solving practical problems and reviews are also welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信