Yejin Park , Seonkyu Lim , Changdai Gu , Arida Ferti Syafiandini , Min Song
{"title":"Forecasting topic trends of blockchain utilizing topic modeling and deep learning-based time-series prediction on different document types","authors":"Yejin Park , Seonkyu Lim , Changdai Gu , Arida Ferti Syafiandini , Min Song","doi":"10.1016/j.joi.2025.101639","DOIUrl":null,"url":null,"abstract":"<div><div>Topic trends in rapidly evolving domains like blockchain are dynamic and pose prediction challenges. To address this, we propose a novel framework that integrates topic modeling, clustering, and time-series deep learning models. These models include both non-graph-based and graph-based approaches. Blockchain-related documents of three types—academic papers, patents, and news articles—are collected and preprocessed. Random and topic subgraphs are constructed as inputs for model training and forecasting across various time epochs. The four models (LSTM, GRU, AGCRN, and A3T-GCN) are trained on random subgraphs, and the trained models forecast topic trends using topic subgraphs. We also analyze the distinctive characteristics of each document type and investigate the causal relationships between them. The results indicate that non-graph-based models, such as LSTM, perform better on periodic data like academic papers, whereas graph-based models, such as AGCRN and A3T-GCN, excel at capturing non-periodic patterns in patents and news articles. Our framework demonstrates robust performance, offering a versatile tool for blockchain-related trend analysis and forecasting. The code and environments are available at <span><span>https://github.com/textmining-org/topic-forecasting</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":"19 2","pages":"Article 101639"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Informetrics","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1751157725000033","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Topic trends in rapidly evolving domains like blockchain are dynamic and pose prediction challenges. To address this, we propose a novel framework that integrates topic modeling, clustering, and time-series deep learning models. These models include both non-graph-based and graph-based approaches. Blockchain-related documents of three types—academic papers, patents, and news articles—are collected and preprocessed. Random and topic subgraphs are constructed as inputs for model training and forecasting across various time epochs. The four models (LSTM, GRU, AGCRN, and A3T-GCN) are trained on random subgraphs, and the trained models forecast topic trends using topic subgraphs. We also analyze the distinctive characteristics of each document type and investigate the causal relationships between them. The results indicate that non-graph-based models, such as LSTM, perform better on periodic data like academic papers, whereas graph-based models, such as AGCRN and A3T-GCN, excel at capturing non-periodic patterns in patents and news articles. Our framework demonstrates robust performance, offering a versatile tool for blockchain-related trend analysis and forecasting. The code and environments are available at https://github.com/textmining-org/topic-forecasting.
期刊介绍:
Journal of Informetrics (JOI) publishes rigorous high-quality research on quantitative aspects of information science. The main focus of the journal is on topics in bibliometrics, scientometrics, webometrics, patentometrics, altmetrics and research evaluation. Contributions studying informetric problems using methods from other quantitative fields, such as mathematics, statistics, computer science, economics and econometrics, and network science, are especially encouraged. JOI publishes both theoretical and empirical work. In general, case studies, for instance a bibliometric analysis focusing on a specific research field or a specific country, are not considered suitable for publication in JOI, unless they contain innovative methodological elements.