Citation Count Prediction Using Different Time Series Analysis Models

Priyam Porwal, M. Devare
{"title":"Citation Count Prediction Using Different Time Series Analysis Models","authors":"Priyam Porwal, M. Devare","doi":"10.1109/IBSSC56953.2022.10037553","DOIUrl":null,"url":null,"abstract":"The paper helps to predict the future citation value of a fresh dataset of research papers by considering the past values of the citation count of paper using univariate time series analysis models and evaluating their performance through various evaluation metrics. It is important to predict future citation count as it helps to assess researcher's achievements, promotions, fund allocation, etc. This research is in addition to past research where for prediction, different parameters like content of paper, author details, venue impact etc. were considered. The real and original data for the dataset was extracted from the Google Scholar profile of top ranked authors. Three models of time series, Autoregressive Integrated moving average(ARIMA), Simple exponential smoothing (SES), and Holt winter's exponential Smoothing (HWES) are applied to observe the result variations. The models obtained error metric values for the complete dataset. All four-evaluation metrics were calculated. The best results for the predictions for citation count were obtained from the Simple exponential smoothing and Holt winter's exponential Smoothing models, whose values were almost the same for all evaluation metrics because of almost no change in formula. Among all fourerror metrics mentioned in the design, MASE gave sensible results, with almost all values being less than 1. The results showed similar graphs for both Simple exponential smoothing and Holt winter's exponential smoothing models for actual and predicted values of citation count as there is negligible difference in formula.","PeriodicalId":426897,"journal":{"name":"2022 IEEE Bombay Section Signature Conference (IBSSC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC56953.2022.10037553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The paper helps to predict the future citation value of a fresh dataset of research papers by considering the past values of the citation count of paper using univariate time series analysis models and evaluating their performance through various evaluation metrics. It is important to predict future citation count as it helps to assess researcher's achievements, promotions, fund allocation, etc. This research is in addition to past research where for prediction, different parameters like content of paper, author details, venue impact etc. were considered. The real and original data for the dataset was extracted from the Google Scholar profile of top ranked authors. Three models of time series, Autoregressive Integrated moving average(ARIMA), Simple exponential smoothing (SES), and Holt winter's exponential Smoothing (HWES) are applied to observe the result variations. The models obtained error metric values for the complete dataset. All four-evaluation metrics were calculated. The best results for the predictions for citation count were obtained from the Simple exponential smoothing and Holt winter's exponential Smoothing models, whose values were almost the same for all evaluation metrics because of almost no change in formula. Among all fourerror metrics mentioned in the design, MASE gave sensible results, with almost all values being less than 1. The results showed similar graphs for both Simple exponential smoothing and Holt winter's exponential smoothing models for actual and predicted values of citation count as there is negligible difference in formula.
利用不同时间序列分析模型预测引文数量
本文利用单变量时间序列分析模型考虑论文被引次数的过去值,并通过各种评价指标对其表现进行评价,从而预测新研究论文数据集的未来被引价值。预测未来的引文数量对评估科研人员的科研成果、晋升、经费分配等具有重要意义。本研究是对以往研究的补充,在以往的研究中,为了进行预测,考虑了不同的参数,如论文内容、作者详细信息、场地影响等。该数据集的真实和原始数据是从谷歌学术排名靠前的作者的个人资料中提取的。采用自回归综合移动平均(ARIMA)、简单指数平滑(SES)和Holt winter指数平滑(hes)三种时间序列模型观察结果的变化。模型得到了完整数据集的误差度量值。计算所有四个评价指标。简单指数平滑模型和Holt winter的指数平滑模型对引文数的预测效果最好,由于公式几乎没有变化,所以对所有评价指标的预测结果基本一致。在设计中提到的四个误差指标中,MASE给出了合理的结果,几乎所有的值都小于1。结果表明,简单指数平滑模型和Holt winter的指数平滑模型对引文计数的实际值和预测值的图相似,公式差异可以忽略不计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信