统计学中影响引文数量的因素建模

Olcay Alpay, N. Danacıoğlu, E. Çankaya
{"title":"统计学中影响引文数量的因素建模","authors":"Olcay Alpay, N. Danacıoğlu, E. Çankaya","doi":"10.21541/apjess.1075099","DOIUrl":null,"url":null,"abstract":"Citation is considered as the most popular quality assessment metric for scientific papers, and it is thus important to determine what factors promote the citation count of a paper in comparison to the others in the same field. The main aim of this study is to model the citation counts of the research published in SCI or SCI-Expanded journals of Statistics field with the growing number of scientific works in Turkey. It is well known that the right-skewed nature of the counts makes the classical regression modelling inappropriate, even if the log transformation of counts is applied [1]. Due to the fact that distribution of citation counts involves a great number of zeros, this study serves for an additional aim that is to model the counts with advanced discrete regression models for a more precise prediction [2]. Data collected for this study consist of the citation counts of all scientific papers produced by 261 Statisticians in between 2005-2017. Discrete models varying from Poisson to Zero-Inflated or Hurdle were constructed by possible influential factors, such as the publication age, the number of references, the journal category etc. Predictive performances of alternative discrete models were compared via AIC and Vuong test [3]. Results suggested that Zero Inflated Negative Binomial and Hurdle Negative Binomial mixture models are the best forms to predict the zero inflation of citation counts [4]. In addition, the influential factors of the final model were interpreted to make some suggestions to Statisticians to increase the citation counts of their work.","PeriodicalId":234411,"journal":{"name":"Academic Platform Journal of Engineering and Smart Systems","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Modelling of Factors Influencing the Citation Counts in Statistics\",\"authors\":\"Olcay Alpay, N. Danacıoğlu, E. Çankaya\",\"doi\":\"10.21541/apjess.1075099\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Citation is considered as the most popular quality assessment metric for scientific papers, and it is thus important to determine what factors promote the citation count of a paper in comparison to the others in the same field. The main aim of this study is to model the citation counts of the research published in SCI or SCI-Expanded journals of Statistics field with the growing number of scientific works in Turkey. It is well known that the right-skewed nature of the counts makes the classical regression modelling inappropriate, even if the log transformation of counts is applied [1]. Due to the fact that distribution of citation counts involves a great number of zeros, this study serves for an additional aim that is to model the counts with advanced discrete regression models for a more precise prediction [2]. Data collected for this study consist of the citation counts of all scientific papers produced by 261 Statisticians in between 2005-2017. Discrete models varying from Poisson to Zero-Inflated or Hurdle were constructed by possible influential factors, such as the publication age, the number of references, the journal category etc. Predictive performances of alternative discrete models were compared via AIC and Vuong test [3]. Results suggested that Zero Inflated Negative Binomial and Hurdle Negative Binomial mixture models are the best forms to predict the zero inflation of citation counts [4]. In addition, the influential factors of the final model were interpreted to make some suggestions to Statisticians to increase the citation counts of their work.\",\"PeriodicalId\":234411,\"journal\":{\"name\":\"Academic Platform Journal of Engineering and Smart Systems\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Academic Platform Journal of Engineering and Smart Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21541/apjess.1075099\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Academic Platform Journal of Engineering and Smart Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21541/apjess.1075099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

引文被认为是科学论文最受欢迎的质量评估指标,因此确定哪些因素促进了一篇论文的引文数量与同一领域的其他论文相比是很重要的。本研究的主要目的是模拟随着土耳其科学著作数量的增加,在SCI或SCI扩展的统计领域期刊上发表的研究的引用计数。众所周知,即使应用计数的对数变换,计数的右偏斜性质也使得经典回归建模不合适[1]。由于引文计数的分布涉及大量的零,本研究还有一个额外的目的,即使用先进的离散回归模型对引文计数进行建模,以获得更精确的预测[2]。本研究收集的数据包括2005-2017年间261名统计学家发表的所有科学论文的引用计数。根据可能的影响因素,如出版年龄、参考文献数量、期刊类别等,构建了从泊松到零膨胀或障碍的离散模型。通过AIC和Vuong检验比较不同离散模型的预测性能[3]。结果表明,零膨胀负二项和障碍负二项混合模型是预测引文零膨胀的最佳形式[4]。此外,对最终模型的影响因素进行了解释,为统计学家提高被引次数提出了建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Modelling of Factors Influencing the Citation Counts in Statistics
Citation is considered as the most popular quality assessment metric for scientific papers, and it is thus important to determine what factors promote the citation count of a paper in comparison to the others in the same field. The main aim of this study is to model the citation counts of the research published in SCI or SCI-Expanded journals of Statistics field with the growing number of scientific works in Turkey. It is well known that the right-skewed nature of the counts makes the classical regression modelling inappropriate, even if the log transformation of counts is applied [1]. Due to the fact that distribution of citation counts involves a great number of zeros, this study serves for an additional aim that is to model the counts with advanced discrete regression models for a more precise prediction [2]. Data collected for this study consist of the citation counts of all scientific papers produced by 261 Statisticians in between 2005-2017. Discrete models varying from Poisson to Zero-Inflated or Hurdle were constructed by possible influential factors, such as the publication age, the number of references, the journal category etc. Predictive performances of alternative discrete models were compared via AIC and Vuong test [3]. Results suggested that Zero Inflated Negative Binomial and Hurdle Negative Binomial mixture models are the best forms to predict the zero inflation of citation counts [4]. In addition, the influential factors of the final model were interpreted to make some suggestions to Statisticians to increase the citation counts of their work.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信