What predicts citation counts and translational impact in headache research? A machine learning analysis

IF 5 2区 医学 Q1 CLINICAL NEUROLOGY
Antonios Danelakis, Helge Langseth, Parashkev Nachev, Amy Nelson, Marte-Helene Bjørk, Manjit S. Matharu, Erling Tronvik, Arne May, Anker Stubberud
{"title":"What predicts citation counts and translational impact in headache research? A machine learning analysis","authors":"Antonios Danelakis, Helge Langseth, Parashkev Nachev, Amy Nelson, Marte-Helene Bjørk, Manjit S. Matharu, Erling Tronvik, Arne May, Anker Stubberud","doi":"10.1177/03331024241251488","DOIUrl":null,"url":null,"abstract":"BackgroundWe aimed to develop the first machine learning models to predict citation counts and the translational impact, defined as inclusion in guidelines or policy documents, of headache research, and assess which factors are most predictive.MethodsBibliometric data and the titles, abstracts, and keywords from 8600 publications in three headache-oriented journals from their inception to 31 December 2017 were used. A series of machine learning models were implemented to predict three classes of 5-year citation count intervals (0–5, 6–14 and, >14 citations); and the translational impact of a publication. Models were evaluated out-of-sample with area under the receiver operating characteristics curve (AUC).ResultsThe top performing gradient boosting model predicted correct citation count class with an out-of-sample AUC of 0.81. Bibliometric data such as page count, number of references, first and last author citation counts and h-index were among the most important predictors. Prediction of translational impact worked optimally when including both bibliometric data and information from the title, abstract and keywords, reaching an out-of-sample AUC of 0.71 for the top performing random forest model.ConclusionCitation counts are best predicted by bibliometric data, while models incorporating both bibliometric data and publication content identifies the translational impact of headache research.","PeriodicalId":10075,"journal":{"name":"Cephalalgia","volume":"84 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cephalalgia","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/03331024241251488","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

BackgroundWe aimed to develop the first machine learning models to predict citation counts and the translational impact, defined as inclusion in guidelines or policy documents, of headache research, and assess which factors are most predictive.MethodsBibliometric data and the titles, abstracts, and keywords from 8600 publications in three headache-oriented journals from their inception to 31 December 2017 were used. A series of machine learning models were implemented to predict three classes of 5-year citation count intervals (0–5, 6–14 and, >14 citations); and the translational impact of a publication. Models were evaluated out-of-sample with area under the receiver operating characteristics curve (AUC).ResultsThe top performing gradient boosting model predicted correct citation count class with an out-of-sample AUC of 0.81. Bibliometric data such as page count, number of references, first and last author citation counts and h-index were among the most important predictors. Prediction of translational impact worked optimally when including both bibliometric data and information from the title, abstract and keywords, reaching an out-of-sample AUC of 0.71 for the top performing random forest model.ConclusionCitation counts are best predicted by bibliometric data, while models incorporating both bibliometric data and publication content identifies the translational impact of headache research.
是什么预测了头痛研究的引用次数和转化影响?机器学习分析
背景我们旨在开发首个机器学习模型,以预测头痛研究的引用次数和转化影响(定义为纳入指南或政策文件),并评估哪些因素最具预测性。方法我们使用了三种以头痛为导向的期刊自创刊至2017年12月31日期间8600篇出版物的文献计量数据、标题、摘要和关键词。采用了一系列机器学习模型来预测5年被引次数间隔的三个等级(0-5次、6-14次和>14次);以及出版物的转化影响力。结果表现最好的梯度提升模型预测了正确的引用次数等级,样本外AUC为0.81。文献计量数据(如页数、参考文献数、第一作者和最后作者的引用次数以及 h 指数)是最重要的预测因素。当同时包含文献计量学数据和来自标题、摘要和关键词的信息时,对转化影响的预测效果最佳,表现最好的随机森林模型的样本外 AUC 为 0.71。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cephalalgia
Cephalalgia 医学-临床神经学
CiteScore
10.10
自引率
6.10%
发文量
108
审稿时长
4-8 weeks
期刊介绍: Cephalalgia contains original peer reviewed papers on all aspects of headache. The journal provides an international forum for original research papers, review articles and short communications. Published monthly on behalf of the International Headache Society, Cephalalgia''s rapid review averages 5 ½ weeks from author submission to first decision.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信