Early identification of breakthrough research from sleeping beauties using machine learning

IF 3.4 2区 管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Xin Li, Xiaodi Ma, Ye Feng
{"title":"Early identification of breakthrough research from sleeping beauties using machine learning","authors":"Xin Li,&nbsp;Xiaodi Ma,&nbsp;Ye Feng","doi":"10.1016/j.joi.2024.101517","DOIUrl":null,"url":null,"abstract":"<div><p>Breakthrough research is groundbreaking and transformative scientific research that can lead to new frontiers and even trigger substantial changes in the scientific paradigm. Early identification of breakthrough research is crucial for scientists, R&amp;D experts, and policymakers. \"Sleeping Beauty in Science\" is a category of papers characterized as \"delayed recognition\", which is considered as the crucial carriers of breakthrough research. Machine learning methods can extract and learn high-quality information from a priori knowledge to predict future trends. In this paper, to address the shortcomings of existing studies on the early identification of breakthrough research, we propose a framework for identifying breakthrough research from sleeping beauties using machine learning. In this framework, we first construct machine learning models to obtain the relationship patterns between historical sleeping beauties and their citation trends. Then, we use these relational patterns to identify potential sleeping beauties. Secondly, we construct a breakthrough index based on the essential features of breakthrough research, then we apply it to identify breakthrough research among potential sleeping beauties, enabling the early identification of breakthrough research. Finally, an empirical study is conducted in the chemistry research field to verify the validity and flexibility of this framework. The results show that the framework can effectively identify breakthrough research from sleeping beauties. This paper contributes to the early identification of breakthrough research, evaluating academic results, and exploring research frontiers. Additionally, it will also provide methodological support for the decision-making of R&amp;D experts and policymakers.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":"18 2","pages":"Article 101517"},"PeriodicalIF":3.4000,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Informetrics","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1751157724000300","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Breakthrough research is groundbreaking and transformative scientific research that can lead to new frontiers and even trigger substantial changes in the scientific paradigm. Early identification of breakthrough research is crucial for scientists, R&D experts, and policymakers. "Sleeping Beauty in Science" is a category of papers characterized as "delayed recognition", which is considered as the crucial carriers of breakthrough research. Machine learning methods can extract and learn high-quality information from a priori knowledge to predict future trends. In this paper, to address the shortcomings of existing studies on the early identification of breakthrough research, we propose a framework for identifying breakthrough research from sleeping beauties using machine learning. In this framework, we first construct machine learning models to obtain the relationship patterns between historical sleeping beauties and their citation trends. Then, we use these relational patterns to identify potential sleeping beauties. Secondly, we construct a breakthrough index based on the essential features of breakthrough research, then we apply it to identify breakthrough research among potential sleeping beauties, enabling the early identification of breakthrough research. Finally, an empirical study is conducted in the chemistry research field to verify the validity and flexibility of this framework. The results show that the framework can effectively identify breakthrough research from sleeping beauties. This paper contributes to the early identification of breakthrough research, evaluating academic results, and exploring research frontiers. Additionally, it will also provide methodological support for the decision-making of R&D experts and policymakers.

利用机器学习从睡美人中及早发现突破性研究成果
突破性研究是具有开创性和变革性的科学研究,可以开辟新的前沿领域,甚至引发科学范式的重大变革。及早发现突破性研究对于科学家、研发专家和政策制定者来说至关重要。"科学睡美人 "是一类以 "延迟识别 "为特征的论文,被认为是突破性研究的重要载体。机器学习方法可以从先验知识中提取和学习高质量信息,从而预测未来趋势。本文针对现有研究在早期识别突破性研究方面存在的不足,提出了一种利用机器学习从睡美人中识别突破性研究的框架。在这个框架中,我们首先构建机器学习模型,以获取历史上的睡美人与其引用趋势之间的关系模式。然后,我们利用这些关系模式来识别潜在的 "睡美人"。其次,我们根据突破性研究的基本特征构建了突破性指数,并将其应用于识别潜在 "睡美人 "中的突破性研究,从而实现对突破性研究的早期识别。最后,我们在化学研究领域开展了实证研究,以验证该框架的有效性和灵活性。结果表明,该框架能有效地从 "睡美人 "中识别出突破性研究。本文有助于早期识别突破性研究、评估学术成果和探索研究前沿。此外,它还将为研发专家和决策者的决策提供方法论支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Informetrics
Journal of Informetrics Social Sciences-Library and Information Sciences
CiteScore
6.40
自引率
16.20%
发文量
95
期刊介绍: Journal of Informetrics (JOI) publishes rigorous high-quality research on quantitative aspects of information science. The main focus of the journal is on topics in bibliometrics, scientometrics, webometrics, patentometrics, altmetrics and research evaluation. Contributions studying informetric problems using methods from other quantitative fields, such as mathematics, statistics, computer science, economics and econometrics, and network science, are especially encouraged. JOI publishes both theoretical and empirical work. In general, case studies, for instance a bibliometric analysis focusing on a specific research field or a specific country, are not considered suitable for publication in JOI, unless they contain innovative methodological elements.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信