{"title":"Early identification of breakthrough research from sleeping beauties using machine learning","authors":"Xin Li, Xiaodi Ma, Ye Feng","doi":"10.1016/j.joi.2024.101517","DOIUrl":null,"url":null,"abstract":"<div><p>Breakthrough research is groundbreaking and transformative scientific research that can lead to new frontiers and even trigger substantial changes in the scientific paradigm. Early identification of breakthrough research is crucial for scientists, R&D experts, and policymakers. \"Sleeping Beauty in Science\" is a category of papers characterized as \"delayed recognition\", which is considered as the crucial carriers of breakthrough research. Machine learning methods can extract and learn high-quality information from a priori knowledge to predict future trends. In this paper, to address the shortcomings of existing studies on the early identification of breakthrough research, we propose a framework for identifying breakthrough research from sleeping beauties using machine learning. In this framework, we first construct machine learning models to obtain the relationship patterns between historical sleeping beauties and their citation trends. Then, we use these relational patterns to identify potential sleeping beauties. Secondly, we construct a breakthrough index based on the essential features of breakthrough research, then we apply it to identify breakthrough research among potential sleeping beauties, enabling the early identification of breakthrough research. Finally, an empirical study is conducted in the chemistry research field to verify the validity and flexibility of this framework. The results show that the framework can effectively identify breakthrough research from sleeping beauties. This paper contributes to the early identification of breakthrough research, evaluating academic results, and exploring research frontiers. Additionally, it will also provide methodological support for the decision-making of R&D experts and policymakers.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":"18 2","pages":"Article 101517"},"PeriodicalIF":3.4000,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Informetrics","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1751157724000300","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Breakthrough research is groundbreaking and transformative scientific research that can lead to new frontiers and even trigger substantial changes in the scientific paradigm. Early identification of breakthrough research is crucial for scientists, R&D experts, and policymakers. "Sleeping Beauty in Science" is a category of papers characterized as "delayed recognition", which is considered as the crucial carriers of breakthrough research. Machine learning methods can extract and learn high-quality information from a priori knowledge to predict future trends. In this paper, to address the shortcomings of existing studies on the early identification of breakthrough research, we propose a framework for identifying breakthrough research from sleeping beauties using machine learning. In this framework, we first construct machine learning models to obtain the relationship patterns between historical sleeping beauties and their citation trends. Then, we use these relational patterns to identify potential sleeping beauties. Secondly, we construct a breakthrough index based on the essential features of breakthrough research, then we apply it to identify breakthrough research among potential sleeping beauties, enabling the early identification of breakthrough research. Finally, an empirical study is conducted in the chemistry research field to verify the validity and flexibility of this framework. The results show that the framework can effectively identify breakthrough research from sleeping beauties. This paper contributes to the early identification of breakthrough research, evaluating academic results, and exploring research frontiers. Additionally, it will also provide methodological support for the decision-making of R&D experts and policymakers.
期刊介绍:
Journal of Informetrics (JOI) publishes rigorous high-quality research on quantitative aspects of information science. The main focus of the journal is on topics in bibliometrics, scientometrics, webometrics, patentometrics, altmetrics and research evaluation. Contributions studying informetric problems using methods from other quantitative fields, such as mathematics, statistics, computer science, economics and econometrics, and network science, are especially encouraged. JOI publishes both theoretical and empirical work. In general, case studies, for instance a bibliometric analysis focusing on a specific research field or a specific country, are not considered suitable for publication in JOI, unless they contain innovative methodological elements.